feat: audio domain tagging for benchmark datasets #25
Context: Audio benchmark datasets mix wildly different recording conditions: acted studio speech, naturalistic conversation, broadcast panel shows, call centre audio. Lumping them into a single pool hides per-domain failure modes. For example, testing against British comedy panel show audio ("As Yet Untitled") showed SER models predicting "neutral" across the board on naturalistic speech with non-North-American (non-NA) accents.
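To make the pooling problem concrete, here is a minimal sketch that compares pooled accuracy with a per-domain breakdown. The function name `accuracy_by_domain` and the toy records are assumptions for illustration only; the numbers are made up and are not measurements from this evaluation.

```python
from collections import defaultdict


def accuracy_by_domain(records):
    """Return (pooled_accuracy, {domain: accuracy}).

    `records` is an iterable of (audio_domain, true_label, predicted_label)
    triples. This layout is a hypothetical one chosen for the sketch.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for domain, label, prediction in records:
        totals[domain] += 1
        correct[domain] += int(label == prediction)
    if not totals:
        raise ValueError("no records to score")
    pooled = sum(correct.values()) / sum(totals.values())
    per_domain = {d: correct[d] / totals[d] for d in totals}
    return pooled, per_domain


# Toy data (illustrative only): the model is right on every acted clip
# and answers "neutral" on every naturalistic British English clip.
toy_records = (
    [("acted_na", "happy", "happy")] * 8
    + [("naturalistic_en_gb", "happy", "neutral")] * 2
)

pooled, per_domain = accuracy_by_domain(toy_records)
print(pooled)      # pooled score looks acceptable
print(per_domain)  # the naturalistic domain fails completely
```

In this toy case the pooled score stays high because the acted clips dominate the pool, while the per-domain view shows the failure on the smaller naturalistic subset.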
Scope:
- audio_domain string field
- acted_na, acted_eu, naturalistic_en_gb, naturalistic_en_us, broadcast, call_centre, phone_degraded
- audio_domain field
Out of scope: Automatic domain prediction (see separate issue for lightweight domain classifier).
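One way the audio_domain string field and its allowed values could be represented is sketched below. The names `AudioDomain` and `DatasetEntry` are hypothetical and not taken from the avocet codebase; only the seven tag strings come from this issue.

```python
from dataclasses import dataclass
from enum import Enum


class AudioDomain(str, Enum):
    """Allowed audio_domain tags, as listed in the scope of this issue."""

    ACTED_NA = "acted_na"
    ACTED_EU = "acted_eu"
    NATURALISTIC_EN_GB = "naturalistic_en_gb"
    NATURALISTIC_EN_US = "naturalistic_en_us"
    BROADCAST = "broadcast"
    CALL_CENTRE = "call_centre"
    PHONE_DEGRADED = "phone_degraded"


@dataclass
class DatasetEntry:
    """Hypothetical benchmark dataset record carrying a domain tag."""

    name: str
    audio_domain: AudioDomain

    def __post_init__(self):
        # Accept a raw string and normalise it to the enum.
        # Enum lookup by value raises ValueError for an unknown tag.
        self.audio_domain = AudioDomain(self.audio_domain)


entry = DatasetEntry(name="toy_panel_show", audio_domain="broadcast")
print(entry.audio_domain.value)
```

Because the enum subclasses `str`, a tag still compares equal to its plain string form, so existing code that treats the field as a string would keep working under this sketch. Automatic prediction of the tag is deliberately left out, in line with the out-of-scope note.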
Acceptance criteria:
- audio_domain string field
Related:
circuitforge-plans/avocet/ - audio model evaluation extension; see also cf-voice/Linnet SER evaluation work