eval: NVIDIA Sortformer + Parakeet streaming pair for Osprey phone-call use case #8
Labels
No labels
a11y
acoustic
backlog
bug
cf-core-dep
diarization
enhancement
inference
privacy
stt
testing
tier:paid
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: Circuit-Forge/cf-voice#8
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Sources:
What they are
NVIDIA designed these as a complementary streaming pair:
Neither model is useful standalone for cf-voice's full pipeline, but together they cover streaming diarization + transcription.
Performance
Why Osprey specifically
Osprey handles government hold-line phone calls:
Blockers before adopting
cf-voice backend comparison context
See cf-voice#5 (cohere-transcribe-diarize) and cf-voice#6 (ARK-ASR-0.6B) for the other candidates. This pair targets a different niche: streaming real-time phone calls rather than offline lecture/meeting transcription.