- Client sends audio (Auto finalization)
- Client sends audio and signals when to finalize transcripts (Manual finalization)
- Push-to-talk apps
- Pipelines where you know speech is over and are waiting for the transcript
Guides
- Deepgram Turn-based Audio (Flux)
- Deepgram Live Audio (Nova)
- Auto finalization
Ink automatically finalizes transcripts - Manual finalization
Your client decides when to finalize transcripts
- Auto finalization
- ElevenLabs Realtime Speech to Text
- Auto finalization
Similar to ElevenLabs’scommit_strategy=vad - Manual finalization
Similar to ElevenLabs’scommit_strategy=manual
- Auto finalization
- OpenAI Realtime Transcription
- Auto finalization
Similar to OpenAI’sturn_detection: server_vad - Manual finalization
Similar to OpenAI’sturn_detection: null
- Auto finalization
- OpenAI Speech to Text
Batch audio transcription