Integrations
Comparing Ink against other models
Ink supports two realtime transcription modes:- Client sends audio (Auto finalization)
- Client sends audio and signals when to finalize transcripts (Manual finalization)
- Push-to-talk apps
- Pipelines where you know speech is over and are waiting for the transcript
Migration guides
- Deepgram Turn-based Audio (Flux)
- Deepgram Live Audio (Nova)
- Auto finalization
Ink automatically finalizes transcripts - Manual finalization
Your client decides when to finalize transcripts
- Auto finalization
- ElevenLabs Realtime Speech to Text
- Auto finalization
Similar to ElevenLabs’scommit_strategy=vad - Manual finalization
Similar to ElevenLabs’scommit_strategy=manual
- Auto finalization
- OpenAI Realtime Transcription
- Auto finalization
Similar to OpenAI’sturn_detection: server_vad - Manual finalization
Similar to OpenAI’sturn_detection: null
- Auto finalization
- OpenAI Speech to Text
Batch audio transcription
References
Code examples
Simple scripts you can run yourself
Client Libraries
The official Cartesia SDKs
AI tools
Skills and docs for your coding agents