The quickest way to get started with Ink is by using an integration. If your preferred integration isn’t listed and you already use another speech-to-text model, the migration guides explain how to wire Ink into an existing codebase. See the references section for complete documentation of all API endpoints and best practices.

Integrations

Comparing Ink against other models

Ink supports two realtime transcription modes:
  1. Client sends audio (Auto finalization)
  2. Client sends audio and signals when to finalize transcripts (Manual finalization)
Most speech-to-text APIs combine both behaviors, but Cartesia separates them to improve model performance. Auto finalization is recommended for most agents. Even so, there are many use cases where manual finalization is necessary, for example:
  1. Push-to-talk apps
  2. Pipelines where you know speech is over and are waiting for the transcript
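
From the client's side, the difference between the two modes can be sketched as follows. This is a minimal illustration under stated assumptions, not the Cartesia SDK or wire format: `FinalizationMode`, `OutgoingMessages`, `send_finalize`, and `stream_utterance` are hypothetical names, and the recorder object stands in for a real connection.

```python
from dataclasses import dataclass, field
from enum import Enum


class FinalizationMode(Enum):
    AUTO = "auto"      # client only sends audio
    MANUAL = "manual"  # client also signals when to finalize


@dataclass
class OutgoingMessages:
    """Records what a client would send; a stand-in for a real connection."""
    sent: list = field(default_factory=list)

    def send_audio(self, chunk: bytes) -> None:
        self.sent.append(("audio", chunk))

    def send_finalize(self) -> None:
        # Hypothetical control message; the actual format is an assumption.
        self.sent.append(("finalize", b""))


def stream_utterance(chunks, mode: FinalizationMode, out: OutgoingMessages):
    """Send every audio chunk, then signal finalization only in manual mode."""
    for chunk in chunks:
        out.send_audio(chunk)
    if mode is FinalizationMode.MANUAL:
        # e.g. the user released the push-to-talk button
        out.send_finalize()
    return out.sent
```

In a push-to-talk app, the button release is the natural point to call the finalize step, since the client already knows that speech is over.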

Migration guides

References

Code examples

Simple scripts you can run yourself

Client Libraries

The official Cartesia SDKs

AI tools

Skills and docs for your coding agents