Models | Cartesia

Cartesia provides a family of state-of-the-art models, including our highly-accurate, low-latency Sonic TTS model family.

● the latest stable snapshot of the model

To use the stable version of the model, we recommend using the base model name (e.g. sonic-2). In many cases the stable and preview snapshots are the same, but in some cases the preview snapshot may have additional features or improvements.

`sonic-2`

Sonic-2 is our most capable text to speech model. It provides ultra-realistic speech with accurate transcript following, minimal hallucinations, and best in class voice cloning. It’s latency optimized and achieves 90ms model latency.

Additional Capabilities:

Higher fidelity voice cloning
Timestamps for all 15 languages
Infill support

Snapshot	Release Date	Languages	Status
● `sonic-2-2025-05-08`	May 8, 2025	en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr	Stable
`sonic-2-2025-04-16`	April 16, 2025	en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr	Stable
`sonic-2-2025-03-07`	March 7, 2025	en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr	Stable

Note: For versions after sonic-2-2025-03-07, _experimental_controls is not supported. If the latest model does not meet your needs, and you would like to use the controls, please make requests to sonic-2-2025-03-07.

To learn how to use the Sonic TTS family, see Make an API request.

`sonic-turbo`

All the power of Sonic, with half the latency (as low as 40ms).

Snapshot	Release Date	Languages	Status
● `sonic-turbo-2025-03-07`	March 7, 2025	en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr	Stable

`sonic`

The first version of our flagship text-to-speech model. It produces high-accuracy, expressive speech, and is optimized for efficiency to achieve low latency.

Snapshot	Release Date	Languages	Status
● `sonic-2024-12-12`	December 12, 2024	en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr	Stable
`sonic-2024-10-19`	October 19, 2024	en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr

Selecting a Model

When making API calls, you can specify either:

1 // Use the base model (automatically routes to the latest snapshot)
2 const model = "sonic-2";
3 
4 // Or specify a particular snapshot for consistency
5 const model = "sonic-2-2025-03-07";

Continuous updates

All models have a base model name (e.g. sonic-2, sonic-turbo, sonic). We recommend using these for prototyping and development, then switching to a date-versioned model for production use cases to ensure stability.

Language Support

English (en)
French (fr)
German (de)
Spanish (es)
Portuguese (pt)
Chinese (zh)
Japanese (ja)
Hindi (hi)
Italian (it)
Korean (ko)
Dutch (nl)
Polish (pl)
Russian (ru)
Swedish (sv)
Turkish (tr)

Future Updates

New snapshots are released periodically with improvements in performance, additional language support, and new capabilities. Check back regularly for updates.

Deprecated and Preview Model Aliases

The following model aliases are now deprecated. Please use the recommended model names instead:

Deprecated Alias	Use Instead
`sonic-preview`	`sonic-2`
`sonic-english`	`sonic-2024-10-19`
`sonic-multilingual`	`sonic-2024-10-19`

We recommend updating your integrations to use the current model names for the best performance and support.