Skip to main content
Cartesia provides a family of state-of-the-art models, including our highly-accurate, low-latency Sonic text-to-speech (TTS) model family.

sonic-3

Sonic-3 is our most expressive streaming TTS model, delivering emotional delivery and transcript accuracy across multiple languages. It provides fine-grained control on volume, speed, and emotion through API parameters and SSML tags. Key Features:
  • 42 languages supported
  • Volume and speed controls per generation, through API parameters and SSML tags
  • Natural intonation that understands and matches the meaning of the text.
For example, you can set generation_config.speed=1.2 to speed up speech by 20%, or prepend <emotion value='excited'/> to a transcript to generate it in an excited tone. For more information, see Volume, Speed, and Emotion.

Language support

Sonic-3 supports the following languages:
English (en)French (fr)German (de)Spanish (es)
Portuguese (pt)Chinese (zh)Japanese (ja)Hindi (hi)
Italian (it)Korean (ko)Dutch (nl)Polish (pl)
Russian (ru)Swedish (sv)Turkish (tr)Tagalog (tl)
Bulgarian (bg)Romanian (ro)Arabic (ar)Czech (cs)
Greek (el)Finnish (fi)Croatian (hr)Malay (ms)
Slovak (sk)Danish (da)Tamil (ta)Ukrainian (uk)
Hungarian (hu)Norwegian (no)Vietnamese (vi)Bengali (bn)
Thai (th)Hebrew (he)Georgian (ka)Indonesian (id)
Telugu (te)Gujarati (gu)Kannada (kn)Malayalam (ml)
Marathi (mr)Punjabi (pa)

Selecting a Model

SnapshotRelease DateLanguagesStatus
sonic-3-2025-10-27October 27, 2025en, de, es, fr, ja, pt, zh, hi, ko, it, nl, pl, ru, sv, tr, tl, bg, ro, ar, cs, el, fi, hr, ms, sk, da, ta, uk, hu, no, vi, bn, th, he, ka, id, te, gu, kn, ml, mr, paStable
When making API calls, you can specify either:
// Use the base model (automatically routes to the latest snapshot)
const modelId = "sonic-3";

// Or specify a particular snapshot for consistency
const modelId = "sonic-3-2025-10-27";

Continuous updates and model snapshots

All models have a base model name (e.g. sonic-3, sonic-turbo). We recommend using these for prototyping and development, then switching to a date-versioned model for production use cases to ensure stability.

Older Models

For information on sonic-2, sonic-turbo, sonic-multilingual, and sonic, see our page on Older Models.