sonic-3
Sonic-3 is our most expressive streaming TTS model, combining emotional delivery with transcript
accuracy across multiple languages. It provides fine-grained control over volume, speed, and
emotion through API parameters and SSML tags.
Key Features:
- 42 languages supported
- Volume and speed controls per generation, through API parameters and SSML tags
- Natural intonation that understands and matches the meaning of the text
For example, set generation_config.speed=1.2 to speed up speech by 20%, or prepend
<emotion value='excited'/> to a transcript to generate it in an excited tone.
For more information, see Volume, Speed, and Emotion.
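As a rough illustration of the two controls above, the following Python sketch assembles a request body as a plain dictionary. Only generation_config.speed and the emotion tag come from this page; the helper name build_tts_request, the model_id and transcript field names, and the space inserted after the tag are assumptions made for the example, so check the API reference for the actual request shape.

```python
import json


def build_tts_request(transcript, speed=None, emotion=None, model_id="sonic-3"):
    """Build a request body as a plain dict; nothing is sent over the network.

    Hypothetical helper: the "model_id" and "transcript" keys are assumed
    field names, not confirmed by this page.
    """
    if emotion is not None:
        # Prepend the emotion tag described above to the transcript.
        transcript = f"<emotion value='{emotion}'/> {transcript}"
    body = {"model_id": model_id, "transcript": transcript}
    if speed is not None:
        # generation_config.speed is named on this page; 1.2 means 20% faster.
        body["generation_config"] = {"speed": speed}
    return body


request_body = build_tts_request("Welcome back!", speed=1.2, emotion="excited")
print(json.dumps(request_body, indent=2))
```

Leaving speed and emotion unset produces a body with no generation_config entry and an unmodified transcript, which keeps the defaults in the hands of the API.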
Language support
Sonic-3 supports the following languages:
| English (en) | French (fr) | German (de) | Spanish (es) |
| Portuguese (pt) | Chinese (zh) | Japanese (ja) | Hindi (hi) |
| Italian (it) | Korean (ko) | Dutch (nl) | Polish (pl) |
| Russian (ru) | Swedish (sv) | Turkish (tr) | Tagalog (tl) |
| Bulgarian (bg) | Romanian (ro) | Arabic (ar) | Czech (cs) |
| Greek (el) | Finnish (fi) | Croatian (hr) | Malay (ms) |
| Slovak (sk) | Danish (da) | Tamil (ta) | Ukrainian (uk) |
| Hungarian (hu) | Norwegian (no) | Vietnamese (vi) | Bengali (bn) |
| Thai (th) | Hebrew (he) | Georgian (ka) | Indonesian (id) |
| Telugu (te) | Gujarati (gu) | Kannada (kn) | Malayalam (ml) |
| Marathi (mr) | Punjabi (pa) |
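A client may want to check a language code against this list before sending a request. The Python sketch below copies the 42 codes from the list above into a set; the names SONIC_3_LANGUAGES and is_supported_language are hypothetical, and the case-insensitive matching is a convenience of the example rather than documented API behavior.

```python
# Language codes copied from the list above (42 in total).
SONIC_3_LANGUAGES = frozenset({
    "en", "fr", "de", "es", "pt", "zh", "ja", "hi",
    "it", "ko", "nl", "pl", "ru", "sv", "tr", "tl",
    "bg", "ro", "ar", "cs", "el", "fi", "hr", "ms",
    "sk", "da", "ta", "uk", "hu", "no", "vi", "bn",
    "th", "he", "ka", "id", "te", "gu", "kn", "ml",
    "mr", "pa",
})


def is_supported_language(code):
    """Return True if the code appears in the Sonic-3 language list above."""
    # Normalizing case and whitespace is an assumption of this sketch.
    return code.strip().lower() in SONIC_3_LANGUAGES


print(is_supported_language("ja"))
print(is_supported_language("xx"))
```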
Selecting a Model
| Snapshot | Release Date | Languages | Status |
|---|---|---|---|
| sonic-3-2025-10-27 | October 27, 2025 | en, de, es, fr, ja, pt, zh, hi, ko, it, nl, pl, ru, sv, tr, tl, bg, ro, ar, cs, el, fi, hr, ms, sk, da, ta, uk, hu, no, vi, bn, th, he, ka, id, te, gu, kn, ml, mr, pa | Stable |
Continuous updates and model snapshots
All models have a base model name (e.g., sonic-3, sonic-turbo).
We recommend using base model names for prototyping and development, then switching to a date-versioned snapshot (e.g., sonic-3-2025-10-27) for production use cases to ensure stability.
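One way to apply this recommendation is to choose the model ID from the deployment environment. The Python sketch below is a minimal illustration: the model names come from this page, while pick_model, is_date_versioned, the "production" environment label, and the assumption that snapshot names always end in a YYYY-MM-DD date are hypothetical.

```python
import re

BASE_MODEL = "sonic-3"                     # base model name from this page
PRODUCTION_SNAPSHOT = "sonic-3-2025-10-27"  # snapshot from the table above

# Assumed naming pattern: a base name followed by -YYYY-MM-DD.
SNAPSHOT_PATTERN = re.compile(r"[a-z0-9-]+-\d{4}-\d{2}-\d{2}")


def is_date_versioned(model_id):
    """Return True if the model ID ends in a date suffix such as -2025-10-27."""
    return SNAPSHOT_PATTERN.fullmatch(model_id) is not None


def pick_model(environment):
    """Use the pinned snapshot in production and the base name elsewhere."""
    return PRODUCTION_SNAPSHOT if environment == "production" else BASE_MODEL


print(pick_model("development"))
print(pick_model("production"))
```

A check like is_date_versioned can run at startup in production so that a base model name does not slip into a deployment by accident.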
Older Models
For information on sonic-2, sonic-turbo, sonic-multilingual, and sonic, see our page on
Older Models.