Models
Learn about the Sonic variants available on Cartesia
Sonic English
Sonic English is our latest English text-to-speech model. It demonstrates strong overall capabilities and is optimized for efficiency to achieve low latency.
Model ID: sonic-english
Release date: May 2024
Last updated: Aug 2024
Supported languages: English
Capabilities:
- Supports abbreviations, acronyms, initialisms and phonemes (alpha).
- Supports numbers, dates, phone numbers and SSNs.
Known issues:
- Audio generations can loop or diverge on transcripts that have repeated words in succession.
- Audio generations may occasionally sound fast.
- Some long numbers and phone numbers may sound rushed as well.
Sonic Multilingual [Alpha]
Sonic Multilingual is the first multilingual Sonic variant, demonstrating great transcript following and low latency.
Model ID: sonic-multilingual
Release date: Jun 2024
Last updated: September 2024
Supported languages:
- English (
en
) - French (
fr
) - German (
de
) - Spanish (
es
) - Portuguese (
pt
) - Chinese (
zh
) - Japanese (
ja
) - Hindi (
hi
) - Italian (
it
) - Korean (
ko
) - Dutch (
nl
) - Polish (
pl
) - Russian (
ru
) - Swedish (
sv
) - Turkish (
tr
)
Capabilities:
- Supports numbers, dates, phone numbers in English, French, German, Spanish, and Chinese
Known issues:
- Some inaccuracies in numbers, dates, and phone numbers in Japanese and Portuguese.
- Audio generations may occasionally sound fast.
Recommendations:
- It is recommended to use voices from the same language as the transcript for the best performance.