Learn about the Sonic variants available on Cartesia

Sonic English

Sonic English is our latest English text-to-speech model. It has strong overall capabilities and is optimized for low latency.

Model ID: sonic-english

Release date: May 2024

Last updated: Aug 2024

Supported languages: English

Capabilities:

  • Supports abbreviations, acronyms, initialisms and phonemes (alpha).
  • Supports numbers, dates, phone numbers and SSNs.

Known issues:

  • Audio generations can loop or diverge on transcripts that have repeated words in succession.
  • Audio generations may occasionally sound fast.
  • Some long numbers and phone numbers may sound rushed as well.
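Because repeated words in succession can make a generation loop or diverge, it may help to check a transcript before sending it. The following is a minimal sketch of such a check; `find_repeated_words` is a hypothetical helper written for this page, not part of any Cartesia SDK, and the tokenization rule (letters, digits, apostrophes) is an assumption.

```python
import re

def find_repeated_words(transcript: str) -> list[str]:
    """Return words that occur twice or more in immediate succession.

    Hypothetical pre-check helper; comparison is case-insensitive and
    each repeated word is reported once, in order of first occurrence.
    """
    # Assumed tokenization: runs of letters, digits, and apostrophes.
    words = re.findall(r"[A-Za-z0-9']+", transcript.lower())
    repeats: list[str] = []
    for prev, cur in zip(words, words[1:]):
        if prev == cur and cur not in repeats:
            repeats.append(cur)
    return repeats

if __name__ == "__main__":
    print(find_repeated_words("No no no, that is is wrong"))
```

A non-empty result suggests rephrasing the transcript or reviewing the generated audio more closely.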

Sonic Multilingual [Alpha]

Sonic Multilingual is the first multilingual Sonic variant. It follows transcripts closely and has low latency.

Model ID: sonic-multilingual

Release date: Jun 2024

Last updated: Sep 2024

Supported languages:

  1. English (en)
  2. French (fr)
  3. German (de)
  4. Spanish (es)
  5. Portuguese (pt)
  6. Chinese (zh)
  7. Japanese (ja)
  8. Hindi (hi)
  9. Italian (it)
  10. Korean (ko)
  11. Dutch (nl)
  12. Polish (pl)
  13. Russian (ru)
  14. Swedish (sv)
  15. Turkish (tr)
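The model IDs and language codes listed on this page can be captured in a small lookup table, for example to choose a model for a given transcript language. This is an illustrative sketch only: `SONIC_MODELS` and `models_for_language` are hypothetical names, and the table simply mirrors the lists above rather than querying any API.

```python
# Model IDs and language codes copied from the lists on this page.
SONIC_MODELS = {
    "sonic-english": {"en"},
    "sonic-multilingual": {
        "en", "fr", "de", "es", "pt", "zh", "ja", "hi",
        "it", "ko", "nl", "pl", "ru", "sv", "tr",
    },
}

def models_for_language(code: str) -> list[str]:
    """Return the model IDs whose supported languages include `code`."""
    code = code.strip().lower()
    return [model_id for model_id, langs in SONIC_MODELS.items() if code in langs]

if __name__ == "__main__":
    print(models_for_language("en"))
    print(models_for_language("ja"))
```

An empty list means that neither model on this page lists the language as supported.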

Capabilities:

  • Supports numbers, dates, and phone numbers in English, French, German, Spanish, and Chinese.

Known issues:

  • Some inaccuracies in numbers, dates, and phone numbers in Japanese and Portuguese.
  • Audio generations may occasionally sound fast.

Recommendations:

  • For the best performance, use a voice in the same language as the transcript.
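
One way to apply the recommendation above is to filter candidate voices by language before generating audio. The sketch below assumes each voice is described by a name and a language code; the `Voice` record and its field names are hypothetical and do not reflect an actual API schema.

```python
from dataclasses import dataclass

@dataclass
class Voice:
    # Hypothetical record; field names are illustrative only.
    name: str
    language: str  # language code such as "en" or "fr"

def voices_for_transcript(voices: list[Voice], transcript_language: str) -> list[Voice]:
    """Keep only the voices whose language matches the transcript language."""
    lang = transcript_language.strip().lower()
    return [v for v in voices if v.language.lower() == lang]

if __name__ == "__main__":
    catalog = [Voice("Example A", "en"), Voice("Example B", "fr")]
    print([v.name for v in voices_for_transcript(catalog, "fr")])
```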