All models in the Sonic TTS family support custom pronunciations in your transcripts. Try out the pronunciation tool on our demo page.
  • Sonic-3
  • Sonic-turbo and Sonic-2
Sonic-3 supports custom pronunciation dictionaries, which let you specify how a word or set of words should be pronounced more easily and sustainably. At its core, a dictionary is a simple search and replace: it directs the model to use another string in place of the matching text in the transcript. The replacement can be either an IPA pronunciation or “sounds-like” guidance:
[
  {
    "text": "bayou",
    "pronunciation": "<<ˈ|b|ɑ|ˈ|j|u>>"
  },
  {
    "text": "jambalaya",
    "pronunciation": "<<ˈ|dʒ|ə|m|ˈ|b|ə|ˈ|l|aɪ|ˈ|ə>>"
  },
  {
    "text": "tchoupitoulas",
    "pronunciation": "chop-uh-TOO-liss"
  }
]
This JSON can then be saved as a pronunciation dictionary through our API or through our playground. The playground also lets you create and edit dictionaries directly in the UI.
Once a dictionary is created, it can be used in any of the TTS APIs by specifying its ID in pronunciation_dict_id.
With the above dictionary, the string I ate some jambalaya on tchoupitoulas street becomes I ate some <<ˈ|dʒ|ə|m|ˈ|b|ə|ˈ|l|aɪ|ˈ|ə>> on chop-uh-TOO-liss street before being handed off to the model, which can then pronounce it more accurately.
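The search-and-replace step can be approximated locally to preview what a dictionary would do to a transcript. The following is a minimal sketch, not the service's actual implementation: the function name apply_pronunciations is hypothetical, and whole-word, case-sensitive matching is an assumption, since the exact matching rules are not described here.

```python
import json
import re

# Dictionary entries copied from the JSON example above.
DICTIONARY_JSON = """
[
  {"text": "bayou", "pronunciation": "<<ˈ|b|ɑ|ˈ|j|u>>"},
  {"text": "jambalaya", "pronunciation": "<<ˈ|dʒ|ə|m|ˈ|b|ə|ˈ|l|aɪ|ˈ|ə>>"},
  {"text": "tchoupitoulas", "pronunciation": "chop-uh-TOO-liss"}
]
"""


def apply_pronunciations(transcript, entries):
    """Replace each entry's "text" with its "pronunciation" (hypothetical helper).

    Assumption: matches are whole-word and case-sensitive. The real
    service may match differently.
    """
    if not entries:
        return transcript
    lookup = {entry["text"]: entry["pronunciation"] for entry in entries}
    # Try longer texts first so a short entry cannot shadow a longer one.
    ordered = sorted(lookup, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(t) for t in ordered) + r")\b")
    # A single pass over the transcript, so replacements are never re-scanned.
    return pattern.sub(lambda match: lookup[match.group(0)], transcript)


entries = json.loads(DICTIONARY_JSON)
print(apply_pronunciations("I ate some jambalaya on tchoupitoulas street", entries))
```

Doing the substitution in one regex pass keeps an inserted pronunciation from being matched again by another entry, which repeated str.replace calls would not guarantee.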