sonic-3 supports custom pronunciation dictionaries, which give you an easier and more sustainable way to specify how a particular word or phrase should be pronounced.

At its core, a dictionary is a simple search and replace: it directs the model to use another string in place of the matching text in the transcript. The pronunciation can be either an IPA pronunciation or a “sounds-like” guidance:
pronunciation_dict_id.

With the above dictionary, the string “I ate some jambalaya on tchoupitoulas street” would become “I ate some <<ˈ|dʒ|ə|m|ˈ|b|ə|ˈ|l|aɪ|ˈ|ə>> on chop-uh-TOO-liss street” before being handed off to the model, which would then pronounce it more accurately.

Case Sensitivity
Dictionary matching is case-sensitive, with one exception: a lowercase entry also matches its sentence-start capitalized form. For example, cat matches both cat and Cat, but not CAT. An entry for CAT only matches CAT.

This applies to multi-word entries too. An entry for green valley matches green valley and Green valley, but not Green Valley.

- Use lowercase entries for common words. These match the word both mid-sentence (cat) and at the start of a sentence (Cat), covering the two most common positions.
- Use exact capitalization for proper nouns. A term like “LaTeX” should be entered as LaTeX so it doesn’t collide with a different pronunciation for the common word latex. For multi-word proper nouns, enter the exact casing as it appears in your transcripts; for example, Green Valley if the transcript capitalizes both words.
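The matching rules above can be approximated locally, which is handy for checking which transcript strings an entry would catch before you upload a dictionary. The following is a minimal sketch, not the service's actual implementation: `build_lookup` and `apply_dictionary` are hypothetical helper names, a "lowercase entry" is assumed to mean an entry with no uppercase characters, the capitalized form is matched wherever it appears (not only at a sentence start), and matching on whole words only is also an assumption.

```python
import re


def build_lookup(entries):
    """Map every text form that should match to its pronunciation."""
    lookup = dict(entries)  # exact, case-sensitive forms always match
    for entry, pronunciation in entries.items():
        # Assumption: an entry counts as "lowercase" if it has no uppercase characters.
        if entry and entry == entry.lower():
            capitalized = entry[0].upper() + entry[1:]
            # setdefault keeps an explicit entry (e.g. "Cat") ahead of the derived form.
            lookup.setdefault(capitalized, pronunciation)
    return lookup


def apply_dictionary(text, entries):
    """Replace each matching word or phrase in text with its pronunciation."""
    lookup = build_lookup(entries)
    if not lookup:
        return text
    # Longest forms first, so multi-word entries win over the words inside them.
    forms = sorted(lookup, key=len, reverse=True)
    alternatives = "|".join(re.escape(form) for form in forms)
    # Assumption: entries match whole words only, so "cat" is not replaced inside "concatenate".
    pattern = re.compile(rf"(?<!\w)(?:{alternatives})(?!\w)")
    return pattern.sub(lambda match: lookup[match.group(0)], text)


entries = {
    "jambalaya": "<<ˈ|dʒ|ə|m|ˈ|b|ə|ˈ|l|aɪ|ˈ|ə>>",
    "tchoupitoulas": "chop-uh-TOO-liss",
}
print(apply_dictionary("I ate some jambalaya on tchoupitoulas street", entries))
```

Building a single combined pattern, rather than running one replacement per entry, means a pronunciation string that has already been substituted is never scanned again for further matches.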