
pronunciation_dict_id.
With the dictionary above, the string "I ate some jambalaya on tchoupitoulas street" becomes "I ate some <<ˈ|dʒ|ə|m|ˈ|b|ə|ˈ|l|aɪ|ˈ|ə>> on chop-uh-TOO-liss street" before being handed off to the model.
Case Sensitivity
Dictionary matching is case-sensitive, with one exception: a lowercase entry also matches its sentence-start capitalized form. For example, cat matches both cat and Cat, but not CAT. An entry for CAT only matches CAT.
This applies to multi-word entries too. An entry for green valley matches green valley and Green valley, but not Green Valley.
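The rule can be modeled in a few lines of Python. This is only an illustrative sketch of the behavior described above, not the service's actual implementation. The function name entry_matches is hypothetical, and the sketch assumes that "lowercase entry" means an entry with no uppercase letters at all.

```python
def entry_matches(entry: str, text: str) -> bool:
    """Model of the case rule described above (hypothetical helper).

    An exact, case-sensitive match always counts. In addition, an
    all-lowercase entry matches the same text with only its first
    character capitalized (the sentence-start form).
    """
    if entry == text:
        return True
    # Assumption: the exception applies only to entries that are entirely lowercase.
    if entry.islower():
        return text == entry[0].upper() + entry[1:]
    return False


# Single-word entries
print(entry_matches("cat", "cat"))   # exact match
print(entry_matches("cat", "Cat"))   # sentence-start form
print(entry_matches("cat", "CAT"))   # not matched
print(entry_matches("CAT", "cat"))   # an uppercase entry matches only itself

# Multi-word entries: only the first character may be capitalized
print(entry_matches("green valley", "Green valley"))
print(entry_matches("green valley", "Green Valley"))
```

Under this model a mixed-case entry such as LaTeX matches only the exact string LaTeX, so it never touches the common word latex.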
Use lowercase entries for common words. These match the word both mid-sentence (cat) and at the start of a sentence (Cat), covering the two most common positions.
Use exact capitalization for proper nouns. A term like LaTeX should be entered as LaTeX so it doesn’t collide with a different pronunciation for the common word latex. For multi-word proper nouns, enter the exact casing as it appears in your transcripts — for example, Green Valley if the transcript capitalizes both words.