Authorizations
Headers
API version header. Must be set to the API version, e.g. '2024-06-10'.
Allowed values: 2024-06-10, 2024-11-13, 2025-04-16
Example: "2024-11-13"
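As a minimal sketch of setting the version header, the helper below checks a version string against the values listed above before building the header dictionary. The header name `X-API-Version` is a placeholder assumption, since this section does not state the actual header name; substitute the name your API reference gives.

```python
# Versions listed in this section as accepted values.
ALLOWED_VERSIONS = ("2024-06-10", "2024-11-13", "2025-04-16")


def build_version_header(version: str, header_name: str = "X-API-Version") -> dict:
    """Return a one-entry header dict after validating the version string.

    `header_name` is a hypothetical placeholder, not a name taken from the docs.
    """
    if version not in ALLOWED_VERSIONS:
        raise ValueError(
            f"unsupported API version {version!r}; expected one of {ALLOWED_VERSIONS}"
        )
    return {header_name: version}


headers = build_version_header("2024-11-13")
```

Validating on the client side surfaces a mistyped date before the request is sent, rather than relying on the server's error response.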
Body
- TTSRequestIdSpecifier
- TTSRequestEmbeddingSpecifier
The language that the given voice should speak the transcript in.
Options: English (en), French (fr), German (de), Spanish (es), Portuguese (pt), Chinese (zh), Japanese (ja), Hindi (hi), Italian (it), Korean (ko), Dutch (nl), Polish (pl), Russian (ru), Swedish (sv), Turkish (tr).
Allowed values: en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr
The maximum duration of the audio in seconds. You do not usually need to specify this. If the duration is too short for the length of the transcript, the output audio may be truncated.
This feature is experimental and may not work for all voices.
Speed setting for the model. Defaults to normal.
Influences the speed of the generated speech. Faster speeds may reduce hallucination rate.
Allowed values: slow, normal, fast
Whether to return word-level timestamps. If false (default), no word timestamps will be produced at all. If true, the server will return timestamp events containing word-level timing information.
Whether to return phoneme-level timestamps. If false (default), no phoneme timestamps will be produced; if add_timestamps is true, the timestamps produced will be word timestamps instead. If true, the server will return timestamp events containing phoneme-level timing information.
Whether to use normalized timestamps (true) or original timestamps (false).
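The interaction between the two timestamp flags can be sketched as a small function that reports which kinds of timestamp events a client should expect. Only `add_timestamps` is named in this section; `add_phoneme_timestamps` is a hypothetical name for the phoneme-level flag, and the sketch assumes the two flags act independently when both are true.

```python
def expected_timestamp_kinds(
    add_timestamps: bool = False,
    add_phoneme_timestamps: bool = False,  # hypothetical flag name
) -> set:
    """Return the timestamp granularities the server is expected to emit."""
    kinds = set()
    if add_timestamps:
        kinds.add("word")  # word-level timing events
    if add_phoneme_timestamps:
        kinds.add("phoneme")  # phoneme-level timing events
    return kinds


# With both flags left at their defaults, no timestamp events are expected.
default_kinds = expected_timestamp_kinds()
```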
Optional context ID for this request.
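To illustrate how the optional body fields fit together, here is a rough sketch that validates the language and speed against the values listed in this section and leaves unset optional fields out of the payload. The field names `language`, `speed`, `duration`, and `context_id` are assumptions made for illustration; this section does not give the actual field names.

```python
from typing import Any, Optional

# Language codes and speed settings as listed in this section.
LANGUAGES = {"en", "fr", "de", "es", "pt", "zh", "ja", "hi",
             "it", "ko", "nl", "pl", "ru", "sv", "tr"}
SPEEDS = {"slow", "normal", "fast"}


def build_body_options(
    language: str,
    speed: str = "normal",
    duration: Optional[float] = None,   # maximum audio duration in seconds
    context_id: Optional[str] = None,
) -> dict:
    """Assemble a partial request body; all keys are hypothetical names."""
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    if speed not in SPEEDS:
        raise ValueError(f"speed must be one of {sorted(SPEEDS)}")
    body: dict = {"language": language, "speed": speed}
    # Optional fields are omitted entirely when not provided.
    if duration is not None:
        if duration <= 0:
            raise ValueError("duration must be positive")
        body["duration"] = duration
    if context_id is not None:
        body["context_id"] = context_id
    return body


options = build_body_options("fr", speed="fast", context_id="demo-1")
```

Omitting unset fields, rather than sending nulls, lets the server apply its own defaults, which matches the note that the maximum duration does not usually need to be specified.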