Documentation Index
Fetch the complete documentation index at: https://docs.cartesia.ai/llms.txt
Use this file to discover all available pages before exploring further.
API version header.
2024-06-10, 2024-11-13, 2025-04-16, 2026-03-01 "2024-06-10"
The language that the given voice should speak the transcript in.
Options: English (en), French (fr), German (de), Spanish (es), Portuguese (pt), Chinese (zh), Japanese (ja), Hindi (hi), Italian (it), Korean (ko), Dutch (nl), Polish (pl), Russian (ru), Swedish (sv), Turkish (tr).
en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr The maximum duration of the audio in seconds. You do not usually need to specify this. If the duration is not appropriate for the length of the transcript, the output audio may be truncated.
This feature is experimental and may not work for all voices.
Speed setting for the model. Defaults to normal.
Influences the speed of the generated speech. Faster speeds may reduce hallucination rate.
slow, normal, fast Audio bytes
The response is of type file.