A short-lived access token to make API requests from a client.
API version header.
2024-06-10, 2024-11-13, 2025-04-16, 2026-03-01
Example: "2026-03-01"
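As a rough sketch of how a client might assemble the two request headers described above, the helper below checks the version string against the values listed on this page before building the header map. The header names `Authorization` (with a `Bearer` prefix) and `Cartesia-Version`, and the function name `build_headers`, are assumptions for illustration; this page does not spell them out.

```python
from typing import Dict

# Version strings copied from the list above.
SUPPORTED_VERSIONS = ("2024-06-10", "2024-11-13", "2025-04-16", "2026-03-01")


def build_headers(access_token: str, version: str = "2026-03-01") -> Dict[str, str]:
    """Return request headers for a client-side call (hypothetical helper).

    Assumption: the token travels as a Bearer token and the version as a
    `Cartesia-Version` header. Neither header name is confirmed by this page.
    """
    if not access_token:
        raise ValueError("access_token must be a non-empty string")
    if version not in SUPPORTED_VERSIONS:
        raise ValueError(
            f"unsupported API version {version!r}; expected one of {SUPPORTED_VERSIONS}"
        )
    return {
        "Authorization": f"Bearer {access_token}",
        "Cartesia-Version": version,
    }
```

Rejecting an unknown version locally gives a clearer error than waiting for the server to refuse the request.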
The language that the given voice should speak the transcript in. For valid options, see Models.
en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr, tl, bg, ro, ar, cs, el, fi, hr, ms, sk, da, ta, uk, hu, no, vi, bn, th, he, ka, id, te, gu, kn, ml, mr, pa
Whether to save the generated audio file. When true, the response will include a Cartesia-File-ID header.
The ID of a pronunciation dictionary to use for the generation. Pronunciation dictionaries are supported by sonic-3 models and newer.
Configure the various attributes of the generated speech. These are only for sonic-3 and have no effect on earlier models.
See Volume, Speed, and Emotion in Sonic-3 for a guide on this option.
Use generation_config.speed for sonic-3.
Speed setting for the model. Defaults to normal.
This feature is experimental and may not work for all voices.
Influences the speed of the generated speech. Faster speeds may reduce hallucination rate.
slow, normal, fast
Audio bytes
The response is of type file.
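The body options described above can be sketched as a small request builder, together with a helper that reads the Cartesia-File-ID response header when saving is enabled. This is a minimal illustration, not the official client: the JSON field names `transcript`, `language`, `save`, `pronunciation_dict_id`, and `speed`, and the function names, are assumptions. Only `generation_config.speed` and the `Cartesia-File-ID` header are named on this page. The value type of `generation_config.speed` is not stated here, so `generation_config` is passed through unvalidated.

```python
from typing import Any, Dict, Mapping, Optional

# Language codes copied from the list above.
LANGUAGES = frozenset(
    "en fr de es pt zh ja hi it ko nl pl ru sv tr tl bg ro ar cs el fi hr ms "
    "sk da ta uk hu no vi bn th he ka id te gu kn ml mr pa".split()
)
# Values of the older speed setting listed above.
LEGACY_SPEEDS = ("slow", "normal", "fast")


def build_body(
    transcript: str,
    language: str = "en",
    save: bool = False,
    pronunciation_dict_id: Optional[str] = None,
    speed: Optional[str] = None,
    generation_config: Optional[Mapping[str, Any]] = None,
) -> Dict[str, Any]:
    """Assemble a request body (field names are assumed, see lead-in)."""
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    if speed is not None and speed not in LEGACY_SPEEDS:
        raise ValueError(f"speed must be one of {LEGACY_SPEEDS}, got {speed!r}")
    body: Dict[str, Any] = {"transcript": transcript, "language": language, "save": save}
    if pronunciation_dict_id is not None:
        # Per the page, pronunciation dictionaries need sonic-3 or newer.
        body["pronunciation_dict_id"] = pronunciation_dict_id
    if speed is not None:
        body["speed"] = speed
    if generation_config is not None:
        # Per the page, these attributes only take effect on sonic-3.
        body["generation_config"] = dict(generation_config)
    return body


def saved_file_id(response_headers: Mapping[str, str]) -> Optional[str]:
    """Return the Cartesia-File-ID header value, or None if it is absent."""
    for name, value in response_headers.items():
        if name.lower() == "cartesia-file-id":  # header names are case-insensitive
            return value
    return None
```

A caller would typically send `build_body(...)` as JSON, write the returned audio bytes to a file, and call `saved_file_id` on the response headers only when `save` was set to true.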