Voice Changer (Bytes)

POST

Takes an audio file of speech, and returns an audio file of speech spoken with the same intonation, but with a different voice.

This endpoint is priced at 15 characters per second of input audio.

Headers

Auth
X-API-KeystringRequired
Cartesia-Version"2024-06-10"RequiredDefaults to 2024-06-10

Request

This endpoint expects a multipart form containing a file.
clipfileRequired
voice[id]stringRequired
output_format[container]enumRequired
Allowed values: rawwavmp3
output_format[sample_rate]integerRequired
output_format[encoding]enumOptional

Required for raw and wav containers.

Allowed values: pcm_f32lepcm_s16lepcm_mulawpcm_alaw
output_format[bit_rate]integerOptional

Required for mp3 containers.

Response

This endpoint returns a file.