Voice Changer (Bytes)
Takes an audio file of speech, and returns an audio file of speech spoken with the same intonation, but with a different voice.
This endpoint is priced at 15 characters per second of input audio.
Headers
X-API-Key
Cartesia-Version
Request
This endpoint expects a multipart form containing a file.
clip
voice[id]
output_format[container]
Allowed values:
output_format[sample_rate]
output_format[encoding]
Required for raw
and wav
containers.
Allowed values:
output_format[bit_rate]
Required for mp3
containers.
Response
This endpoint returns a file.