Clone Voice

POST /voices/clone
curl --request POST \
  --url https://api.cartesia.ai/voices/clone \
  --header 'Authorization: Bearer <token>' \
  --header 'Cartesia-Version: <cartesia-version>' \
  --header 'Content-Type: multipart/form-data' \
  --form 'name=<string>' \
  --form 'description=<string>' \
  --form language=en \
  --form 'base_voice_id=<string>' \
  --form clip=@example-file
{
  "id": "<string>",
  "user_id": "<string>",
  "is_public": true,
  "name": "<string>",
  "description": "<string>",
  "created_at": "2023-11-07T05:31:56Z",
  "language": "en"
}

Authorizations

Authorization
string
header
required

Cartesia API key

Headers

Cartesia-Version
enum<string>
required

API version header. Must be set to the API version, e.g. '2024-06-10'.

Available options: 2024-06-10, 2024-11-13, 2025-04-16

Example: "2025-04-16"

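The two required headers can be assembled and checked on the client before any request is sent. The following is a minimal sketch, not an official SDK call: `build_headers` and `SUPPORTED_VERSIONS` are hypothetical names, and the `Bearer` prefix is taken from the curl example for this endpoint.

```python
# Version strings copied from the "Available options" list for Cartesia-Version.
SUPPORTED_VERSIONS = ("2024-06-10", "2024-11-13", "2025-04-16")


def build_headers(api_key: str, version: str = "2025-04-16") -> dict:
    """Return the required headers (hypothetical helper, not part of any SDK)."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    if version not in SUPPORTED_VERSIONS:
        raise ValueError(f"unsupported Cartesia-Version: {version!r}")
    return {
        # Bearer scheme as shown in the curl example.
        "Authorization": f"Bearer {api_key}",
        "Cartesia-Version": version,
    }
```

Rejecting an unknown version locally is a design choice that surfaces typos early; the server remains the authority on which versions it accepts.
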
Body

multipart/form-data
clip
file
name
string

The name of the voice.

description
string | null

A description for the voice.

language
enum<string>

The language of the voice.

Available options: en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr, tl, bg, ro, ar, cs, el, fi, hr, ms, sk, da, ta, uk, hu, no, vi, bn, th, he, ka, id, te, gu, kn, ml, mr, pa

base_voice_id
string

Optional base voice ID that the cloned voice is derived from.
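
The body fields above map onto a multipart/form-data request as in the following sketch, which uses only the Python standard library and builds the request without sending it. `build_clone_request` is a hypothetical helper. Treating `description` and `base_voice_id` as omittable, defaulting `language` to `en`, and labeling the clip part `application/octet-stream` are assumptions; the page itself does not state which body fields are required or which audio formats are accepted.

```python
import uuid
import urllib.request

CLONE_URL = "https://api.cartesia.ai/voices/clone"


def build_clone_request(api_key, version, name, clip_bytes, clip_filename,
                        language="en", description=None, base_voice_id=None):
    """Build (but do not send) a POST /voices/clone request. Hypothetical helper."""
    boundary = uuid.uuid4().hex

    # Text fields; optional ones are left out when not provided (assumption).
    fields = {"name": name, "language": language}
    if description is not None:
        fields["description"] = description
    if base_voice_id is not None:
        fields["base_voice_id"] = base_voice_id

    parts = []
    for key, value in fields.items():
        parts.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{key}"\r\n\r\n'
             f"{value}\r\n").encode("utf-8")
        )

    # The audio sample goes in the "clip" file part.
    clip_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="clip"; filename="{clip_filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    parts.append(clip_header + clip_bytes + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode("ascii"))

    return urllib.request.Request(
        CLONE_URL,
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Cartesia-Version": version,
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To actually send the request, pass the returned object to `urllib.request.urlopen` and read the JSON body from the response. An HTTP client library with built-in multipart support would make the manual boundary handling unnecessary.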

Response

200 - application/json
id
string
required

The ID of the voice.

user_id
string
required

The ID of the user who owns the voice.

is_public
boolean
required

Whether the voice is publicly accessible.

name
string
required

The name of the voice.

description
string
required

The description of the voice.

created_at
string<date-time>
required

The date and time the voice was created.

language
enum<string>
required

The language that the given voice should speak the transcript in. For valid options, see Models.

Available options: en, fr, de, es, pt, zh, ja, hi, it, ko, nl, pl, ru, sv, tr, tl, bg, ro, ar, cs, el, fi, hr, ms, sk, da, ta, uk, hu, no, vi, bn, th, he, ka, id, te, gu, kn, ml, mr, pa
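
A 200 response can be decoded into a typed object along the lines of the sketch below. `ClonedVoice` and `parse_clone_response` are hypothetical names, and the timestamp handling assumes `created_at` has the same shape as the example value (`2023-11-07T05:31:56Z`).

```python
import json
from dataclasses import dataclass
from datetime import datetime

# The seven fields the response schema marks as required.
REQUIRED_FIELDS = ("id", "user_id", "is_public", "name",
                   "description", "created_at", "language")


@dataclass(frozen=True)
class ClonedVoice:
    id: str
    user_id: str
    is_public: bool
    name: str
    description: str
    created_at: datetime
    language: str


def parse_clone_response(payload: str) -> ClonedVoice:
    """Parse the JSON body of a 200 response (hypothetical helper)."""
    data = json.loads(payload)
    missing = [field for field in REQUIRED_FIELDS if field not in data]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")
    values = {field: data[field] for field in REQUIRED_FIELDS}
    # Older Python versions reject a trailing "Z" in fromisoformat,
    # so convert it to an explicit UTC offset first.
    values["created_at"] = datetime.fromisoformat(
        data["created_at"].replace("Z", "+00:00")
    )
    return ClonedVoice(**values)
```

Fields outside the documented schema are ignored rather than rejected, so the parser keeps working if the response gains new fields.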