Create Voice

Create a voice from raw features. To clone a voice from an audio file, use Clone Voice instead.

Headers

X-API-Key (string, Required)
Cartesia-Version ("2024-11-13", Required)
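
As a minimal sketch, the two required headers could be assembled as shown here. The helper name build_headers is hypothetical, and the Content-Type entry is an assumption (the page says only that the endpoint expects an object, which suggests a JSON body).

```python
def build_headers(api_key: str, version: str = "2024-11-13") -> dict:
    """Hypothetical helper: return the headers listed in this section."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    return {
        "X-API-Key": api_key,
        "Cartesia-Version": version,
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }
```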

Request

This endpoint expects an object.
name (string, Required)

The name of the voice.

description (string, Required)

The description of the voice.

embedding (list of doubles, Required)

A 192-dimensional vector (i.e. a list of 192 numbers) that represents the voice.

language (enum, Optional)

The language that the given voice should speak the transcript in.

Options: English (en), French (fr), German (de), Spanish (es), Portuguese (pt), Chinese (zh), Japanese (ja), Hindi (hi), Italian (it), Korean (ko), Dutch (nl), Polish (pl), Russian (ru), Swedish (sv), Turkish (tr).

base_voice_id (string, Optional)

The ID of a base voice to pull features from. Used for features such as voice mixing.
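
The request fields above can be sketched as a small builder that checks the inputs before anything is sent. The function name build_create_voice_body is hypothetical, and the client-side checks (192 values, finite numbers, a known language code) are assumptions drawn from the field descriptions, not documented server behavior.

```python
import math
from typing import Optional, Sequence

EMBEDDING_DIM = 192  # dimension stated in the embedding field description
LANGUAGES = {"en", "fr", "de", "es", "pt", "zh", "ja", "hi",
             "it", "ko", "nl", "pl", "ru", "sv", "tr"}


def build_create_voice_body(
    name: str,
    description: str,
    embedding: Sequence[float],
    language: Optional[str] = None,
    base_voice_id: Optional[str] = None,
) -> dict:
    """Hypothetical helper: validate inputs and return a JSON-ready dict."""
    if len(embedding) != EMBEDDING_DIM:
        raise ValueError(
            f"embedding must have {EMBEDDING_DIM} values, got {len(embedding)}"
        )
    if not all(isinstance(x, (int, float)) and math.isfinite(x) for x in embedding):
        raise ValueError("embedding values must be finite numbers")
    if language is not None and language not in LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")

    body = {
        "name": name,
        "description": description,
        "embedding": [float(x) for x in embedding],
    }
    # Optional fields are omitted entirely when not provided.
    if language is not None:
        body["language"] = language
    if base_voice_id is not None:
        body["base_voice_id"] = base_voice_id
    return body
```

Passing the returned dict through json.dumps yields a body containing only the fields that were set.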

Response

This endpoint returns an object.
id (string)

The ID of the voice.

is_owner (boolean)

Whether the current user is the owner of the voice.

name (string)

The name of the voice.

description (string)

The description of the voice.

created_at (datetime)

The date and time the voice was created.

language (enum)

The language that the given voice should speak the transcript in.

Options: English (en), French (fr), German (de), Spanish (es), Portuguese (pt), Chinese (zh), Japanese (ja), Hindi (hi), Italian (it), Korean (ko), Dutch (nl), Polish (pl), Russian (ru), Swedish (sv), Turkish (tr).

embedding (list of doubles, Optional)

The vector embedding of the voice. Only included when expand includes embedding.

is_starred (boolean, Optional)

Whether the current user has starred the voice. Only included when expand includes is_starred.
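
One way to consume the response object is to map it onto a typed record, as sketched here. The Voice class and parse_voice function are hypothetical names, and the sketch assumes created_at arrives as an ISO 8601 string (this page says only "datetime"). The two optional fields default to None when absent.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass
class Voice:
    id: str
    is_owner: bool
    name: str
    description: str
    created_at: datetime
    language: str
    embedding: Optional[List[float]] = None   # present only when expanded
    is_starred: Optional[bool] = None         # present only when expanded


def parse_voice(payload: dict) -> Voice:
    """Hypothetical helper: convert a decoded response object into a Voice."""
    raw = payload["created_at"]
    # Assumption: ISO 8601 timestamp. A trailing "Z" is rewritten so that
    # datetime.fromisoformat accepts it on Python versions before 3.11.
    if raw.endswith("Z"):
        raw = raw[:-1] + "+00:00"
    return Voice(
        id=payload["id"],
        is_owner=payload["is_owner"],
        name=payload["name"],
        description=payload["description"],
        created_at=datetime.fromisoformat(raw),
        language=payload["language"],
        embedding=payload.get("embedding"),
        is_starred=payload.get("is_starred"),
    )
```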