On June 1, 2026, we are discontinuing our voice embedding (aka stability) TTS models. Voices listed on play.cartesia.ai/deprecation/voices will stop working. Simply click “Auto Migrate” to make these voices compatible with the latest Sonic 3, 2, and Turbo snapshots. If you use voice embeddings rather than voice IDs, see Voice IDs. For an overview of all changes, see API Changes.

Where do these voices come from?

Voices created by these endpoints rely on our voice embedding models:

Creating voices

To create voices, move to our Clone Voice API or use our web UI; both create a voice from 3–10 seconds of source audio. You can test these API changes by setting your Cartesia Version to 2026-03-01. We recommend upgrading the Cartesia Version on your production traffic before June 1, 2026, so you can confirm nothing breaks ahead of the cutoff. Here is an example using the Cartesia SDK:
from cartesia import Cartesia

your_api_key: str = ""

client = Cartesia(api_key=your_api_key)

print("Cloning a voice")
with open("3 to 10 seconds of source audio.wav", mode="rb") as f:
    voice = client.voices.clone(
        clip=f,
        # this must match the source audio
        language="en",
        name="My Voice",
        mode="similarity",
    )
print(f"Cloned voice {voice.id}")

print("Generating audio...")
generated_audio = client.tts.bytes(
    # voice embeddings will not work after June 1, 2026!
    voice={"mode": "id", "id": voice.id},
    model_id="sonic-3",
    transcript="Hello from Cartesia!",
    language="en",
    output_format={
        "container": "wav",
        "encoding": "pcm_f32le",
        "sample_rate": 44100
    },
)
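
Before switching production traffic, it may help to check that your own request payloads already reference voices by ID and to pin the test version explicitly. The following is a minimal sketch, not part of the SDK: the helper names (`build_headers`, `uses_voice_id`, `find_embedding_payloads`) are hypothetical, and the header names `Cartesia-Version` and `X-API-Key` are assumptions about how the version and key are sent on raw HTTP requests. Only the `{"mode": "id", "id": ...}` voice shape and the `2026-03-01` version string come from the example above.

```python
from typing import Any, Dict, List, Mapping

# Version string taken from the text above; used here as the default to test against.
TEST_VERSION = "2026-03-01"


def build_headers(api_key: str, version: str = TEST_VERSION) -> Dict[str, str]:
    """Return HTTP headers that pin the API version.

    Assumption: the version travels in a "Cartesia-Version" header and the
    key in "X-API-Key". Adjust both if your integration differs.
    """
    return {
        "Cartesia-Version": version,
        "X-API-Key": api_key,
        "Content-Type": "application/json",
    }


def uses_voice_id(voice: Mapping[str, Any]) -> bool:
    """True if the voice specifier has the {"mode": "id", "id": ...} form."""
    return voice.get("mode") == "id" and bool(voice.get("id"))


def find_embedding_payloads(payloads: List[Mapping[str, Any]]) -> List[int]:
    """Return the indexes of TTS payloads whose voice is not referenced by ID."""
    return [
        i for i, payload in enumerate(payloads)
        if not uses_voice_id(payload.get("voice", {}))
    ]
```

Running `find_embedding_payloads` over a sample of logged or templated request bodies gives a quick list of the call sites that still need to move to voice IDs.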