Cartesia provides first-party speech APIs and SDKs, and integrates with many other products and developer frameworks. The pages in this section describe each path at a high level; detailed setup usually lives in partner documentation and repositories.

Prerequisites

You’ll need these for almost every integration below. Individual pages also list extras (partner accounts, runtimes, SDK installs).

Integrations

LiveKit

Realtime rooms and agents, with Cartesia available through LiveKit Inference or the Cartesia plugin.

Pipecat

Python voice and multimodal agents with official Cartesia TTS/STT services.

Twilio

Programmable Voice and Media Streams with Cartesia TTS (Node walkthrough).

Tencent RTC

TRTC realtime media with Cartesia for conversational AI workloads.

Thoughtly

No-code phone agents; Cartesia is the default voice stack for new agents.

Rasa

Rasa Pro voice assistants with Cartesia as the TTS backend.

Vision Agents (by Stream)

Stream’s Vision Agents framework with a Cartesia TTS plugin.

MCP

cartesia-mcp for Cursor, Claude Desktop, and other MCP clients.

Step-by-step setup lives on each page and in partner docs and repos.