Cartesia provides first-party speech APIs and SDKs, and integrates with many other products and developer frameworks. The pages in this section describe each path at a high level; detailed setup usually lives in partner documentation and repositories.Documentation Index
Fetch the complete documentation index at: https://docs.cartesia.ai/llms.txt
Use this file to discover all available pages before exploring further.
Prerequisites
You’ll need these for almost every integration below. Individual pages also list extras (partner accounts, runtimes, SDK installs).- Cartesia API key — create and manage keys in the Playground.
- A voice — pick one in the Playground or API; see Choosing a voice and Voice IDs.
Integrations
LiveKit
Realtime rooms and agents—Cartesia via LiveKit Inference or the Cartesia plugin.
Pipecat
Python voice and multimodal agents with official Cartesia TTS/STT services.
Twilio
Programmable Voice and Media Streams with Cartesia TTS (Node walkthrough).
Tencent RTC
TRTC realtime media with Cartesia for conversational AI workloads.
Thoughtly
No-code phone agents; Cartesia is the default voice stack for new agents.
Rasa
Rasa Pro voice assistants with Cartesia as the TTS backend.
Vision Agents (by Stream)
Stream’s Vision Agents framework with a Cartesia TTS plugin.
MCP
cartesia-mcp for Cursor, Claude Desktop, and other MCP clients.