Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.cartesia.ai/llms.txt

Use this file to discover all available pages before exploring further.

Pipecat logo

Overview

Pipecat is an open-source Python framework for realtime voice agents. Building voice agents requires the creation and orchestration of pipelines, media and communication transports (such as Daily or LiveKit), and pluggable AI models. Cartesia is available as a first-party provider plugin for TTS and STT services in the Pipecat repo.

Prerequisites

Pipecat’s examples require a recent Python installation (see the Pipecat repo’s root-level README for current prerequisites). Install the pipecat-ai Python package with the cartesia extra for TTS/STT (bracket syntax):
pip install "pipecat-ai[cartesia,...]"

# or

uv add "pipecat-ai[cartesia,...]"
You’d also need to choose the transport extras your sample needs - you can do this by matching whatever the upstream README lists for that example.

Getting Started - TTS (Websockets)

Pipecat’s getting-started example provides you with a small, copy-friendly path to wire Cartesia TTS into a Pipecat TTS WebSocket API, and:

Cartesia & Pipecat | Getting Started

Getting-started examples in the Pipecat repo.

Getting Started - TTS and STT (Websockets & HTTP)

For smaller voice-focused samples using Cartesia STT and TTS you can choose between two transports - WebSockets or HTTP:

Pipecat & Cartesia Voice (WebSockets)

Voice bot using Cartesia STT & TTS over WebSocket.

Pipecat & Cartesia Voice (HTTP)

Same flow using Cartesia STT & TTS over HTTP.

Orchestrated Conversational AI

For a fuller example app that shows an end to end Voice Agent experience (VAD -> STT -> LLM -> TTS) orchestrated with Pipecat, see StudyPal:

Pipecat > StudyPal

StudyPal example in the pipecat-examples repo.