> ## Documentation Index
> Fetch the complete documentation index at: https://docs.cartesia.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Vision Agents by Stream

<Frame background="subtle">
  <img src="https://mintcdn.com/cartesia-2650f86a/AxUE_Qu8Ppp08fvH/assets/images/integration-headers/vision-agents-by-stream.png?fit=max&auto=format&n=AxUE_Qu8Ppp08fvH&q=85&s=4445b271f8c108a1d4a4ed2ba5e009d2" alt="Vision Agents logo" width="1440" height="756" data-path="assets/images/integration-headers/vision-agents-by-stream.png" />
</Frame>

[Stream](https://getstream.io/) maintains **[Vision Agents](https://visionagents.ai)**—an open-source Python framework for voice- and vision-driven agents with realtime media over **Stream**’s WebRTC edge. Cartesia is supported as the **TTS** provider; install steps, environment variables, and parameters are in Stream’s **[Cartesia integration](https://visionagents.ai/integrations/cartesia)**.

You need a **Stream** developer account for realtime transport and a **Cartesia API key** for speech.

The ["Simple Agent"](https://github.com/GetStream/Vision-Agents/tree/main/examples/01_simple_agent_example) example in GitHub and the [voice](https://visionagents.ai/introduction/voice-agents) / [video](https://visionagents.ai/introduction/video-agents) intros are good starting points.

# Demo

<Card title="Vision Agents Cartesia Demo" icon="fa-solid fa-link" href="https://github.com/GetStream/Vision-Agents/tree/main/examples/01_simple_agent_example">
  Try out the Simple Agent Cartesia demo.
</Card>
