This endpoint creates a bidirectional WebSocket connection. The connection supports multiplexing, so you can send multiple requests and receive the corresponding responses in parallel.
The WebSocket API is built around contexts:
context_id. Further inputs on the same context_id will continue the generation, maintaining prosody.context_id you passed in so that you can match requests and responses.Read the guide on working with contexts to learn more.
For the best performance, we recommend the following usage pattern:
For conversational agent use cases, we recommend the following usage pattern:
To learn more about managing concurrent generations and WebSocket connection limits, see the concurrency limits and timeouts page.
Documentation Index
Fetch the complete documentation index at: https://docs.cartesia.ai/llms.txt
Use this file to discover all available pages before exploring further.