Back to guides
Migrating from OpenAI Realtime Transcription and Cartesia best practices
Endpoints
Cartesia Native:/stt - Full feature supportOpenAI Compatible:
/audio/transcriptions - Drop-in replacement for OpenAI
Using the OpenAI SDK
Replace your OpenAI base URL withhttps://api.cartesia.ai to use the compatibility layer for Cartesia:
Supported parameters
file- The audio file to transcribemodel- Useink-whisperlanguage- Input audio language (ISO-639-1 format)timestamp_granularities- Include["word"]to get word-level timestamps
ink-whisper:
Direct API Usage
Both endpoints accept identical parameters and return the same JSON response format:Cartesia Native Endpoint
OpenAI-Compatible Endpoint
Summary
To migrate from OpenAI Speech to Text to Cartesia:- Update the base URL: Change from
https://api.openai.com/v1tohttps://api.cartesia.ai - Update authentication: Replace your OpenAI API key with your Cartesia API key
- Update model names: Use
ink-whisperinstead of OpenAI’s model names - Keep the same endpoint: Continue using
/audio/transcriptions - Avoid unsupported parameters: Remove
prompt,temperature, andresponse_formatparameters - Use timestamp_granularities (Optional): Add
timestamp_granularities: ["word"]to get word-level timestamps