Cartesia
- Supported service:
tts
- Key:
cartesia
- Integrated: Yes, but you may want to provide your own key to use your custom voices. See BYO Keys for more info.
Service options
The audio output sample rate in Hz for the TTS audio.
The sample rate must be one of these values: 8000
, 16000
, 22050
, 24000
, 44100
, 48000
.
Configuration options
Select from the available Cartesia models.
For the best performance for English, we recommend using the "sonic-english"
model. For multilingual, we recommend using the "sonic-multilingual"
model.
The voice you want your TTS service to use. Select any voice from the available Cartesia voices.
You can click the “Try it out” button on https://cartesia.ai/sonic to sign up for a free account and sample the built-in voices. You can also find voice IDs from their playground page.
The language you want your TTS service to use. To select a non-English language, select the sonic-mulitlingual
model and specify the language. Learn more.
The rate at which the text is spoken. Learn more.
Speed options include slowest
, slow
, normal
, fast
, and fastest
.
For more granular control, you can define speed as a number within the range [-1.0, 1.0]. A value of 0 represents the default speed, while negative values slow down the speech and positive values speed it up.
The emotion parameter is an array of “tags” in the form emotion_name:level
. For example, positivity:high
or curiosity
. Learn more.
Emotion names: anger
, positivity
, surprise
, sadness
, curiosity
.
Emotion levels: lowest
, low
, (omit for moderate addition of emotion), high
, highest
.
Control whether the TTS service filters out markdown, code blocks, or tables from its output.
Basic markdown filtering is enabled by default. Enable code and table filtering as needed. Filtering code and tables can help the TTS avoid mistakes.