Text to Speech
Converts text to speech using specified voice and parameters
Authorizations
Body
The voice ID to use for text-to-speech conversion. See the available voices documentation 👉 GET /voices/get-voices
The text to convert to speech
raw
, wav
, mp3
pcm_f32le
, pcm_s16le
, mulaw
, alaw
, mp3
The sample rate of the audio file in Hertz (Hz). This determines the number of samples of audio carried per second. The minimum value is 8000 Hz, which is typically used for telephony. The default value is 16000 Hz, which provides a good balance between quality and file size. The maximum value is 48000 Hz, which is used for high-quality audio recordings.
8000 < x < 48000
en
, fr
, de
, es
, pt
, zh
, ja
, hi
, it
, ko
, nl
, pl
, ru
, sv
, tr
Adjusts the speed of the voice. Acceptable values range from -1 (slowest) to 1 (fastest), with 0 being the default normal speed.
-1 < x < 1
Array of voice emotions to apply. Acceptable values include various levels of anger, positivity, surprise, sadness, and curiosity.
anger:lowest
, anger:low
, anger
, anger:high
, anger:highest
, positivity:lowest
, positivity:low
, positivity
, positivity:high
, positivity:highest
, surprise:lowest
, surprise:low
, surprise
, surprise:high
, surprise:highest
, sadness:lowest
, sadness:low
, sadness
, sadness:high
, sadness:highest
, curiosity:lowest
, curiosity:low
, curiosity
, curiosity:high
, curiosity:highest
Response
The response is of type file
.
Was this page helpful?