TTS Voice Controls
Adjust speaking speed and override pronunciation for specific words using Aura-2 controls.
Aura-2 Controls enable fine-grained adjustments to speech output, allowing you to modify speaking speed and override pronunciation for specific words. These controls are designed for enterprise use cases requiring precise voice quality for industry-specific terminology, brand names, and complex content.
Adjust the speaking rate of generated audio. Speed control modifies the pace of speech while maintaining natural prosody and voice quality.
For Spanish voices, the recommended speed range is 0.9x–1.5x. Values below 0.9x may introduce disfluencies.
Speed accepts values from 0.7x to 1.5x. Requests with a value outside this range return an error.
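The limits above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: the function name, the constant names, and the use of "es" as a language code are assumptions made for illustration, while the numeric limits come from the text above.

```python
# Limits taken from the text above; all names here are hypothetical.
MIN_SPEED = 0.7
MAX_SPEED = 1.5
SPANISH_RECOMMENDED_MIN = 0.9


def check_speed(speed: float, language: str = "en") -> list[str]:
    """Raise ValueError for unsupported speeds; otherwise return any warnings."""
    if not MIN_SPEED <= speed <= MAX_SPEED:
        raise ValueError(
            f"speed {speed} is outside the supported {MIN_SPEED}x-{MAX_SPEED}x range"
        )
    warnings = []
    # "es" as the Spanish language code is an assumption for this sketch.
    if language == "es" and speed < SPANISH_RECOMMENDED_MIN:
        warnings.append(
            f"speed {speed} is below the recommended {SPANISH_RECOMMENDED_MIN}x "
            "minimum for Spanish voices and may introduce disfluencies"
        )
    return warnings
```

Separating the hard limit (an exception) from the Spanish recommendation (a warning) mirrors the distinction the text draws between values that are rejected and values that are merely discouraged.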
Override the default pronunciation of specific words using International Phonetic Alphabet (IPA) notation.
Pronunciation overrides are specified inline within the text using escaped JSON objects, for example \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\}
Where:
word is the original text (used for billing and display)
pronounce is the IPA phonetic transcription
The object is wrapped in escaped curly braces (\{ and \})
The curly braces must be escaped with \\{ and \\} in the cURL command.
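When overrides are generated programmatically, a small helper can keep the escaping consistent. This is a sketch only: the helper name pronunciation_override is hypothetical, and it assumes the inline format is exactly a JSON object with word and pronounce keys, wrapped in \{ and \} as described above.

```python
import json


def pronunciation_override(word: str, ipa: str) -> str:
    """Return an inline override: a JSON object wrapped in escaped braces."""
    # ensure_ascii=False keeps IPA symbols readable instead of \uXXXX escapes.
    payload = json.dumps({"word": word, "pronounce": ipa}, ensure_ascii=False)
    # json.dumps wraps the object in plain braces; swap them for escaped ones.
    return "\\{" + payload[1:-1] + "\\}"


text = f"Hello, {pronunciation_override('Mr.', 'ˈmɪstɚ')} Bond."
print(text)
```

Using json.dumps for the object body means quotes or other special characters inside the word are escaped correctly rather than concatenated by hand.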
A few rules of thumb for producing IPA for your own vocabulary:
Best practices:
Speed and pronunciation controls can be used together in the same request.
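As a rough illustration of combining both controls, the sketch below assembles a single request without sending it. The endpoint URL, the model name, and the choice to pass speed as a query parameter are placeholders and assumptions, not confirmed API details. Consult the API reference for the actual names.

```python
import json
from urllib.parse import urlencode

# Hypothetical endpoint, for illustration only.
BASE_URL = "https://api.example.com/v1/speak"


def build_request(text: str, speed: float, model: str) -> tuple[str, bytes]:
    """Return the URL and JSON body for one request that uses both controls."""
    # Passing speed as a query parameter is an assumption of this sketch.
    url = f"{BASE_URL}?{urlencode({'model': model, 'speed': speed})}"
    # The pronunciation override travels inline, inside the text itself.
    body = json.dumps({"text": text}, ensure_ascii=False).encode("utf-8")
    return url, body


url, body = build_request(
    r'Hello, \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\} Bond.',
    speed=1.2,
    model="example-voice",  # placeholder model name
)
```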
In Python, use a raw string (r'...') with escaped braces \{ and \} for pronunciation control, so that Python leaves the backslashes intact.
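The short sketch below shows why a raw string is convenient here, using only the standard library. It also shows that serializing the text to JSON doubles each backslash, which is presumably why the cURL form uses \\{ and \\}.

```python
import json

raw = r'Hello, \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\} Bond.'
# Without the r prefix, every backslash has to be doubled by hand.
doubled = 'Hello, \\{"word": "Mr.", "pronounce": "ˈmɪstɚ"\\} Bond.'
assert raw == doubled

# JSON encoding escapes each backslash, so \{ becomes \\{ in the request body.
body = json.dumps({"text": raw}, ensure_ascii=False)
print(body)
```

Letting a JSON library produce the body avoids counting backslashes manually, which is the usual source of mistakes when a cURL example is ported to Python.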
Billing counts the original word, not the override markup. Example: the text Hello, \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\} Bond. is billed as Hello, Mr. Bond. (16 characters)
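To estimate the billed length of a text before sending it, each override can be replaced with its word value and the result counted. This is a sketch of the rule described above, not the service's actual billing code. The names OVERRIDE and billed_text are hypothetical, and the pattern assumes the IPA string never contains the sequence \}.

```python
import json
import re

# Matches one inline override: \{ ... \} with a JSON object body inside.
OVERRIDE = re.compile(r'\\\{(.*?)\\\}')


def billed_text(text: str) -> str:
    """Replace each override with its original "word" value."""
    def original_word(match: re.Match) -> str:
        # Restore plain braces so the body parses as a JSON object.
        return json.loads("{" + match.group(1) + "}")["word"]
    return OVERRIDE.sub(original_word, text)


text = r'Hello, \{"word": "Mr.", "pronounce": "ˈmɪstɚ"\} Bond.'
print(billed_text(text), len(billed_text(text)))  # Hello, Mr. Bond. 16
```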