Skip to Content

Frequently Asked Questions

Audio models

What model should I use for text-to-speech?

Use step-tts-2. See Audio Models for an overview.

Where can I find the TTS API parameters?

See Generate audio for the full request schema and examples.

What audio formats are supported?

wav, mp3, flac, opus, pcm. Default is mp3.

Is there a limit on input length?

Yes. The maximum input length is 1,000 characters per request.

Last updated on