> ## Documentation Index
> Fetch the complete documentation index at: https://platform.stepfun.ai/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# Voice interaction developer guide

StepFun provides developers with voice interaction models that support audio generation and [voice cloning](/en/api-reference/audio/create-voice). By integrating these models, applications can extend beyond standard large language model understanding and enable voice interaction.

## Quick Start

### Quickly Generate an Audio Clip

Copy the following code to quickly generate an audio file.

```bash theme={null}
curl --location 'https://api.stepfun.ai/v1/audio/speech' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer $STEP_API_KEY" \
--data '{
   "model":"step-tts-2",
   "input":"StepFun is building the next generation of AGI.",
   "voice":"lively-girl"
}'\
--output "step.mp3"
```

<div id="supported-voices" />

## Voice Recommendations by Scenario

StepFun offers dozens of recommended voices across seven major scenarios. You can preview different voices here and use them via the API.
We strongly recommend using [voice cloning](/en/api-reference/audio/create-voice) to create custom voices. The step-tts-2 model delivers industry-leading cloning performance, and cloned voices support all emotion and style controls at zero additional cost.

### 1. Marketing

Marketing scenarios require voices with charisma, persuasiveness, and warmth that can effectively convey product value and inspire purchase intent. Step-TTS delivers full emotional expression, building trust and professionalism to make marketing content more compelling.

| Supported Models               | Voice Name    | Voice ID            | Audio Samples                                                                                                                                                                                                                                                                                                           |
| :----------------------------- | :------------ | :------------------ | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| stepaudio-2.5-tts / step-tts-2 | Lively Breezy | livelybreezy-female | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764227859773_6r0lga.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764227873237_xbptdq.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Upright Youth | zhengpaiqingnian    | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764227897199_sz9mse.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764227907908_w73z24.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |

### 2. Customer Service

Customer service scenarios require voices that are warm, patient, and professional, capable of calming users and providing clear solutions. We offer two types of customer service voices — step-tts-2 voices stand out with rich audio quality, full emotion, and a lifelike human feel, making the first four recommendations especially suited for phone scenarios.

| Supported Models               | Voice Name           | Voice ID             | Audio Samples                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
| :----------------------------- | :------------------- | :------------------- | :----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| stepaudio-2.5-tts / step-tts-2 | Straightforward Male | shuangkuainansheng   | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232419372_cjweb4.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232431616_d45kct.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232439733_ucrkox.wav" target="_blank" rel="noopener noreferrer">Sample 3</a> |
| stepaudio-2.5-tts / step-tts-2 | Capable Female       | ganliannvsheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232450294_e3wqt8.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232467535_p2pu4n.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232485079_rarrcy.wav" target="_blank" rel="noopener noreferrer">Sample 3</a> |
| stepaudio-2.5-tts / step-tts-2 | Warm Female          | qinhenvsheng         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232648658_g46auz.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232658059_p1hw59.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232666804_uxi2nq.wav" target="_blank" rel="noopener noreferrer">Sample 3</a> |
| stepaudio-2.5-tts / step-tts-2 | Energetic Female     | huolinvsheng         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232752142_umr3gn.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232784129_7y8d7t.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232765549_zhtxfv.wav" target="_blank" rel="noopener noreferrer">Sample 3</a> |
| stepaudio-2.5-tts / step-tts-2 | Elegant Gentle       | elegantgentle-female | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232963968_2lpzrh.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764232975950_nb22zj.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Lively Breezy        | livelybreezy-female  | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233092953_7uj9hs.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233081037_2uelez.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Gentle Male          | wenrounansheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233106756_zi400x.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233116023_vvny08.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Classic Female       | jingdiannvsheng      | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233202847_b14wof.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233185685_fx2jym.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Mature Gentle        | wenroushunv          | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233217617_grbf54.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233227812_ca92c6.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Sweet Female         | tianmeinvsheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233448772_4id5xn.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233457923_qdn6vq.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Pure Girl            | qingchunshaonv       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233506642_dd78q1.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233498003_buc059.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |
| stepaudio-2.5-tts / step-tts-2 | Spirited Male        | yuanqinansheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233516928_rpphje.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764233522624_r33ce3.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>                                                                                                                                                              |

### 3. Audiobook

Audiobooks require voices that are expressive and emotionally engaging, capable of vividly bringing different characters and story atmospheres to life. Our TTS stands out with its delicate emotional expression and versatile vocal styles, enabling listeners to fully immerse themselves in the world of the story.

| Supported Models               | Voice Name          | Voice ID         | Audio Samples                                                                                                                                                                                                                                                                                                           |
| :----------------------------- | :------------------ | :--------------- | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| stepaudio-2.5-tts / step-tts-2 | Lively Girl         | lively-girl      | <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/lively-girl_In.mp3" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/lively-girl_Years.mp3" target="_blank" rel="noopener noreferrer">Sample 2</a>            |
| stepaudio-2.5-tts / step-tts-2 | Scholarly Gentleman | ruyananshi       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231706722_cerhao.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231727050_6znwwg.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Gentle Female       | wenrounvsheng    | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231734698_puka7d.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231740416_syoeyy.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Tender Gentleman    | wenrougongzi     | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231762453_68oj6b.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231771111_0r0h3f.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Magnetic Male       | cixingnansheng   | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231784986_xvr1ij.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231791652_9d9uy1.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Spirited Girl       | yuanqishaonv     | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231810865_xiz5jk.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231820537_fz8979.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Upright Youth       | zhengpaiqingnian | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231834824_31a18s.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231842616_3y3jsx.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Spirited Male       | yuanqinansheng   | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231882851_ha2oe6.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231889066_vty0d3.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Broadcast Male      | boyinnansheng    | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231898586_yvqsfk.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231903863_n0rq6k.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Deep Male           | shenchennanyin   | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231909638_diuypc.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764231916499_zk19wv.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |

### 4. Emotional Companionship

Emotional companionship requires voices that are warm, gentle, and empathetic, capable of providing users with comfort and psychological support. Our TTS features delicate, soothing voice timbres with strong emotional expressiveness, helping you create a safe and comforting interaction environment for users.

| Supported Models               | Voice Name            | Voice ID              | Audio Samples                                                                                                                                                                                                                                                                                                               |
| :----------------------------- | :-------------------- | :-------------------- | :-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| stepaudio-2.5-tts / step-tts-2 | Soft-spoken Gentleman | soft-spoken-gentleman | <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/Soft-spoken-Gentleman-1.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/Soft-spoken-Gentleman-2.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Elegant Gentle        | elegantgentle-female  | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230170888_rpysp8.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230185710_01rt1x.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Lively Breezy         | livelybreezy-female   | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230220618_qbsx4o.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230236362_tfyy2l.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Gentle Male           | wenrounansheng        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230277656_1jj8at.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230256668_9lhgua.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Tender Gentleman      | wenrougongzi          | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230295150_t7wi3h.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230307812_0gi9a8.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Classic Female        | jingdiannvsheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230324896_hhowz6.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230343351_k3vj3k.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Friendly Female       | qinqienvsheng         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230361427_d9rn3n.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764230377953_9uheoc.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Sweet Female          | tianmeinvsheng        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229581488_9bjx0x.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229591698_yk6d1a.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Magnetic Male         | cixingnansheng        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229600823_8hr2u3.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229613085_i8q5vt.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Spirited Girl         | yuanqishaonv          | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229624319_gy96i6.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229636363_rl7puv.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Girl Next Door        | linjiajiejie          | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229893983_biv5nz.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229906559_3bkhy1.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Scholarly Gentleman   | ruyananshi            | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229965580_i9iwog.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229990027_178cz8.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Deep Male             | shenchennanyin        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229487791_a43xpe.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229478620_pgi8xr.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Gentle Female         | wenrounvsheng         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229464183_kpaxa8.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229454546_dxzd93.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |
| stepaudio-2.5-tts / step-tts-2 | Cute Soft Female      | ruanmengnvsheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229441268_p9xrn8.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229433217_mzc9eo.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>     |

### 5. Voice Assistant

Voice assistant scenarios require voices that are clear, natural, and efficient, capable of accurately understanding and responding to user commands. Our TTS features natural prosody and full emotional expression, making your voice assistant both professional and approachable.

| Supported Models               | Voice Name          | Voice ID             | Audio Samples                                                                                                                                                                                                                                                                                                           |
| :----------------------------- | :------------------ | :------------------- | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| stepaudio-2.5-tts / step-tts-2 | Elegant Gentle      | elegantgentle-female | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228832095_29ybp3.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228840760_7lfeif.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Lively Breezy       | livelybreezy-female  | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228850379_g5udri.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228867575_iogyx5.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Pure Girl           | qingchunshaonv       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228890474_009vob.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228880711_uo1s1z.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Spirited Girl       | yuanqishaonv         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228956977_jkgzvk.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228942094_6cgojl.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Girl Next Door      | linjiajiejie         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228982358_2c8eqk.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228972645_kzl33n.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Scholarly Gentleman | ruyananshi           | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229029290_58c9et.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229015620_qu5naz.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Clever Girl         | jilingshaonv         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229042857_sbtid3.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229055562_gz7tjs.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Cute Soft Female    | ruanmengnvsheng      | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229191918_0u5rmk.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229207559_ifljss.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Kid Sister          | linjiameimei         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229220926_vubdhb.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229230493_3vr4dl.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Intellectual Lady   | zhixingjiejie        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229251134_r2dv2t.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764229242754_6j194i.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |

### 6. Video Dubbing

Video dubbing requires voices that are expressive, rhythmic, and visually evocative, capable of blending seamlessly with visual content. Our TTS excels in precise emotional delivery and fine-grained speech rhythm control, enhancing the impact and overall appeal of your videos.

| Supported Models               | Voice Name           | Voice ID             | Audio Samples                                                                                                                                                                                                                                                                                                             |
| :----------------------------- | :------------------- | :------------------- | :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| stepaudio-2.5-tts / step-tts-2 | Vibrant Youth        | vibrant-youth        | <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/vibrant-youth_Every.mp3" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/vibrant-youth_This.mp3" target="_blank" rel="noopener noreferrer">Sample 2</a>        |
| stepaudio-2.5-tts / step-tts-2 | Magnetic-voiced Male | magnetic-voiced-male | <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/Magnetic-voiced-Male-1.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.ai/static/platform-docs/resource/Magnetic-voiced-Male-2.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Girl Next Door       | linjiajiejie         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228368225_q6vf14.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228358790_54hsjm.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Kid Sister           | linjiameimei         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228406946_msnk7y.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228422631_jv0wae.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | College Student      | qingniandaxuesheng   | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228453152_11xqp7.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228437723_owq9wd.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Cute Soft Female     | ruanmengnvsheng      | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228469573_c9d14l.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228481762_npxp0m.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Elegant Female       | youyanvsheng         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228538767_2lkmyq.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228548744_a6t35v.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Cool Beauty          | lengyanyujie         | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228563074_mkfaxu.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228577493_12hmg5.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Intellectual Lady    | zhixingjiejie        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228598185_j5cgak.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228588844_rylx9j.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Bold Sister          | shuangkuaijiejie     | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228610601_8ca7fa.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228620326_6f4lyx.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |
| stepaudio-2.5-tts / step-tts-2 | Quiet Scholar        | wenjingxuejie        | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228640932_1hg09k.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228632403_m8zdxc.wav" target="_blank" rel="noopener noreferrer">Sample 2</a>   |

### 7. Education & Training

Education and training scenarios require voices that are clear, accurate, and inspiring, capable of effectively conveying knowledge and sparking learning interest. Our TTS excels at capturing the vocal characteristics of instructors across different emotional states.

| Supported Models               | Voice Name     | Voice ID             | Audio Samples                                                                                                                                                                                                                                                                                                           |
| :----------------------------- | :------------- | :------------------- | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| stepaudio-2.5-tts / step-tts-2 | Elegant Gentle | elegantgentle-female | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228058073_102wb4.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228049743_8dc93a.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Gentle Male    | wenrounansheng       | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228096723_naf04h.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228085782_awotak.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Lively Breezy  | livelybreezy-female  | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228115572_zfgvsz.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228129774_s17gwn.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |
| stepaudio-2.5-tts / step-tts-2 | Mature Gentle  | wenroushunv          | <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228171350_vs3v5a.wav" target="_blank" rel="noopener noreferrer">Sample 1</a> · <a href="https://static-openapi.stepfun.com/static/platform-docs/resource/1764228180145_lct34k.wav" target="_blank" rel="noopener noreferrer">Sample 2</a> |

## System Voice ID List

| Voice Name            | Voice ID              | Supported Models              | Recommended Use Cases                                               |
| :-------------------- | :-------------------- | :---------------------------- | :------------------------------------------------------------------ |
| Vibrant Youth         | vibrant-youth         | stepaudio-2.5-tts, step-tts-2 | Audiobook, video dubbing                                            |
| Lively Girl           | lively-girl           | stepaudio-2.5-tts, step-tts-2 | Audiobook, video dubbing                                            |
| Soft-spoken Gentleman | soft-spoken-gentleman | stepaudio-2.5-tts, step-tts-2 | Emotional companionship, audiobook                                  |
| Magnetic-voiced Male  | magnetic-voiced-male  | stepaudio-2.5-tts, step-tts-2 | Audiobook, video dubbing                                            |
| Confident Male        | zixinnansheng         | stepaudio-2.5-tts, step-tts-2 | Audiobook, emotional companionship, education, marketing            |
| Elegant Gentle        | elegantgentle-female  | stepaudio-2.5-tts, step-tts-2 | Customer service, voice-over, education, emotional companionship    |
| Lively Breezy         | livelybreezy-female   | stepaudio-2.5-tts, step-tts-2 | Emotional companionship, customer service, education, marketing     |
| Gentle Male           | wenrounansheng        | stepaudio-2.5-tts, step-tts-2 | Voice-over, emotional companionship, customer service, education    |
| Tender Gentleman      | wenrougongzi          | stepaudio-2.5-tts, step-tts-2 | Emotional companionship, audiobook                                  |
| Spirited Male         | yuanqinansheng        | stepaudio-2.5-tts, step-tts-2 | Audiobook, voice-over, customer service                             |
| Classic Female        | jingdiannvsheng       | stepaudio-2.5-tts, step-tts-2 | Customer service, emotional companionship                           |
| Mature Gentle         | wenroushunv           | stepaudio-2.5-tts, step-tts-2 | Customer service, voice-over, education                             |
| Sweet Female          | tianmeinvsheng        | stepaudio-2.5-tts, step-tts-2 | Emotional companionship, customer service                           |
| Pure Girl             | qingchunshaonv        | stepaudio-2.5-tts, step-tts-2 | Customer service, voice assistant                                   |
| Magnetic Male         | cixingnansheng        | stepaudio-2.5-tts, step-tts-2 | Audiobook, emotional companionship                                  |
| Spirited Girl         | yuanqishaonv          | stepaudio-2.5-tts, step-tts-2 | Audiobook, emotional companionship, voice assistant                 |
| Girl Next Door        | linjiajiejie          | stepaudio-2.5-tts, step-tts-2 | Voice-over, emotional companionship, voice assistant, video dubbing |
| Upright Youth         | zhengpaiqingnian      | stepaudio-2.5-tts, step-tts-2 | Marketing, audiobook                                                |
| College Student       | qingniandaxuesheng    | stepaudio-2.5-tts, step-tts-2 | Voice-over                                                          |
| Broadcast Male        | boyinnansheng         | stepaudio-2.5-tts, step-tts-2 | Audiobook, voice-over                                               |
| Scholarly Gentleman   | ruyananshi            | stepaudio-2.5-tts, step-tts-2 | Audiobook, emotional companionship, voice-over, voice assistant     |
| Deep Male             | shenchennanyin        | stepaudio-2.5-tts, step-tts-2 | Emotional companionship, audiobook                                  |
| Friendly Female       | qinqienvsheng         | stepaudio-2.5-tts, step-tts-2 | Voice-over                                                          |
| Gentle Female         | wenrounvsheng         | stepaudio-2.5-tts, step-tts-2 | Audiobook, emotional companionship                                  |
| Clever Girl           | jilingshaonv          | stepaudio-2.5-tts, step-tts-2 | Voice assistant, voice-over                                         |
| Cute Soft Female      | ruanmengnvsheng       | stepaudio-2.5-tts, step-tts-2 | Emotional companionship, voice assistant, video dubbing             |
| Elegant Female        | youyanvsheng          | stepaudio-2.5-tts, step-tts-2 | Video dubbing                                                       |
| Cool Beauty           | lengyanyujie          | stepaudio-2.5-tts, step-tts-2 | Video dubbing                                                       |
| Bold Sister           | shuangkuaijiejie      | stepaudio-2.5-tts, step-tts-2 | Voice-over                                                          |
| Quiet Scholar         | wenjingxuejie         | stepaudio-2.5-tts, step-tts-2 | Voice-over                                                          |
| Kid Sister            | linjiameimei          | stepaudio-2.5-tts, step-tts-2 | Video dubbing, voice-over, voice assistant                          |
| Intellectual Lady     | zhixingjiejie         | stepaudio-2.5-tts, step-tts-2 | Video dubbing, voice-over, voice assistant                          |
| Straightforward Male  | shuangkuainansheng    | stepaudio-2.5-tts, step-tts-2 | Customer service, voice assistant                                   |
| Capable Female        | ganliannvsheng        | stepaudio-2.5-tts, step-tts-2 | Customer service, voice assistant                                   |
| Warm Female           | qinhenvsheng          | stepaudio-2.5-tts, step-tts-2 | Customer service, voice assistant                                   |
| Energetic Female      | huolinvsheng          | stepaudio-2.5-tts, step-tts-2 | Customer service, voice assistant                                   |

## Voice Tags List

Voice tags support three categories: speaking style, emotion, and language. Emotion tags must be set in the `voice_label.emotion` field, while speaking-style tags must be set in the `voice_label.style` field.

<Warning>stepaudio-2.5-tts does NOT support voice tags. Use the `instruction` parameter for emotion and style control instead.</Warning>

| No. | Tag Name    | Tag Type       | step-tts-2 |
| --- | ----------- | -------------- | ---------- |
| 1   | Happy       | Emotion        | ✓          |
| 2   | Very Happy  | Emotion        | ✓          |
| 3   | Sad         | Emotion        | ✓          |
| 4   | Angry       | Emotion        | ✓          |
| 5   | Very Angry  | Emotion        | ✓          |
| 6   | Coquettish  | Emotion        | ✓          |
| 7   | Slow        | Speaking Style | ✓          |
| 8   | Very Slow   | Speaking Style | ✓          |
| 9   | Fast        | Speaking Style | ✓          |
| 10  | Very Fast   | Speaking Style | ✓          |
| 11  | Fearful     | Emotion        | ✓          |
| 12  | Surprised   | Emotion        | ✓          |
| 13  | Excited     | Emotion        | ✓          |
| 14  | Admiring    | Emotion        | ✓          |
| 15  | Confused    | Emotion        | ✓          |
| 16  | Cold        | Delivery Style | ✓          |
| 17  | Embarrassed | Delivery Style | ✓          |
| 18  | Frustrated  | Delivery Style | ✓          |
| 19  | Proud       | Delivery Style | ✓          |
| 20  | Tender      | Delivery Style | ✓          |
| 21  | Sweet       | Delivery Style | ✓          |
| 22  | Outgoing    | Delivery Style | ✓          |
| 23  | Serious     | Delivery Style | ✓          |
| 24  | Arrogant    | Delivery Style | ✓          |
| 25  | Elderly     | Delivery Style | ✓          |
| 26  | Shouting    | Delivery Style | ✓          |
| 27  | Sarcastic   | Delivery Style | ✓          |
| 28  | Stuttering  | Delivery Style | ✓          |

## Output Format

StepFun TTS models support audio output in `wav`, `mp3`, `flac`, `opus`, and `pcm` formats. The default format is `mp3`. You can choose the format that best suits your use case.

## Output Languages

StepFun TTS models support generating audio in Chinese, English, mixed Chinese-English, and Japanese.

## FAQ

**Do I own the audio I generate?**

Yes. You own the audio you create. However, we recommend informing your end users that the audio was generated by AI so they are aware of its nature.

**How do I adjust the volume of the generated audio?**

You can set the `volume` parameter when calling the generation API. Valid values range from 0.1 to 2.0, representing 10% volume to 200% volume.

**How do I adjust the speaking rate of the generated audio?**

You can set the `speed` parameter when calling the generation API. Valid values range from 0.5 to 2.0, representing half-speed to double-speed.
