Skip to main content

Documentation Index

Fetch the complete documentation index at: https://platform.stepfun.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

Pricing Details

Pricing for Reasoning Models

ModelBilling UnitInput (Cache Miss)Input (Cache Hit)Output Price
step-3.5-flash-26031M tokens$0.10$0.02$0.30
step-3.5-flash1M tokens$0.10$0.02$0.30

Pricing for Speech Models

ModelModel TypeUnit Price
stepaudio-2.5-ttsContextual text-to-speech model$0.85 / 10,000 characters
step-tts-2Text-to-speech model$0.40 / 10,000 characters
stepaudio-2.5-tts / step-tts-2Voice cloning model$1.50 / voice
stepaudio-2.5-asrSpeech recognition model$0.022 / hour
Here, one Chinese character counts as one character, two English letters count as one character, and two punctuation marks count as one character.

Tiered Rate Limits

Top-Up and Rate Limits Table

To ensure fair overall resource allocation and prevent abuse, we apply rate limits based on your account’s cumulative top-up amount. Details are below:
User TierCumulative Top-Up AmountConcurrencyRPMTPM
V0$05105,000,000
V1$151001,00020,000,000
V2$702005,00030,000,000
V3$30040010,00040,000,000
V4$7001,00020,00050,000,000
V5$1,50010,000200,000100,000,000

Definitions

  • Concurrency: number of requests at the same time
  • RPM: request per minute, the maximum number of requests you can make to us per minute
  • TPM: token per minute, the maximum number of tokens you can interact with us per minute

Notes

  • Our default rate limits are intended to allocate resources more fairly. If you believe you need higher and more stable limits, please contact our staff in advance; we will respond within two business days. Contact email: platform@stepfun.com
  • We will do our best to ensure normal usage, but when resources reach capacity, we may take temporary throttling measures and adjust rate limits.