Pricing and Rate Limits

Pricing Details

Pricing for Reasoning Models

Model            Billing Unit   Input Price   Output Price
step-3.5-flash   1M tokens      $0.1          $0.3
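
As a rough illustration of how the per-token prices above translate into a charge, the sketch below estimates the cost of a single request. It assumes billing is strictly proportional to token counts, with no rounding or minimum charge (the page does not say either way); the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Prices per 1M tokens for step-3.5-flash, taken from the pricing table.
INPUT_PRICE_PER_M = Decimal("0.1")
OUTPUT_PRICE_PER_M = Decimal("0.3")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the charge in dollars for one request.

    Assumption: cost scales linearly with token counts and is not rounded.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Example: a request with 2,000 input tokens and 500 output tokens.
print(estimate_cost(2_000, 500))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift.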

Pricing for Speech Models

Model        Model Type                             Unit Price
step-tts-2   Next-generation text-to-speech model   $50/M characters
step-tts-2   Voice cloning model                    Free (limited time)

Characters are counted as follows: one Chinese character counts as one character, two English letters count as one character, and two punctuation marks count as one character.
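
The counting rule can be sketched in code as follows. This is only an approximation under stated assumptions: "Chinese character" is taken to mean the CJK Unified Ideographs block plus Extension A, "punctuation" is taken to mean any Unicode punctuation category, and digits, whitespace, and other symbols are not counted because the rule above does not mention them. The helper names are hypothetical, and the platform's actual counting may differ.

```python
import unicodedata

PRICE_PER_M_CHARACTERS = 50  # dollars per 1M characters, from the speech table


def is_chinese(ch: str) -> bool:
    """Assumption: CJK Unified Ideographs and Extension A count as Chinese."""
    return "\u4e00" <= ch <= "\u9fff" or "\u3400" <= ch <= "\u4dbf"


def billable_characters(text: str) -> float:
    """Count billable characters in half-character units, then halve."""
    half_units = 0
    for ch in text:
        if is_chinese(ch):
            half_units += 2  # one Chinese character = one character
        elif ch.isascii() and ch.isalpha():
            half_units += 1  # two English letters = one character
        elif unicodedata.category(ch).startswith("P"):
            half_units += 1  # two punctuation marks = one character
        # Assumption: digits, spaces, and other symbols are not counted.
    return half_units / 2


def estimate_tts_cost(text: str) -> float:
    """Estimate the text-to-speech charge in dollars for one input text."""
    return billable_characters(text) * PRICE_PER_M_CHARACTERS / 1_000_000


print(billable_characters("Hi, 你好!"))
```

How an odd number of letters or punctuation marks is rounded is not stated, so the sketch keeps the half character rather than rounding it.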

Tiered Rate Limits

Top-Up and Rate Limits Table

To ensure fair overall resource allocation and prevent abuse, we apply rate limits based on your account’s cumulative top-up amount. Details are below:

User Tier   Cumulative Top-Up Amount   Concurrency   RPM     TPM
V0          $0                         1             6       30,000
V1          $15                        3             60      42,000
V2          $70                        7             120     120,000
V3          $300                       20            420     360,000
V4          $700                       40            840     960,000
V5          $1,500                     70            1,400   1,600,000
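
To find the limits that apply to a given account, the table can be treated as a threshold lookup. The sketch below reads each row as (tier, cumulative top-up amount, concurrency, RPM, TPM) and assumes that reaching the listed amount is enough to enter that tier; the `Tier` type and `tier_for` function are hypothetical names for illustration.

```python
from bisect import bisect_right
from typing import NamedTuple


class Tier(NamedTuple):
    name: str
    min_top_up: int   # cumulative top-up in dollars needed for this tier
    concurrency: int  # simultaneous requests
    rpm: int          # requests per minute
    tpm: int          # tokens per minute


# Rows copied from the rate limits table, ordered by top-up amount.
TIERS = [
    Tier("V0", 0, 1, 6, 30_000),
    Tier("V1", 15, 3, 60, 42_000),
    Tier("V2", 70, 7, 120, 120_000),
    Tier("V3", 300, 20, 420, 360_000),
    Tier("V4", 700, 40, 840, 960_000),
    Tier("V5", 1_500, 70, 1_400, 1_600_000),
]
THRESHOLDS = [tier.min_top_up for tier in TIERS]


def tier_for(cumulative_top_up: float) -> Tier:
    """Return the highest tier whose threshold the amount has reached."""
    if cumulative_top_up < 0:
        raise ValueError("cumulative top-up cannot be negative")
    return TIERS[bisect_right(THRESHOLDS, cumulative_top_up) - 1]


print(tier_for(100).name)
```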

Definitions

  • Concurrency: the maximum number of requests that can be in progress at the same time
  • RPM: requests per minute, the maximum number of requests you can send to us per minute
  • TPM: tokens per minute, the maximum number of tokens you can exchange with us per minute
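
A client can stay under the RPM and TPM limits by tracking its own usage before sending each request. The sketch below is one possible client-side approach using a sliding 60-second window; the page does not say how the service itself measures a minute or which tokens count toward TPM, so the class `SlidingWindowLimiter` and its behavior are assumptions for illustration only.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side guard for RPM and TPM over a trailing time window."""

    def __init__(self, rpm: int, tpm: int, window: float = 60.0,
                 clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.window = window
        self.clock = clock
        self.events = deque()      # (timestamp, tokens) per admitted request
        self.tokens_in_window = 0

    def _evict_expired(self, now: float) -> None:
        # Drop requests that have aged out of the window.
        while self.events and now - self.events[0][0] >= self.window:
            _, tokens = self.events.popleft()
            self.tokens_in_window -= tokens

    def try_acquire(self, tokens: int) -> bool:
        """Record the request and return True if it fits both limits."""
        now = self.clock()
        self._evict_expired(now)
        if len(self.events) >= self.rpm:
            return False  # would exceed requests per minute
        if self.tokens_in_window + tokens > self.tpm:
            return False  # would exceed tokens per minute
        self.events.append((now, tokens))
        self.tokens_in_window += tokens
        return True


# Example with V1-style limits: 60 requests and 42,000 tokens per minute.
limiter = SlidingWindowLimiter(rpm=60, tpm=42_000)
print(limiter.try_acquire(tokens=1_200))
```

The concurrency limit can be handled separately, for example by wrapping each request in a `threading.BoundedSemaphore` sized to the tier's concurrency value.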

Notes

  • Our default rate limits are intended to allocate resources more fairly. If you need higher or more stable limits, please contact our staff in advance; we will respond within two business days. Contact email: platform@stepfun.com
  • We will do our best to ensure normal usage, but when resources reach capacity, we may temporarily throttle requests and adjust rate limits.
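
Because temporary throttling is possible, clients may want to retry throttled requests with increasing delays rather than immediately. The sketch below shows exponential backoff with random jitter. How the service signals throttling is not described on this page, so `ThrottledError` is a hypothetical exception that your own client code would raise when it detects a throttled response, and `send` is any function that performs one request.

```python
import random
import time


class ThrottledError(Exception):
    """Hypothetical: raised by your client code when a request is throttled."""


def call_with_backoff(send, max_attempts: int = 5, base_delay: float = 1.0,
                      max_delay: float = 30.0, sleep=time.sleep):
    """Call send(), retrying on ThrottledError with exponential backoff."""
    for attempt in range(max_attempts):
        try:
            return send()
        except ThrottledError:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the error to the caller
            # Double the delay ceiling each attempt, capped at max_delay.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Random jitter spreads out retries from many clients.
            sleep(random.uniform(0, ceiling))
```

The default values here are placeholders, not recommendations from the platform.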