Endpoint
POST https://api.stepfun.ai/v1/audio/transcriptions
Request parameters
- model (string, required)
  Model name, fixed to step-asr.
- response_format (string, required)
  Output format. Supported: json, text, srt, vtt.
- file (File, required)
  Audio file. Supported formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm, aac, opus. File size must be under 100 MB.
- hotwords (string, optional)
  Hotword list as a JSON string that parses to an array, e.g., ["1","2","3","4","abc"].
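The parameters above can be assembled client-side before anything is sent. The following is a minimal Python sketch, not an official client. It assumes the endpoint accepts a multipart/form-data upload and a Bearer token in the Authorization header (neither is stated on this page), that "100 MB" means 100 * 1024 * 1024 bytes, and it uses the third-party requests library. The helper names build_form_fields, check_audio_file, and transcribe are hypothetical.

```python
import json
import os

ENDPOINT = "https://api.stepfun.ai/v1/audio/transcriptions"
SUPPORTED_RESPONSE_FORMATS = {"json", "text", "srt", "vtt"}
SUPPORTED_AUDIO_EXTENSIONS = {
    "flac", "mp3", "mp4", "mpeg", "mpga", "m4a",
    "ogg", "wav", "webm", "aac", "opus",
}
# Assumption: "100 MB" is interpreted as 100 * 1024 * 1024 bytes.
MAX_FILE_BYTES = 100 * 1024 * 1024


def build_form_fields(response_format, hotwords=None):
    """Return the non-file form fields for a transcription request."""
    if response_format not in SUPPORTED_RESPONSE_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    fields = {"model": "step-asr", "response_format": response_format}
    if hotwords is not None:
        # hotwords is sent as a JSON string that parses to an array.
        fields["hotwords"] = json.dumps(list(hotwords), ensure_ascii=False)
    return fields


def check_audio_file(path):
    """Reject files whose extension or size falls outside the documented limits."""
    extension = os.path.splitext(path)[1].lstrip(".").lower()
    if extension not in SUPPORTED_AUDIO_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {extension!r}")
    if os.path.getsize(path) >= MAX_FILE_BYTES:
        raise ValueError("file size must be under 100 MB")


def transcribe(path, api_key, response_format="json", hotwords=None):
    """Upload one audio file and return the raw HTTP response."""
    import requests  # third-party; imported here so the helpers above need only the stdlib

    check_audio_file(path)
    with open(path, "rb") as audio:
        response = requests.post(
            ENDPOINT,
            # Assumption: Bearer-token authentication.
            headers={"Authorization": f"Bearer {api_key}"},
            data=build_form_fields(response_format, hotwords),
            files={"file": audio},
            timeout=300,
        )
    response.raise_for_status()
    return response
```

A call such as transcribe("meeting.mp3", api_key, "srt", ["StepFun"]) would then send all four parameters in one request. Validating locally avoids uploading a large file only to have it rejected.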
Response
Returns the transcription in the requested response_format.
- json
- text
- srt
- vtt
text (string)
Recognized text.
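Because the body differs by response_format, a client needs one branch per format to recover the recognized text. The sketch below is illustrative only. It assumes the json body is an object with the text field described above, that the text format returns the transcript as a plain body, and that srt and vtt bodies follow the standard SubRip and WebVTT cue layouts. The function name extract_text is hypothetical.

```python
import json


def extract_text(body, response_format):
    """Return the recognized text from a response body in the given format."""
    if response_format == "json":
        return json.loads(body)["text"]
    if response_format == "text":
        return body.strip()
    if response_format in ("srt", "vtt"):
        cue_lines = []
        # Cues are separated by blank lines; the text follows the timing line.
        for block in body.replace("\r\n", "\n").split("\n\n"):
            lines = block.strip().split("\n")
            for index, line in enumerate(lines):
                if "-->" in line:
                    cue_lines.extend(l for l in lines[index + 1:] if l.strip())
                    break
        return " ".join(cue_lines)
    raise ValueError(f"unsupported response_format: {response_format!r}")
```

This drops the timing information for srt and vtt; callers that need timestamps should keep the original body instead.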
Example
- curl