Stepfun’sDocumentation Index
Fetch the complete documentation index at: https://platform.stepfun.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
step-3.7-flash model supports video understanding. Pass a video link in the conversation context and the model will read the video, answer questions about it, or use it for generation.
Video uploads support three forms: a directly accessible video URL, inline Base64 (data:video/mp4;base64,...), or astepfile://reference after uploading via the Files API. Supported container formats are MP4, QuickTime, and Matroska.
Using step-3.7-flash
step-3.7-flash natively supports multimodal input. Use the Chat API and include a video_url item along with your prompt in the user message—the model will read the video and generate based on its content.
Lower costs with prompt caching
Notes
- Place the video before the instruction in the message to improve results.
- Downloading and safety checks take time. Design UI feedback so users know the request is in progress.
step-3.7-flashcurrently supports a single MP4 video under 128MB. For larger or other formats, split the video into sub-128MB MP4s with ffmpeg.- Because the server downloads your video, network speed affects latency. Host videos on fast, publicly accessible storage (e.g., object storage with CDN).
FAQ
Speed up video understanding with the Files API
If you pass an external URL, Stepfun must fetch it, so download speed affects generation time. Host the video on CDN or high-bandwidth storage for faster downloads. If you reuse the same video (e.g., for few-shot), upload it via the Files API withpurpose=storage to avoid repeated downloads and bandwidth costs.

stepfile:// when referencing it in chat messages so the model knows to fetch the video from Stepfun file storage, reducing download time and improving overall latency.
ffmpeg slicing tips
Split a video file
If a file exceeds 128MB, split it into multiple clips and summarize each clip for context, so you can discuss the full video. Example: splitsample.mp4 into 120-second segments.
Convert to MP4
Convert other formats to MP4 before sending. Example: convertsample.mkv to sample.mp4: