Overview
Video generation is asynchronous. You submit a request, receive a task ID andpoll_url, then poll for the result.
For the most reliable polling behavior, follow the exact
poll_url returned by the create request.Request Body
Video model ID. API default is
sora-2. See the Video Generation Guide for the current model matrix and supported capabilities.Text description of the video to generate.
Publicly accessible URL of the starting image for image-to-video (I2V) generation. For best cross-model compatibility, prefer
image_url.Inline image as a data URL (for example,
data:image/jpeg;base64,...). Supported by compatible models, but image_url provides the broadest compatibility across video models.Video duration in seconds (model-dependent).
Aspect ratio (e.g.,
16:9, 9:16, 1:1).Video resolution (e.g.,
1080p, 720p).Frames per second (1-120).
What to avoid in the video generation.
Random seed for reproducible generation.
Classifier-free guidance scale (0-20).
Motion intensity (0-1).
URL of starting frame image.
URL of ending frame image.
A unique identifier for the end-user.
Response
Unique task identifier for polling.
Recommended polling URL for this task. Use this exact path when checking status.
Initial status:
pending.Model used.
Image to Video
Keyframe Control
Usestart_image and end_image to control the first and last frames (supported by Vidu Q2 Pro):
Available Models
The model list is updated frequently. For the latest available models and pricing, visit the Models page.
Sora (OpenAI)
| Model | Capabilities |
|---|---|
sora-2 | T2V (default) |
sora-2-pro | T2V, higher quality |
sora-2-characters | T2V, character-focused |
Kling (Kuaishou)
| Model | Capabilities |
|---|---|
kling-v2.6-pro | T2V, latest gen |
kling-v2.6-std | T2V, fast |
kling-v2.5-turbo-std | T2V, turbo tier |
kling-v2.1-master | T2V, I2V, master quality |
kling-v2.1-pro | T2V, I2V |
kling-video | T2V, I2V |
kling-video-extend | Video extension |
kling-video-o1-pro | T2V, O1 reasoning |
kling-effects | Visual effects |
Veo (Google)
| Model | Capabilities |
|---|---|
veo2 | T2V |
veo3.1 | T2V, I2V, keyframe |
veo3.1-pro | T2V, I2V, keyframe, professional |
veo3.1-4k | T2V, I2V, keyframe, 4K output |
veo3.1-fast | T2V, I2V, keyframe, fast |
veo3 | T2V, I2V, previous gen |
Seedance (ByteDance)
| Model | Capabilities |
|---|---|
seedance-2-0 | T2V, I2V, keyframe, extension, editing |
seedance-1-5-pro | T2V, I2V |
seedance-1-0-pro | T2V, I2V |
seedance-1-0-lite-t2v | T2V, lightweight |
seedance-1-0-lite-i2v | I2V, lightweight |
Other Providers
| Model | Provider | Capabilities |
|---|---|---|
hailuo-2.3 | MiniMax | T2V, I2V |
hailuo-2.3-pro | MiniMax | T2V, I2V, higher quality |
hailuo-2.3-fast | MiniMax | I2V, fast tier |
wan-2.6 | Alibaba | T2V |
wan2.6-i2v | Alibaba | I2V |
runwayml-gen4-turbo-5 | Runway | I2V (5s) |
runwayml-gen4-turbo-10 | Runway | I2V (10s) |
luma-video-api | Luma | T2V |
luma-video-extend-api | Luma | Video extension |
viduq3-pro | Shengshu | T2V, I2V, keyframe |
viduq3-turbo | Shengshu | T2V, I2V, keyframe, turbo tier |
viduq2-pro | Shengshu | I2V, keyframe, reference |
grok-video-3 | xAI | T2V |
higgsfield-turbo | Higgsfield | T2V |