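Create video

Overview

Video generation is asynchronous. After you submit a request, you receive a task ID and a poll_url, then poll for the result.

The create response returns the canonical async identifier id and usually echoes task_id as well. Poll poll_url first; if you need a fixed status endpoint, use GET /v1/tasks/{id}.

If the create response includes poll_url, use that exact URL; it gives the most reliable polling behavior. When it points to /v1/tasks/{id}, treat that path as the canonical fixed status endpoint.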
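The polling flow described above can be sketched roughly as follows. This is a minimal sketch, not an official client. The host comes from the example request on this page; the fetch_status callable, the polling interval, the attempt limit, and the terminal status names (completed, failed) are assumptions, since this page only documents the initial pending status.

```python
import time
from urllib.parse import urljoin

BASE_URL = "https://api.lemondata.cc"  # host used in the examples on this page

# Assumption: terminal status names are not documented on this page.
TERMINAL_STATUSES = {"completed", "failed"}


def resolve_poll_url(create_response, base_url=BASE_URL):
    """Prefer the returned poll_url; otherwise fall back to /v1/tasks/{id}."""
    poll_url = create_response.get("poll_url")
    if not poll_url:
        task_id = create_response.get("id") or create_response["task_id"]
        poll_url = f"/v1/tasks/{task_id}"
    # urljoin resolves relative paths and leaves absolute URLs untouched.
    return urljoin(base_url, poll_url)


def wait_for_task(create_response, fetch_status, interval=5.0, max_attempts=120):
    """Poll until the task reaches a terminal status or attempts run out.

    fetch_status is a caller-supplied (hypothetical) function that performs
    an authenticated GET on the URL and returns the decoded JSON body.
    """
    url = resolve_poll_url(create_response)
    for _ in range(max_attempts):
        task = fetch_status(url)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        time.sleep(interval)
    raise TimeoutError(f"task did not finish after {max_attempts} polls: {url}")
```

With the requests library, fetch_status could be as small as lambda url: requests.get(url, headers=auth_headers, timeout=30).json(), where auth_headers carries the same Authorization header as the create request.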

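Whether audio output is enabled depends on the model. In LemonData, the Veo 3 family treats audio as enabled by default when output_audio is omitted; when a model supports audio control, set output_audio explicitly to toggle it. For compatibility with older requests, the camelCase alias outputAudio is also accepted.

In production integrations, prefer publicly reachable https URLs for image, video, and audio inputs. Compatible models still accept inline data: URLs, but large base64 payloads are usually harder to work with when retrying, monitoring, and troubleshooting.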

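Request body

model

string

Default: "sora-2"

Video model ID. The API default is sora-2. For the current public video model matrix and supported capabilities, see the video generation guide.

prompt

string

Required

Text description of the video to generate. Most public video models require this field.

operation

string

The video operation to perform. The public contract supports text-to-video, image-to-video, reference-to-video, start-end-to-video, video-to-video, video-extension, audio-to-video, and motion-control. LemonData can infer the operation from the inputs, but passing operation explicitly is still recommended in production.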

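image_url

string

URL of the starting image for image-to-video. For the broadest cross-model compatibility, prefer image_url.

image

string

An image provided as an inline data URL (for example, data:image/jpeg;base64,...). Compatible models support this form, but image_url is more widely compatible.

reference_images

array

Reference image inputs for reference-to-video. Provide at most 3. Prefer public https URLs; compatible models also accept inline data: URLs.

reference_image_type

string

Optional reference image role, for models that distinguish between asset and style references.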

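video_url

string

Public URL of the source video. The current public video-to-video flows and motion-control models both require this field.

audio_url

string

Public audio URL for audio-to-video models.

task_id

string

Provider-side task ID used by some continuation, extension, or derivative flows.

extend_at

integer

Model-specific extension start point used by some video-extension flows.

extend_times

string

Model-specific extension count or multiplier used by some video-extension flows.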

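duration

integer

Video duration in seconds. Supported values depend on the model.

aspect_ratio

string

Aspect ratio, for example 16:9, 9:16, or 1:1.

resolution

string

Output resolution, for example 720p, 1080p, or 4k. Supported values depend on the model.

output_audio

boolean

Model-dependent audio output switch. In LemonData, the Veo 3 family treats this field as true when it is omitted; other public video models follow their own managed defaults. For compatibility with older requests, the camelCase alias outputAudio is also accepted.

fps

integer

Frames per second (1-120). Takes effect only when the model publicly supports FPS control.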

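negative_prompt

string

Content to avoid in the generated video.

seed

integer

Random seed for reproducible generation.

cfg_scale

number

Prompt adherence strength (0-20). Takes effect only when the public model supports this control.

motion_strength

number

Motion strength (0-1). Takes effect only when the public model supports this field.

start_image

string

Start-frame image URL or compatible image input for start-end-to-video.

end_image

string

End-frame image URL or compatible image input for start-end-to-video.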

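size

string

Size preset parameter used by some OpenAI-compatible video models.

watermark

boolean

Watermark switch exposed by some models.

effect_type

string

Model-specific effect selector used by some effects or editing flows.

user

string

Unique identifier for the end user.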
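The documented limits in the field list above (fps 1-120, cfg_scale 0-20, motion_strength 0-1, at most 3 reference images, prompt marked as required) can be checked on the client before a request is sent. The helper below is an illustrative sketch; validate_payload is a hypothetical name, and the server remains the authority on what a given model accepts.

```python
# Ranges quoted from the field descriptions; each applies only when the
# chosen model supports the control.
NUMERIC_RANGES = {
    "fps": (1, 120),
    "cfg_scale": (0, 20),
    "motion_strength": (0, 1),
}
MAX_REFERENCE_IMAGES = 3


def validate_payload(payload):
    """Return a list of human-readable problems; empty means none found."""
    problems = []
    if not payload.get("prompt"):
        problems.append("prompt is required")
    for field, (low, high) in NUMERIC_RANGES.items():
        value = payload.get(field)
        if value is not None and not (low <= value <= high):
            problems.append(f"{field} must be between {low} and {high}, got {value}")
    refs = payload.get("reference_images") or []
    if len(refs) > MAX_REFERENCE_IMAGES:
        problems.append(
            f"reference_images accepts at most {MAX_REFERENCE_IMAGES} items, got {len(refs)}"
        )
    return problems
```

Returning a list instead of raising on the first problem lets a caller report every issue in one pass.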

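Compatibility notes

The canonical fields in the public contract use snake_case: reference_images, reference_image_type, output_audio.
For compatibility with existing callers, LemonData also accepts the camelCase aliases referenceImages, referenceImageType, and outputAudio.
If operation is omitted, LemonData infers it from the inputs; passing it explicitly is still recommended in production.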
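Callers that still build payloads with the camelCase aliases may want to normalize them to the canonical snake_case names before sending. The following is a minimal sketch; to_canonical is a hypothetical helper, and letting the snake_case value win when both spellings are present is a choice made for this example, not documented server behavior.

```python
# camelCase aliases named in the compatibility notes, mapped to canonical names.
ALIASES = {
    "referenceImages": "reference_images",
    "referenceImageType": "reference_image_type",
    "outputAudio": "output_audio",
}


def to_canonical(payload):
    """Return a copy of payload with camelCase aliases renamed to snake_case."""
    result = {}
    for key, value in payload.items():
        canonical = ALIASES.get(key, key)
        if canonical in result and key != canonical:
            continue  # canonical spelling already set; ignore the alias
        result[canonical] = value
    return result
```

For example, to_canonical({"model": "veo3.1", "outputAudio": False}) yields a payload that carries output_audio instead of the alias.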

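Input best practices

For image_url, reference_images, video_url, and audio_url, prefer publicly reachable https URLs.
Avoid mixing inline base64 and remote URLs in the same request; a single representation is easier to troubleshoot and retry.
If you use signed URLs, make sure they stay valid long enough to cover the retry window and the asynchronous task creation process.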
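The first two practices above lend themselves to a quick client-side check. The sketch below is one possible shape; review_media_inputs and the wording of its warnings are hypothetical, and the check also looks at image, start_image, and end_image, which goes slightly beyond the four fields named above.

```python
MEDIA_FIELDS = ("image_url", "image", "video_url", "audio_url", "start_image", "end_image")


def classify(value):
    """Label a media input as 'https', 'inline' (data: URL), or 'other'."""
    if value.startswith("https://"):
        return "https"
    if value.startswith("data:"):
        return "inline"
    return "other"


def review_media_inputs(payload):
    """Return warnings for inputs that go against the practices above."""
    values = [payload[field] for field in MEDIA_FIELDS if payload.get(field)]
    values += list(payload.get("reference_images") or [])
    kinds = {classify(value) for value in values}
    warnings = []
    if "other" in kinds:
        warnings.append("use publicly reachable https URLs for media inputs")
    if "inline" in kinds and len(kinds) > 1:
        warnings.append("avoid mixing inline data: URLs with remote URLs")
    return warnings
```

Signed-URL lifetime is not checked here because expiry encoding differs between storage providers.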

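Response

id

string

Canonical async task ID.

task_id

string

Unique task identifier used for polling.

poll_url

string

Recommended polling URL for this task. Use this exact path when checking status.

status

string

Initial status: pending.

created

integer

Unix timestamp of task creation.

model

string

The model used.

video_url

string

Directly usable video URL, present once the result is ready.

video

object

A single video object, returned when available, containing url, duration, width, and height.

videos

array

An array of videos, which may appear when the upstream returns multiple outputs.

error

string

Error message or structured error object returned when the task fails.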

curl -X POST "https://api.lemondata.cc/v1/videos/generations" \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "sora-2",
    "prompt": "A cat walking through a garden, cinematic lighting",
    "operation": "text-to-video",
    "duration": 4,
    "aspect_ratio": "16:9"
  }'

{
  "id": "ldtask_aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa",
  "task_id": "ldtask_aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa",
  "poll_url": "/v1/tasks/ldtask_aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa",
  "status": "pending",
  "model": "sora-2",
  "created": 1706000000
}
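Because a finished task may expose its output through video_url, video.url, or a videos array, a small helper can gather whichever is present. This is a sketch under assumptions: the shape of each videos item is not specified on this page, so the code accepts either an object with a url key or a bare string.

```python
def extract_video_urls(task):
    """Collect every video URL found in a task payload, without duplicates."""
    urls = []

    def add(url):
        if url and url not in urls:
            urls.append(url)

    add(task.get("video_url"))
    video = task.get("video")
    if isinstance(video, dict):
        add(video.get("url"))
    for item in task.get("videos") or []:
        # Assumption: items are objects with a url key or bare URL strings.
        add(item.get("url") if isinstance(item, dict) else item)
    return urls
```

For the pending response shown above, the helper returns an empty list, which is a convenient signal to keep polling.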

Image-to-video

import requests

# Animate a still image: image_url supplies the starting frame.
response = requests.post(
    "https://api.lemondata.cc/v1/videos/generations",
    headers={"Authorization": "Bearer sk-your-api-key"},
    json={
        "model": "hailuo-2.3-standard",
        "prompt": "The scene begins from the provided image and adds gentle natural motion.",
        "operation": "image-to-video",
        "image_url": "https://example.com/image.jpg",
        "duration": 6,
        "aspect_ratio": "16:9"
    }
)
print(response.json())  # includes id, task_id, and poll_url for polling
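When no public URL is available, compatible models accept the image field as an inline data: URL instead of image_url. The helper below sketches how such a value could be built from raw bytes; to_data_url is a hypothetical name, and guessing the MIME type from the file name is a convenience of this example.

```python
import base64
import mimetypes


def to_data_url(data, filename):
    """Encode raw image bytes as a data: URL, guessing the MIME type from the name."""
    mime, _ = mimetypes.guess_type(filename)
    if mime is None or not mime.startswith("image/"):
        raise ValueError(f"cannot determine an image MIME type for {filename!r}")
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{mime};base64,{encoded}"
```

The result can be passed as the image field of the request body. Base64 inflates the payload by roughly a third, which is one reason https URLs are preferred for large inputs.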

Reference-to-video

When a model supports dedicated reference image conditioning, use operation=reference-to-video. In LemonData's public contract, reference images are always passed through reference_images.

import requests

# Condition generation on up to 3 reference images.
response = requests.post(
    "https://api.lemondata.cc/v1/videos/generations",
    headers={"Authorization": "Bearer sk-your-api-key"},
    json={
        "model": "veo3.1",
        "prompt": "Keep the same subject identity and palette while adding subtle motion.",
        "operation": "reference-to-video",
        "reference_images": [
            "https://example.com/ref-a.jpg",
            "https://example.com/ref-b.jpg"
        ],
        "reference_image_type": "asset",
        "duration": 8,
        "resolution": "720p",
        "aspect_ratio": "9:16"
    }
)
print(response.json())  # includes id, task_id, and poll_url for polling

Start and end frame control

Use start_image and end_image to control the first and last frames:

import requests

# The model interpolates between the two supplied frames.
response = requests.post(
    "https://api.lemondata.cc/v1/videos/generations",
    headers={"Authorization": "Bearer sk-your-api-key"},
    json={
        "model": "viduq2-pro",
        "prompt": "Smooth transition from day to night",
        "operation": "start-end-to-video",
        "start_image": "https://example.com/day.jpg",
        "end_image": "https://example.com/night.jpg",
        "duration": 5,
        "resolution": "720p",
        "aspect_ratio": "16:9"
    }
)
print(response.json())  # includes id, task_id, and poll_url for polling

Video-to-video

When a model accepts an existing video as its primary input, use operation=video-to-video.

import requests

# video_url points at the source clip to transform.
response = requests.post(
    "https://api.lemondata.cc/v1/videos/generations",
    headers={"Authorization": "Bearer sk-your-api-key"},
    json={
        "model": "topaz-video-upscale",
        "operation": "video-to-video",
        "video_url": "https://example.com/source.mp4",
        "prompt": "Upscale the clip while preserving the original motion.",
        "resolution": "1080p"
    }
)
print(response.json())  # includes id, task_id, and poll_url for polling

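Motion control

When a model needs both a subject image and a motion reference video, use operation=motion-control. LemonData maps the public image_url + video_url request shape onto the upstream motion control contract.

import requests

# image_url is the subject; video_url is the motion reference.
response = requests.post(
    "https://api.lemondata.cc/v1/videos/generations",
    headers={"Authorization": "Bearer sk-your-api-key"},
    json={
        "model": "kling-3.0-motion-control",
        "operation": "motion-control",
        "prompt": "Keep the subject stable while following the motion reference.",
        "image_url": "https://example.com/subject.png",
        "video_url": "https://example.com/motion.mp4",
        "resolution": "720p"
    }
)
print(response.json())  # includes id, task_id, and poll_url for polling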
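Each operation on this page implies a set of inputs, for instance motion-control needs both image_url and video_url. A pre-flight check along the following lines can catch a missing input before the request is sent. The mapping is one reading of the field descriptions on this page, not an official schema, and missing_inputs is a hypothetical helper; individual models may be stricter or more lenient.

```python
# Inputs implied by each operation. Every tuple is a group of alternatives:
# at least one field in the group must be present.
REQUIRED_INPUTS = {
    "image-to-video": [("image_url", "image")],
    "reference-to-video": [("reference_images",)],
    "start-end-to-video": [("start_image",), ("end_image",)],
    "video-to-video": [("video_url",)],
    "audio-to-video": [("audio_url",)],
    "motion-control": [("image_url",), ("video_url",)],
}


def missing_inputs(payload):
    """Return the input groups the declared operation needs but the payload lacks."""
    operation = payload.get("operation")
    missing = []
    # Operations without an entry (or an omitted operation) have no checks here.
    for alternatives in REQUIRED_INPUTS.get(operation, []):
        if not any(payload.get(field) for field in alternatives):
            missing.append(" or ".join(alternatives))
    return missing
```

An empty list means the payload carries every input this sketch knows to look for.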

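Audio-to-Video and Video Extension: current availability

LemonData's public contract accepts the audio-to-video and video-extension operation values for model-specific flows. However, the list of generally enabled public video models that this version of the documentation reflects contains no broadly enabled public model that explicitly offers either capability. Before integrating, confirm live availability through the Models API or the model pages.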

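Currently enabled public video models

The list below matches the master table of enabled video models at the time this documentation was built. For the latest status, query the Models API first.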

OpenAI

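Model	Public operations
`sora-2`	text-to-video, image-to-video
`sora-2-pro`	text-to-video, image-to-video
`sora-2-pro-storyboard`	image-to-video

Kuaishou

Model	Public operations
`kling-3.0-motion-control`	motion-control
`kling-3.0-video`	text-to-video, image-to-video, start-end-to-video
`kling-v2.5-turbo-pro`	text-to-video, image-to-video, start-end-to-video
`kling-v2.5-turbo-std`	text-to-video, image-to-video
`kling-v2.6-pro`	text-to-video, image-to-video, start-end-to-video
`kling-v2.6-std`	text-to-video, image-to-video
`kling-v3.0-pro`	text-to-video, image-to-video, start-end-to-video
`kling-v3.0-std`	text-to-video, image-to-video, start-end-to-video
`kling-video-o1-pro`	text-to-video, image-to-video, reference-to-video, start-end-to-video, video-to-video
`kling-video-o1-std`	text-to-video, image-to-video, reference-to-video, start-end-to-video, video-to-video

Google

Model	Public operations
`veo3`	text-to-video, image-to-video
`veo3-fast`	text-to-video, image-to-video
`veo3-pro`	text-to-video, image-to-video
`veo3.1`	text-to-video, image-to-video, reference-to-video, start-end-to-video
`veo3.1-fast`	text-to-video, image-to-video, reference-to-video, start-end-to-video
`veo3.1-pro`	text-to-video, image-to-video, start-end-to-video

ByteDance

Model	Public operations
`seedance-1.5-pro`	text-to-video, image-to-video

MiniMax

Model	Public operations
`hailuo-2.3-fast`	image-to-video
`hailuo-2.3-pro`	text-to-video, image-to-video
`hailuo-2.3-standard`	text-to-video, image-to-video

Alibaba

Model	Public operations
`wan-2.2-plus`	text-to-video, image-to-video
`wan-2.5`	text-to-video, image-to-video
`wan-2.6`	text-to-video, image-to-video, reference-to-video

Shengshu

Model	Public operations
`viduq2`	text-to-video, reference-to-video
`viduq2-pro`	image-to-video, reference-to-video, start-end-to-video
`viduq2-pro-fast`	image-to-video, start-end-to-video
`viduq2-turbo`	image-to-video, start-end-to-video
`viduq3-pro`	text-to-video, image-to-video, start-end-to-video
`viduq3-turbo`	text-to-video, image-to-video, start-end-to-video

xAI

Model	Public operations
`grok-imagine-image-to-video`	image-to-video
`grok-imagine-text-to-video`	text-to-video
`grok-imagine-upscale`	video-to-video

Other

Model	Public operations
`topaz-video-upscale`	video-to-video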
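For a quick offline check, an excerpt of the tables above can be kept as a lookup. The snapshot below copies only a handful of rows and will go stale as the model list changes, so treat it as a convenience sketch and defer to the Models API for live status. The supports function is a hypothetical helper.

```python
# Excerpt of the tables above (snapshot at documentation build time).
SUPPORTED_OPERATIONS = {
    "sora-2": {"text-to-video", "image-to-video"},
    "veo3.1": {
        "text-to-video",
        "image-to-video",
        "reference-to-video",
        "start-end-to-video",
    },
    "hailuo-2.3-fast": {"image-to-video"},
    "viduq2-pro": {"image-to-video", "reference-to-video", "start-end-to-video"},
    "kling-3.0-motion-control": {"motion-control"},
    "topaz-video-upscale": {"video-to-video"},
}


def supports(model, operation):
    """Return True or False for models in the snapshot, None for unknown models."""
    operations = SUPPORTED_OPERATIONS.get(model)
    if operations is None:
        return None
    return operation in operations
```

Returning None for models outside the snapshot keeps "not listed here" distinct from "listed but unsupported".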
