Kling v3 Omni
Kling 3.0-Omni video generation model. Adds reference-video support (video source edit / video feature reference) on top of Kling v3.
Authorizations
Pass Authorization: Bearer <YOUR_API_KEY> in the request header. Go to the API Keys management page to create and manage API Keys.
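As a minimal sketch of the header described above, the helper below builds the request headers from an API key. The function name `build_headers` and the `Content-Type: application/json` entry are assumptions for illustration; only the `Authorization: Bearer <YOUR_API_KEY>` form comes from this page.

```python
def build_headers(api_key: str) -> dict:
    """Return HTTP headers carrying the API key as a Bearer token."""
    if not api_key or api_key.strip() != api_key:
        raise ValueError("api_key must be non-empty and have no surrounding whitespace")
    return {
        "Authorization": f"Bearer {api_key}",
        # Assumption: the body is sent as JSON; this page does not state it.
        "Content-Type": "application/json",
    }
```

Keeping the key out of source code (for example, reading it from an environment variable) is usually the safer choice.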
Body
Fixed value kling/kling-v3-omni.
The prompt, which may contain both positive and negative descriptions. Up to 2500 characters.
The Omni model supports referencing subjects, images, and videos in the prompt using the <<<>>> syntax, e.g. <<<element_1>>>, <<<image_1>>>, <<<video_1>>>.
Required when extra_params.multi_shot=false, or when extra_params.multi_shot=true with extra_params.shot_type=intelligence.
When extra_params.multi_shot=true and extra_params.shot_type=customize, this field is ignored; per-shot prompts are provided via extra_params.multi_prompt.
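The prompt rules above can be sketched as a small client-side check. The helper names are hypothetical, and treating a missing `multi_shot` as `false` is an assumption; the 2500-character cap, the `<<<>>>` syntax, and the `multi_shot` / `shot_type` conditions come from this page.

```python
import re

MAX_PROMPT_CHARS = 2500
# Matches placeholders such as <<<element_1>>>, <<<image_1>>>, <<<video_1>>>.
PLACEHOLDER = re.compile(r"<<<(\w+)>>>")


def prompt_required(extra_params: dict) -> bool:
    """True when the top-level prompt must be supplied."""
    # Assumption: an absent multi_shot behaves like multi_shot=false.
    if not extra_params.get("multi_shot", False):
        return True
    return extra_params.get("shot_type") == "intelligence"


def extract_placeholders(prompt: str) -> list:
    """Return the names referenced via <<<name>>> in order of appearance."""
    return PLACEHOLDER.findall(prompt)


def check_prompt(prompt: str, extra_params: dict) -> list:
    """Collect human-readable problems; an empty list means the prompt looks fine."""
    errors = []
    if prompt_required(extra_params) and not prompt:
        errors.append("prompt is required for this multi_shot / shot_type combination")
    if len(prompt) > MAX_PROMPT_CHARS:
        errors.append(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    return errors
```

For example, `extract_placeholders("<<<element_1>>> walks past <<<image_1>>>")` yields the two referenced names, which can then be compared against the references actually attached to the request.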
"A slow pan across a misty mountain valley at sunrise, cinematic"
Output resolution. Defaults to 720P.
Allowed values: 720P, 1080P
Output aspect ratio. Defaults to 16:9.
Allowed values: 16:9, 9:16, 1:1
Video duration in seconds. Defaults to 5. Allowed values: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15.
When using video editing (reference_videos[].refer_type=base), this parameter is ignored: the output duration matches the input video and is billed based on the input video duration, rounded to the nearest second.
Whether to generate audio. Defaults to false.
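The allowed values listed above lend themselves to a quick pre-flight check. This is only a sketch: the function and its argument names are hypothetical Python-side names, not the API's field names, which this page does not show for these three settings.

```python
RESOLUTIONS = ("720P", "1080P")
ASPECT_RATIOS = ("16:9", "9:16", "1:1")
DURATIONS = range(3, 16)  # 3 through 15 seconds inclusive


def check_output_settings(resolution="720P", aspect_ratio="16:9", duration=5):
    """Return a list of problems; defaults mirror the documented defaults."""
    errors = []
    if resolution not in RESOLUTIONS:
        errors.append(f"resolution must be one of {RESOLUTIONS}")
    if aspect_ratio not in ASPECT_RATIOS:
        errors.append(f"aspect ratio must be one of {ASPECT_RATIOS}")
    if not isinstance(duration, int) or duration not in DURATIONS:
        errors.append("duration must be an integer from 3 to 15")
    return errors
```

Note that the duration check is moot when a video is being edited (refer_type=base), since the duration is then taken from the input video.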
Reference image list. Can provide subject / scene / style references, or serve as the first or last frame of the video.
Input modes (determined by the reference_type combination):
- Image-to-video: 1 image with first_frame;
- First/last-frame interpolation: 1 first_frame + 1 last_frame (last-frame-only is not supported; when last_frame is provided, first_frame must also be provided);
- Multi-reference image-to-video: N images with asset (non-first/last-frame assets may omit reference_type, which defaults to asset).
Combining with reference_videos:
- When reference_videos[].refer_type=base (video to be edited), first_frame / last_frame are not allowed; only asset is supported;
- Otherwise, asset-type images can coexist with reference_videos.
Image requirements:
- Formats: jpg / jpeg / png;
- Size ≤ 10 MB;
- Dimensions ≥ 300px, aspect ratio between 1:2.5 and 2.5:1.
Count limits (total of "reference images + multi-image subjects"):
- No reference video, multi-image subjects only: ≤ 7;
- No reference video, with a video subject: ≤ 4;
- With a reference video, multi-image subjects only: ≤ 4.
Multi-image subjects and video subjects are referenced from prompt via placeholders like <<<image_n>>> / <<<video_n>>>.
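The input-mode and count rules for reference images can be sketched as follows. The helper names and the returned mode labels are hypothetical; the `reference_type` and `refer_type` values and the numeric limits come from this page. Two points are assumptions: that at most one first_frame and one last_frame may appear, and that the combination "reference video plus video subject" is unsupported, since no limit is listed for it.

```python
FRAME_TYPES = {"first_frame", "last_frame"}


def image_types(reference_images):
    # reference_type may be omitted for assets; it defaults to "asset".
    return [img.get("reference_type", "asset") for img in reference_images]


def input_mode(reference_images):
    """Classify the request by its first_frame / last_frame usage."""
    types = image_types(reference_images)
    first, last = types.count("first_frame"), types.count("last_frame")
    if first > 1 or last > 1:  # assumption, inferred from "1 first_frame + 1 last_frame"
        raise ValueError("at most one first_frame and one last_frame")
    if last and not first:
        raise ValueError("last_frame requires first_frame")
    if first and last:
        return "first_last_frame_interpolation"
    if first:
        return "image_to_video"
    return "multi_reference_image_to_video" if types else "no_reference_images"


def max_image_total(has_reference_video, has_video_subject):
    """Upper bound on reference images plus multi-image subjects."""
    if has_reference_video and has_video_subject:
        raise ValueError("no documented limit for this combination")
    return 4 if (has_reference_video or has_video_subject) else 7


def check_images(reference_images, reference_videos=()):
    """Validate images against an optional reference video and return the mode."""
    mode = input_mode(reference_images)
    # refer_type defaults to "base" (video to be edited).
    editing = any(v.get("refer_type", "base") == "base" for v in reference_videos)
    if editing and FRAME_TYPES & set(image_types(reference_images)):
        raise ValueError("first_frame / last_frame are not allowed with refer_type=base")
    return mode
```

The classification looks only at frame types; how the service treats a first_frame mixed with extra assets is not described here, so the sketch does not try to decide it.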
Reference video list. At most 1 clip.
Two modes (distinguished by refer_type):
- base (default): video to be edited. The source video is repainted/edited directly. In this mode, first_frame / last_frame cannot be supplied via reference_images;
- feature: feature reference video. Only motion / style features are extracted to guide generation.
Use keep_original_sound to decide whether to preserve the original audio track — this also takes effect for feature.
When a reference video is provided, generate_audio is forced to false (no audio will be generated).
Video requirements:
- Formats: MP4 / MOV;
- Size ≤ 200 MB;
- Duration ≥ 3s (upper bound depends on the model version);
- Dimensions: 720px – 2160px inclusive;
- Frame rate: 24 – 60 fps (output is always 24 fps);
- Publicly accessible URL only; base64 is not supported.
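The video requirements above can be checked locally before submitting, assuming the file's metadata has already been probed by some other tool. `VideoMeta` and both functions are hypothetical helpers. Two interpretations are assumptions: MB is taken as 2**20 bytes, and the 720px to 2160px range is applied to both width and height.

```python
from dataclasses import dataclass

MB = 1024 * 1024  # assumption: the page does not say whether MB means 10**6 or 2**20 bytes


@dataclass
class VideoMeta:
    """Hypothetical container for metadata probed locally."""
    url: str
    container: str  # e.g. "mp4" or "mov"
    size_bytes: int
    duration_s: float
    width: int
    height: int
    fps: float


def check_reference_video(meta: VideoMeta) -> list:
    """Return a list of violated requirements; empty means the clip looks acceptable."""
    errors = []
    if not meta.url.lower().startswith(("http://", "https://")):
        errors.append("a publicly accessible URL is required; base64 is not supported")
    if meta.container.lower() not in {"mp4", "mov"}:
        errors.append("format must be MP4 or MOV")
    if meta.size_bytes > 200 * MB:
        errors.append("size must be at most 200 MB")
    if meta.duration_s < 3:
        errors.append("duration must be at least 3 seconds")
    # Assumption: the dimension range applies to each side.
    for name, value in (("width", meta.width), ("height", meta.height)):
        if not 720 <= value <= 2160:
            errors.append(f"{name} must be between 720 and 2160 px")
    if not 24 <= meta.fps <= 60:
        errors.append("frame rate must be between 24 and 60 fps")
    return errors


def effective_generate_audio(generate_audio: bool, reference_videos: list) -> bool:
    """generate_audio is forced to false whenever a reference video is supplied."""
    return bool(generate_audio) and not reference_videos
```

The upper duration bound is left unchecked because it depends on the model version and is not given on this page.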
Kling's official extension parameters. All optional.
Response
Submitted successfully (asynchronous mode)
Unique task ID, used for subsequent polling
"task_2042864638838083584"
The model ID used for this task
"kling/kling-v3-omni"
Task creation time (ISO 8601)
"2026-04-13T07:17:39.146Z"
Unique request ID
"291a4117-9df8-4349-aee1-bdf8ffcacb9a"

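The creation time in the response is an ISO 8601 string with a trailing Z. A minimal sketch for turning it into a timezone-aware value is shown below; the function name is hypothetical, and the sample string is the one from this page.

```python
from datetime import datetime


def parse_created_time(value: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp such as 2026-04-13T07:17:39.146Z."""
    # datetime.fromisoformat accepts a trailing "Z" only from Python 3.11 on,
    # so it is rewritten as an explicit UTC offset for older versions.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))
```

Calling `parse_created_time("2026-04-13T07:17:39.146Z")` gives an aware datetime in UTC, which can be compared safely with other aware timestamps when tracking how long a task has been pending.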
