Generate videos from images using the Kling v3.0 model with multi-shot, audio, element reference, and extended duration support.
first_frame and last_frame for start-to-end frame interpolationelements with frontal and reference imagesBearer authentication header of the form Bearer <token>, where <token> is your auth token.
First frame image (URL or Base64). Sets the opening frame of the generated video.
Important: When using Base64 encoding, do not add any prefixes such as data:image/png;base64,. Provide only the Base64-encoded string itself.
first_frame and last_frame must be provided.Last frame image (URL or Base64). Sets the closing frame of the generated video.
first_frame and last_frame must be provided.last_frame, dynamic_masks/static_mask, and camera_control are mutually exclusive.Positive text prompt. Cannot exceed 2500 characters.
Required when multi_shot is false or when shot_type is intelligence.
2500Negative text prompt. Cannot exceed 2500 characters.
2500Whether to generate a multi-shot video.
true: Enable multi-shot mode. prompt is ignored; use shot_type and multi_prompt instead.false: Single-shot mode (default).Shot segmentation method. Required when multi_shot is true.
customize: Custom shots, requires multi_prompt.intelligence: AI-generated shots, requires prompt.customize, intelligence Shot prompt list for multi-shot videos.
Required when multi_shot is true and shot_type is customize.
1 - 6 elementsElement definitions. Max 3 elements.
Provide frontal and reference images.
Use <<<element_1>>> in prompt to reference elements.
1 - 3 elementsGenerate audio simultaneously when generating videos.
on: Enable audio generationoff: Disable audio generation (silent video)on, off Video generation mode.
std: Standard Mode (720P), cost-effective.
pro: Professional Mode (1080P), higher quality video output.
std, pro Video length in seconds (3-15).
Accepted - Task created successfully