Kling v2.6 Video Generation
Kling v2.6
Kling v2.6 Video Generation
Generate videos with official Kling v2.6 using explicit reference-image and frame-role semantics
POST
Kling v2.6 Video Generation
- Async processing mode, returns task ID for subsequent queries
- Supports text-to-video, image-to-video, explicit first/last frame control, and audio video
- Supports standard mode (720P) and professional mode (1080P)
- Professional mode supports automatic audio generation
Authorization
All API endpoints require Bearer Token authentication.
Request Parameters
Video generation model name, fixed as
kling-v2-6.Text prompt, maximum 2500 characters.
Generation mode.
std- standard mode (720P, silent video only)pro- professional mode (1080P, supports automatic audio generation)
Video duration in seconds.Options:
5 or 10Video aspect ratio. Common values:
16:9, 9:16, 1:1Normal reference images.
- These images are treated as references only
- They are not automatically converted into first/last frames
- Use
image_with_rolesfor explicit frame control
Explicit image-role array for frame control and mixed inputs.
Negative prompt to exclude unwanted content.
Whether to automatically generate audio.
Whether to add watermark.
Input Rules
| Input shape | Behavior |
|---|---|
reference_images only | Normal references |
image_with_roles with only first_frame / last_frame | Frame control |
| Both fields used together, or roles include both frame and reference semantics | Mixed mode |
Examples
Text-to-Video
Image Reference
First and Last Frame Control
Mixed Reference and Frame Input
Pro Mode + Auto Audio
Video generation is an async task. Use the Get Video Task Status endpoint to query progress and results.