Wan2.6
Wan2.6 Video Generation
Generate high-quality videos using Alibaba Cloud Wan2.6 model — supports text-to-video, image-to-video and reference-video
POST
- Alibaba Cloud Wanxiang video generation model
- Supports three modes: Text-to-Video, Image-to-Video, and Reference Video (r2v)
- Server automatically routes to the appropriate upstream model based on your request parameters
- Supports 720p/1080p resolution, 5/10/15 second duration
- Audio is always included in the generated output
Routing Logic
The server automatically selects the upstream model based on what parameters you provide:| Parameters Provided | Upstream Mode |
|---|---|
metadata.reference_urls (video URLs) | Reference Video (r2v) |
image_urls (image) | Image-to-Video (i2v) |
prompt only | Text-to-Video (t2v) |
Authentication
All endpoints require Bearer Token authenticationGet your API Key from the API Key Management PageAdd to request header:
Request Parameters
Video generation model name, fixed as
wan2.6Video content descriptionRequired for text-to-video mode; optional for image-to-video and reference video modes (describe expected motion or style)Example:
"A cute cat stretching in the sunshine"Reference image URL array for image-to-video mode (only supports 1 image)⚠️ URL format only (base64 no longer supported)
- Publicly accessible image URL (http:// or https://)
- You can use the Upload Image API to upload local images and get URLs
["https://example.com/image.jpg"]Note: Cannot be combined with metadata.reference_urlsVideo aspect ratio (applies to text-to-video and reference video modes)Available values:
16:9- Landscape (default)9:16- Portrait1:1- Square4:3- Landscape3:4- Portrait
16:9Note: Not supported in image-to-video modeVideo resolutionAvailable values:
720p- Standard definition1080p- High definition (default)
1080p480p is not supported. Billed per second; different resolutions have different prices — refer to the model marketplace for details.Video duration (seconds)Supported values:
5, 10, 15Default: 5Negative prompt — describe content you do not want in the videoExample:
"blurry, low quality, deformed"Random seed for reproducibilityExample:
12345Auto-extend promptWhen enabled, the system automatically optimizes and enriches your prompt. Enabled by default — set to
false to disable.Include audio in the generated videoNon-flash Wan2.6 models always include audio by default. Set to
true to explicitly enable.Shot type (applies to text-to-video and reference video modes)Available values:
single- Single continuous shotmulti- Multi-shot (cinematic cuts)
Add an Alibaba Cloud watermark to the generated video
Extended parameters
Resolution and Aspect Ratio Combinations
| Aspect Ratio | Description | 720p Size | 1080p Size |
|---|---|---|---|
| 16:9 | Landscape (default) | 1280×720 | 1920×1080 |
| 9:16 | Portrait | 720×1280 | 1080×1920 |
| 1:1 | Square | 960×960 | 1440×1440 |
| 4:3 | Landscape | 1088×832 | 1632×1248 |
| 3:4 | Portrait | 832×1088 | 1248×1632 |
Response
Unique task identifier for status queries
Object type, always
generation.taskModel name used
Task status
queued- Queued for processingin_progress- Processingcompleted- Successfully completedfailed- Failed
Task progress percentage (0-100)
Task creation timestamp (Unix timestamp)
Task metadata
Usage Scenarios
Scenario 1: Text-to-Video
Scenario 2: Image-to-Video
Scenario 3: Reference Video (r2v)
Scenario 4: Text-to-Video (Full Parameters)
Querying Task ResultsVideo generation is an asynchronous task. After submission, a
task_id is returned. Use the Get Task Status interface to query generation progress and results.