Vidu Q3 Video Generation
Generate videos with Vidu Q3 models. Supported modes are text-to-video, image-to-video, first-last frame, reference-to-video, and subjects generation.
POST
- Async task API, returns a task ID after submission
- Supported models: viduq3-pro, viduq3-turbo, viduq3
  - viduq3-pro: High quality model, supports audio-video sync and storyboard generation
  - viduq3-turbo: Fast model, intelligent scene switching, fastest generation, best cost-effectiveness
  - viduq3: Best multi-camera consistency, intelligent camera switching, multi-reference generation
Authorization
All endpoints require Bearer Token authentication.
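A minimal sketch of the request headers, assuming the standard `Authorization: Bearer <token>` scheme. The `auth_headers` helper is hypothetical, and the JSON `Content-Type` is an assumption that this page does not confirm.

```python
def auth_headers(token: str) -> dict[str, str]:
    """Build headers for Bearer Token authentication."""
    if not token:
        raise ValueError("token must be non-empty")
    return {
        "Authorization": f"Bearer {token}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }
```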
Request Parameters
Vidu Q3 model name. Options:
- viduq3-pro: high quality, supports audio-video sync and storyboard generation
- viduq3-turbo: fastest generation, intelligent scene switching
- viduq3: best multi-camera consistency, for multi-reference/subjects generation
Text prompt, maximum 5000 characters. Describes subject, action, scene, camera movement, and style. When using subjects, reference them via @name in the prompt.
Example: "A cat playing piano, camera slowly zooms in, cinematic quality"
Video duration in seconds.
- viduq3-pro / viduq3-turbo: 1 to 16
- viduq3: 3 to 16
Video resolution. Options: 540p, 720p, 1080p
Video aspect ratio. Common values: 16:9, 9:16, 1:1. Q3 models support any aspect ratio. When image_urls is provided, the output ratio is usually determined by the reference image.
Reference image URL array.
- viduq3-pro / viduq3-turbo:
  - Omitted: text-to-video
  - 1 image: image-to-video, image as first frame
  - 2 images: first-last frame
- viduq3: Required, up to 7 reference images
Example: ["https://example.com/reference.jpg"]
Whether to enable audio-video output.
- All Q3 models default to true
- When true, the system generates speech and sound effects based on the prompt.
Random seed for more reproducible results with the same parameters.
Extended parameters for fields supported by the upstream API but not exposed as top-level fields.
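The limits above can be checked on the client before submitting. The following is a sketch only. Apart from `image_urls`, the field names (`model`, `prompt`, `duration`, `resolution`) are assumptions, because the extracted page does not show the real parameter names, and `build_request` is a hypothetical helper.

```python
# Limits taken from the parameter descriptions above.
DURATION_RANGE = {
    "viduq3-pro": (1, 16),
    "viduq3-turbo": (1, 16),
    "viduq3": (3, 16),
}
RESOLUTIONS = {"540p", "720p", "1080p"}
MAX_PROMPT_CHARS = 5000


def build_request(model, prompt, duration, resolution, image_urls=None):
    """Validate inputs and return a request body as a dict."""
    if model not in DURATION_RANGE:
        raise ValueError(f"unknown model: {model!r}")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError("prompt exceeds 5000 characters")
    low, high = DURATION_RANGE[model]
    if not low <= duration <= high:
        raise ValueError(f"duration for {model} must be {low} to {high} seconds")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    # Assumed field names; only image_urls is named on this page.
    body = {
        "model": model,
        "prompt": prompt,
        "duration": duration,
        "resolution": resolution,
    }
    if image_urls:
        body["image_urls"] = list(image_urls)
    return body
```

Rejecting an out-of-range duration locally avoids submitting a task that cannot succeed.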
Model Selection
| Model | Use Case | Image Input | Resolution | Duration |
|---|---|---|---|---|
| viduq3-pro | High quality text/image/first-last frame | Optional, up to 2 images | 540p / 720p / 1080p | 1-16s |
| viduq3-turbo | Fast text/image/first-last frame | Optional, up to 2 images | 540p / 720p / 1080p | 1-16s |
| viduq3 | Multi-reference/Subjects generation | Required, up to 7 images | 540p / 720p / 1080p | 3-16s |
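The image-input rules in the table determine which generation type a request falls into. As an illustration, the hypothetical helper below maps a model and an image count to a mode. The mode labels are descriptive strings chosen for this sketch, not values returned by the API.

```python
def generation_mode(model: str, image_count: int) -> str:
    """Infer the generation mode from the model and number of reference images."""
    if model in ("viduq3-pro", "viduq3-turbo"):
        modes = {0: "text-to-video", 1: "image-to-video", 2: "first-last-frame"}
        if image_count not in modes:
            raise ValueError(f"{model} accepts at most 2 images")
        return modes[image_count]
    if model == "viduq3":
        # Reference images are required for viduq3, up to 7.
        if not 1 <= image_count <= 7:
            raise ValueError("viduq3 requires 1 to 7 reference images")
        return "reference-to-video"
    raise ValueError(f"unknown model: {model!r}")
```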
Billing Dimensions
Vidu Q3 is billed by model, resolution, and generation type:
- viduq3-pro: high quality tier, approximately 2x the price of turbo
- viduq3-turbo: fast tier, best cost-effectiveness
- viduq3: multi-reference generation, billed at reference generation rate
Response
Task ID for querying task status.
Object type, usually generation.task.
Model used for the request.
Task status: queued, in_progress, completed, or failed.
Task creation timestamp.
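Of the four status values, completed and failed presumably end a task, while queued and in_progress mean it is still running. A small sketch of that distinction, using a hypothetical `is_terminal` helper:

```python
KNOWN_STATUSES = {"queued", "in_progress", "completed", "failed"}
TERMINAL_STATUSES = {"completed", "failed"}


def is_terminal(status: str) -> bool:
    """Return True when the task will not change state any further."""
    if status not in KNOWN_STATUSES:
        raise ValueError(f"unexpected task status: {status!r}")
    return status in TERMINAL_STATUSES
```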
Examples
Text-to-Video
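An illustrative request body for text-to-video, where no reference images are supplied. The field names `model`, `prompt`, and `duration` are assumptions, and the duration value is an arbitrary choice within the 1 to 16 second range.

```python
import json

# Assumed field names; the real parameter names are not shown on this page.
text_to_video = {
    "model": "viduq3-turbo",
    "prompt": "A cat playing piano, camera slowly zooms in, cinematic quality",
    "duration": 5,
}
text_to_video_body = json.dumps(text_to_video)
```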
Image-to-Video
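An illustrative request body for image-to-video: a single entry in `image_urls` is used as the first frame. Apart from `image_urls`, the field names are assumptions, and the URL is a placeholder.

```python
import json

# Assumed field names except image_urls; the URL is a placeholder.
image_to_video = {
    "model": "viduq3-pro",
    "prompt": "The cat turns toward the camera and blinks slowly",
    "image_urls": ["https://example.com/reference.jpg"],
}
image_to_video_body = json.dumps(image_to_video)
```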
First-Last Frame
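An illustrative request body for first-last frame generation with two images. This sketch assumes the first URL is the opening frame and the second is the closing frame, which this page does not state. The other field names and both URLs are placeholders.

```python
import json

# Assumption: array order is [first frame, last frame].
first_last_frame = {
    "model": "viduq3-pro",
    "prompt": "Smooth transition from sunrise to sunset over the same valley",
    "image_urls": [
        "https://example.com/first.jpg",
        "https://example.com/last.jpg",
    ],
}
first_last_frame_body = json.dumps(first_last_frame)
```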
Multi-Reference Image Video
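An illustrative request body for multi-reference generation with viduq3, which requires reference images (up to 7) and a duration of 3 to 16 seconds. Apart from `image_urls`, the field names are assumptions, and the URLs are placeholders.

```python
import json

# Assumed field names except image_urls; URLs are placeholders.
multi_reference = {
    "model": "viduq3",
    "prompt": "The character walks through the market, camera cuts between angles",
    "duration": 8,
    "image_urls": [
        "https://example.com/character.jpg",
        "https://example.com/market.jpg",
        "https://example.com/outfit.jpg",
    ],
}
multi_reference_body = json.dumps(multi_reference)
```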
Subjects Mode
Video generation is asynchronous. Use the Get Video Task Status endpoint to query progress and results.
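One way to wait for a result is to poll until the status is completed or failed. The sketch below takes a caller-supplied `fetch_status` function, so it does not assume the status endpoint's URL or response shape. The interval and timeout defaults are arbitrary choices, not documented values.

```python
import time
from typing import Callable


def wait_for_task(
    fetch_status: Callable[[], str],
    interval: float = 5.0,
    timeout: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Poll fetch_status until the task completes, fails, or the timeout passes."""
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status in ("completed", "failed"):
            return status
        if clock() >= deadline:
            raise TimeoutError(f"task still {status!r} after {timeout} seconds")
        sleep(interval)
```

In practice, `fetch_status` would call the Get Video Task Status endpoint with the task ID and return the status field of the response.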