MiniMax-Hailuo-02 Video Generation
Generate videos with the MiniMax Hailuo 02 model. Supports text-to-video, image-to-video, and first-last-frame generation.
POST
- Async task API, returns task ID upon submission
- Supported model: MiniMax-Hailuo-02
- Three generation modes: text-to-video, image-to-video, first-last-frame generation
- Supports 512P / 768P / 1080P resolution, 6-second and 10-second duration
Authentication
All endpoints require Bearer Token authentication.
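A minimal sketch of building request headers for Bearer Token authentication. It assumes the standard `Authorization: Bearer <token>` HTTP header; the `build_headers` helper and the JSON `Content-Type` are illustrative assumptions, not something this page specifies.

```python
def build_headers(token: str) -> dict[str, str]:
    """Return HTTP headers carrying a Bearer token.

    Assumption: the API accepts a JSON body, hence the Content-Type header.
    """
    cleaned = token.strip() if token else ""
    if not cleaned:
        raise ValueError("token must be a non-empty string")
    return {
        "Authorization": f"Bearer {cleaned}",
        "Content-Type": "application/json",
    }
```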
Request Parameters
Model name, fixed as MiniMax-Hailuo-02.
Video content description, max 2000 characters.
- Required for text-to-video mode
- Optional for image-to-video and first-last-frame modes (can describe desired motion)
- Supports camera directives in square brackets (e.g., [Push in], [Pull out], [Pan left, Pedestal up]); see the Camera Directives table below.
Example: "A mouse runs toward the camera, smiling and blinking"
Video duration in seconds.
Options: 6 or 10
Video resolution.
Options:
- 512P - Standard definition (available for image-to-video)
- 768P - HD (default)
- 1080P - Full HD (6-second duration only)
First frame image URL array.
- Not provided: text-to-video
- 1 image: image-to-video, image used as video starting frame
- Format: JPG, JPEG, PNG, WebP
- Size: under 20MB
- Dimensions: short side > 300px, aspect ratio between 2:5 and 5:2
Example: ["https://example.com/first-frame.jpg"]
Extended parameters.
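The first-frame image constraints above (format, size, short side, aspect ratio) can be checked locally before a request is submitted. The sketch below is illustrative: `check_first_frame` is a hypothetical helper, the caller supplies the image dimensions, "20MB" is interpreted as 20 × 1024 × 1024 bytes, and the aspect-ratio bounds are treated as inclusive. All three interpretations are assumptions.

```python
ALLOWED_EXTENSIONS = {"jpg", "jpeg", "png", "webp"}
MAX_BYTES = 20 * 1024 * 1024  # assumption: "20MB" means 20 MiB


def check_first_frame(width: int, height: int, size_bytes: int, extension: str) -> list[str]:
    """Return a list of constraint violations (empty if the image looks acceptable)."""
    problems = []
    if extension.lower().lstrip(".") not in ALLOWED_EXTENSIONS:
        problems.append("format must be JPG, JPEG, PNG, or WebP")
    if size_bytes >= MAX_BYTES:
        problems.append("file must be under 20MB")
    if min(width, height) <= 300:
        problems.append("short side must be greater than 300px")
    # width:height must lie between 2:5 and 5:2; integer cross-multiplication
    # avoids floating-point rounding at the boundaries.
    if not (2 * height <= 5 * width and 2 * width <= 5 * height):
        problems.append("aspect ratio must be between 2:5 and 5:2")
    return problems
```

For example, `check_first_frame(1280, 720, 1_000_000, "jpg")` returns an empty list, while a 300×800 image fails both the short-side and the aspect-ratio checks.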
Three Generation Modes
| Mode | image_urls | metadata.last_frame_image | Description |
|---|---|---|---|
| Text-to-Video | Not provided | Not provided | Generate video from text description |
| Image-to-Video | ✅ 1 image | Not provided | Generate video with image as first frame |
| First-Last-Frame | ✅ 1 image (first frame) | ✅ Provided (last frame) | Generate transition video from first to last frame |
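The table can be read as a decision rule on `image_urls` and `metadata.last_frame_image`. A minimal sketch of that rule follows; `infer_mode` is a hypothetical helper, and rejecting a last frame without a first frame is an assumption, since the table does not list that combination.

```python
def infer_mode(payload: dict) -> str:
    """Infer the generation mode from a request body, following the table above."""
    image_urls = payload.get("image_urls") or []
    metadata = payload.get("metadata") or {}
    last_frame = metadata.get("last_frame_image")

    if not image_urls:
        if last_frame:
            # Assumption: a last frame without a first frame is not a valid mode.
            raise ValueError("last_frame_image requires a first frame in image_urls")
        return "text-to-video"
    if len(image_urls) != 1:
        raise ValueError("exactly one first-frame image is expected")
    return "first-last-frame" if last_frame else "image-to-video"
```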
Resolution and Duration Combinations
| Resolution | Supported Durations | Text-to-Video | Image-to-Video | First-Last-Frame |
|---|---|---|---|---|
| 512P | 6s, 10s | ✅ | ✅ | ❌ |
| 768P | 6s, 10s | ✅ | ✅ | ✅ |
| 1080P | 6s | ✅ | ✅ | ✅ |
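The combinations in the table can be encoded as a lookup so that unsupported requests are caught before submission. This is a sketch; the `SUPPORTED` structure, the mode strings, and `is_supported` are illustrative names, not part of the API.

```python
ALL_MODES = {"text-to-video", "image-to-video", "first-last-frame"}

# Transcribed from the Resolution and Duration Combinations table.
SUPPORTED = {
    "512P": {"durations": {6, 10}, "modes": {"text-to-video", "image-to-video"}},
    "768P": {"durations": {6, 10}, "modes": ALL_MODES},
    "1080P": {"durations": {6}, "modes": ALL_MODES},
}


def is_supported(resolution: str, duration: int, mode: str) -> bool:
    """Return True if the resolution/duration/mode combination appears in the table."""
    entry = SUPPORTED.get(resolution)
    if entry is None:
        return False
    return duration in entry["durations"] and mode in entry["modes"]
```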
Camera Directives
Use [directive] syntax in the prompt to control camera movement:
| Category | Directives |
|---|---|
| Pan | [Pan left] [Pan right] |
| Tilt | [Tilt up] [Tilt down] |
| Push/Pull | [Push in] [Pull out] |
| Pedestal | [Pedestal up] [Pedestal down] |
| Truck | [Truck left] [Truck right] |
| Zoom | [Zoom in] [Zoom out] |
| Other | [Shake] [Tracking shot] [Static shot] |
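A minimal sketch of composing directive brackets from the names in the table, allowing at most three directives per bracket as this section specifies. The `bracket` helper is hypothetical, and rejecting names outside the table is a local convenience rather than documented API behavior.

```python
DIRECTIVES = {
    "Pan left", "Pan right", "Tilt up", "Tilt down",
    "Push in", "Pull out", "Pedestal up", "Pedestal down",
    "Truck left", "Truck right", "Zoom in", "Zoom out",
    "Shake", "Tracking shot", "Static shot",
}


def bracket(*names: str) -> str:
    """Join one to three directive names into a single bracket."""
    if not 1 <= len(names) <= 3:
        raise ValueError("a bracket takes between 1 and 3 directives")
    unknown = [name for name in names if name not in DIRECTIVES]
    if unknown:
        raise ValueError(f"unknown directives: {unknown}")
    return "[" + ", ".join(names) + "]"


# Sequenced moves: push in first, then pull out.
prompt = (
    f"A mouse runs toward the camera {bracket('Push in')}, "
    f"then it stops and blinks {bracket('Pull out')}"
)
```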
- Combine up to 3 directives in one bracket for simultaneous movement, e.g., [Pan left, Pedestal up]
- Sequence directives in the prompt for ordered camera moves, e.g., "...[Push in], then...[Pull out]"
Response
Task ID for querying task status.
Object type, always generation.task.
Model name used for this request.
Task status: queued, in_progress, completed, or failed.
Task creation timestamp.
Examples
Text-to-Video
Image-to-Video
First-Last-Frame Generation
Camera Control
10-Second Long Video + Fast Preprocessing
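A hedged sketch of request bodies for the three generation modes. Only `image_urls` and `metadata.last_frame_image` are named on this page; the field names `model`, `prompt`, `duration`, and `resolution` are assumptions, as is passing the last frame as a URL string. `build_payload` is a hypothetical helper and sends nothing over the network.

```python
import json

MODEL = "MiniMax-Hailuo-02"


def build_payload(duration, prompt=None, first_frame=None, last_frame=None, resolution="768P"):
    """Build a request body; field names other than image_urls/metadata are assumed."""
    if first_frame is None and not prompt:
        raise ValueError("a prompt is required for text-to-video")
    if prompt is not None and len(prompt) > 2000:
        raise ValueError("prompt is limited to 2000 characters")
    if last_frame is not None and first_frame is None:
        raise ValueError("a last frame requires a first frame")

    payload = {"model": MODEL, "duration": duration, "resolution": resolution}
    if prompt:
        payload["prompt"] = prompt
    if first_frame is not None:
        payload["image_urls"] = [first_frame]
    if last_frame is not None:
        payload["metadata"] = {"last_frame_image": last_frame}
    return payload


text_to_video = build_payload(6, prompt="A mouse runs toward the camera, smiling and blinking")
image_to_video = build_payload(6, first_frame="https://example.com/first-frame.jpg")
first_last = build_payload(
    6,
    first_frame="https://example.com/first-frame.jpg",
    last_frame="https://example.com/last-frame.jpg",  # hypothetical URL
)
body = json.dumps(text_to_video)  # JSON string to send as the POST body
```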
Video generation is an async task. Use Get Video Task Status to query progress and results.
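A minimal polling sketch for the async flow. The status-query request is not described on this page, so `fetch_status` is a caller-supplied, hypothetical callable that returns the current task status string. Treating `completed` and `failed` as terminal states, and the interval and timeout defaults, are assumptions.

```python
import time
from typing import Callable

PENDING = {"queued", "in_progress"}
TERMINAL = {"completed", "failed"}


def wait_for_task(
    fetch_status: Callable[[], str],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Poll until the task reaches a terminal status, then return that status."""
    deadline = clock() + timeout_s
    while True:
        status = fetch_status()
        if status in TERMINAL:
            return status
        if status not in PENDING:
            raise ValueError(f"unexpected status: {status!r}")
        if clock() >= deadline:
            raise TimeoutError("task did not finish before the timeout")
        sleep(interval_s)
```

Injecting `sleep` and `clock` keeps the loop testable without real delays; in production the defaults can be left as they are.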