Video Generation
Veo Series
Google Veo 3.1 — official direct Fast / Quality tiers, reverse-engineered Lite / Quality tiers, native audio, first/last frame, Remix
POST
Veo Series
Google Veo 3.1 video generation, available through both official direct and reverse-engineered channels, with 4 variants in total.Documentation Index
Fetch the complete documentation index at: https://docs.qingbo.dev/llms.txt
Use this file to discover all available pages before exploring further.
- Official tiers (Vertex AI direct): billed per second; supports 4 / 6 / 8 second durations and 720P / 1080P / 4K resolutions.
- Reverse-engineered tiers: billed per call (fixed price per call); only 8-second fixed duration.
| Variant | Channel | Billing | Positioning |
|---|---|---|---|
veo3.1-quality-official | Official direct | Per-second | Flagship tier, final-delivery quality |
veo3.1-fast-official | Official direct | Per-second | Fast tier, suited for iteration drafts |
veo3.1-quality | Reverse-engineered | Per-call | Reverse-engineered flagship, fixed 8 sec |
veo3.1-lite | Reverse-engineered | Per-call | Reverse-engineered lightweight, fixed 8 sec |
Pricing
4K resolution does not support muted output — you must set
generate_audio=true (720p / 1080p can be either muted or with audio). In the table below, 720p_sound / 1080p_sound are the audio-on premium tiers, and 4k_sound is the only available 4K tier.Official Tiers (per-second)
720P and 1080P share the same price (Google’s official tiered pricing); enablinggenerate_audio=true doubles the unit price.
veo3.1-fast-official
| Spec | Unit price (per sec) | 8-second video |
|---|---|---|
| 720P muted | $0.08 | $0.64 |
| 1080P muted | $0.08 | $0.64 |
| 720P + audio | $0.12 | $0.96 |
| 1080P + audio | $0.12 | $0.96 |
| 4K + audio | $0.28 | $2.24 |
veo3.1-quality-official
| Spec | Unit price (per sec) | 8-second video |
|---|---|---|
| 720P muted | $0.16 | $1.28 |
| 1080P muted | $0.16 | $1.28 |
| 720P + audio | $0.32 | $2.56 |
| 1080P + audio | $0.32 | $2.56 |
| 4K + audio | $0.48 | $3.84 |
Reverse-Engineered Tiers (per-call)
Reverse-engineered tiers use a fixed per-call price independent of duration, resolution, or audio (only 8 seconds is supported). Refer to the admin panel / console for the published price.| Model ID | Billing | Duration | Per-call price |
|---|---|---|---|
veo3.1-quality | Per-call | 8 seconds (fixed) | TBD (see console) |
veo3.1-lite | Per-call | 8 seconds (fixed) | TBD (see console) |
Request Examples
GET /v1/tasks/{task_id}. See Task System for details.
Available Models
| Model ID | Channel | Variant | Billing | Duration | Resolution |
|---|---|---|---|---|---|
veo3.1-quality-official | Official direct | Quality flagship | Per-second | 4 / 6 / 8 sec | 720P / 1080P / 4K |
veo3.1-fast-official | Official direct | Fast tier | Per-second | 4 / 6 / 8 sec | 720P / 1080P / 4K |
veo3.1-quality | Reverse-engineered | Quality flagship variant | Per-call | 8 sec (fixed) | 720P / 1080P |
veo3.1-lite | Reverse-engineered | Lite lightweight variant | Per-call | 8 sec (fixed) | 720P / 1080P |
Common Parameters
Model ID — one of
veo3.1-quality-official / veo3.1-fast-official / veo3.1-quality / veo3.1-liteOperation type, valid values:
generate— text-to-video (T2V)image2video— image-to-video (I2V); requiresfirst_frame_imagefirst_last_frame— first/last-frame keyframes; requires bothfirst_frame_imageandlast_frame_imageremix— continuation based on an existing clip
Video description text; required for all actions
Frame aspect ratio, valid values:
16:9— landscape widescreen9:16— portrait
Output resolution, valid values:
720p1080p4k— official tiers only, and must havegenerate_audio=true(4K does not support muted output)
Video duration in seconds:
- Official tiers:
4/6/8 - Reverse-engineered tiers:
8only (fixed)
Generic image-reference array (in some scenarios equivalent to
first_frame_image)First-frame image URL. Required for
image2video / first_last_frame / remix modesLast-frame image URL — required only in
first_last_frame modeWebhook callback URL, invoked when the task reaches a terminal state. See Callback Mechanism
Subscribed callback event types; defaults to terminal states (
completed / failed)Model-Specific Parameters
All four variants share the same parameters; only the channel, billing model, and available duration / resolution differ. Tab order is Quality before Fast / Lite, and official before reverse-engineered.
- veo3.1-quality-official
- veo3.1-fast-official
- veo3.1-quality (reverse)
- veo3.1-lite (reverse)
Whether to generate the audio track. When enabled, unit price doubles; 4K must be
trueNumber of samples to generate; range 1-4
Image resize strategy for image-to-video, valid values:
pad— edge padding, preserves the entire framecrop— center crop, fits the output ratio
Upstream prompt enhancement; must be
true (forced on by Vertex AI)Person-generation policy, valid values:
allow_adult— allow generating adult personsdisallow— disallow generating persons
Remix-mode only; when
true, returns only the extension, not concatenated with the original clipResource Limits
| Item | Limit |
|---|---|
| Reference image | JPG / PNG / WEBP, ≤ 20MB each, 720P or higher recommended |
| Output video | MP4, link valid for 24 hours |
| Duration | Official tiers 4 / 6 / 8 sec; reverse-engineered tiers fixed 8 sec only |
| Resolution | 720P / 1080P freely; 4K only on official tiers and requires generate_audio=true |
| Audio | Natively generated, no extra TTS needed; 4K must include audio, 720P / 1080P optional |
| Persons | Constrained by person_generation; non-compliant content is blocked upstream |
Related Docs
- Task System Reference — task state machine / polling cadence / async push
- Request and Response Format — common error codes / headers / rate limits
- Authentication — API key application and usage
Veo Series