Video Generation
Vidu Q3 Series
ShengShu Vidu Q3 — reference-to-video / text-to-video / image-to-video / first-last frame / multi-image remix, smart shot transitions
POST
Vidu Q3 Series
ShengShu Vidu Q3 video generation series. Reference-to-video (R2V) is the flagship capability — accepts 1-7 public images as subject / style references, with the prompt focused on motion and camera while the appearance comes from the reference images. Best fit for character consistency / style continuity / multi-shot transitions. Four tiers:Documentation Index
Fetch the complete documentation index at: https://docs.qingbo.dev/llms.txt
Use this file to discover all available pages before exploring further.
viduq3-pro— flagship tier, auto-routes T2V / I2V / first-last frame byimage_urlscount, audio output enabled by default (dialog + SFX)viduq3-mix— overall quality king, supports multi-clip video / image remix for coherent video output, joint audio generation, 720P / 1080P, ideal for short-drama re-cutting and ad creative remixviduq3— standard tier, the default daily choice, smarter shot transitions, 540p / 720p / 1080p, 3-16 secviduq3-turbo— lightweight fast tier, fastest generation and lowest unit price, slightly lower quality than standard, ideal for previews / batch / A/B testing
Pricing
| Model | 540P | 720P | 1080P |
|---|---|---|---|
viduq3-pro | $0.0595 / sec | $0.1275 / sec | $0.136 / sec |
viduq3-mix | — | $0.10625 / sec | $0.1275 / sec |
viduq3 | $0.0425 / sec | $0.085 / sec | $0.10625 / sec |
viduq3-turbo | $0.034 / sec | $0.051 / sec | $0.0595 / sec |
Examples
GET /v1/tasks/{task_id}; see Task System for details.
Available Models
| Model ID | Resolution | Duration | Supported actions | Notes |
|---|---|---|---|---|
viduq3-pro | 540p / 720p / 1080p | 1-16 sec | generate / image2video / first_last_frame | Flagship tier, audio output |
viduq3-mix | 720P / 1080P | 1-16 sec | reference / image2video | Overall quality king, multi-asset remix |
viduq3 | 540p / 720p / 1080p | 1-16 sec | reference / image2video | Standard tier, default daily choice |
viduq3-turbo | 540p / 720p / 1080p | 1-16 sec | reference / image2video | Lightweight fast tier, fastest generation, lowest price |
Common Parameters
Model ID; see the Available Models table
Operation type. Pro tier may be omitted (auto-routed by
image_urls count); other tiers should specify it explicitly. Allowed values:generate— text-to-video (Pro only)image2video— image-to-videoreference— reference-to-video (default for standard / Mix / Turbo)first_last_frame— first/last frame (Pro only)
Video description text. Focus on motion / camera / atmosphere — character appearance is determined by
image_urlsFrame aspect ratio. Allowed values:
16:9— landscape widescreen9:16— portrait4:3— landscape3:4— portrait1:1— square
Output resolution; see Available Models for per-model support
Video duration in seconds, any integer 1-16. Mix tier supports 1-second short videos
Reference image URL array, 1-7 images:
- Standard / Mix / Turbo: subject / style references (R2V)
- Pro tier: 0 = T2V / 1 = I2V / 2 = first-last frame; see Auto-routing
First-frame image URL, required for Pro
first_last_frame modeLast-frame image URL, required for Pro
first_last_frame modeWebhook callback URL, invoked when the task reaches a terminal state. See Callback Mechanism
Subscribed callback events; defaults to terminal events (
succeeded / failed / cancelled)Model-specific Parameters
- viduq3-pro
- viduq3-mix
- viduq3
- viduq3-turbo
Whether to generate audio (default
true). Pro tier enables audio output by default (dialog + SFX); set false for silent outputimage_urls count; see Auto-routing.Auto-routing
viduq3-pro auto-selects the generation mode by image_urls array length, so you usually don’t need to pass action explicitly:
image_urls count | Routed mode | Equivalent action | Notes |
|---|---|---|---|
0 (or omitted) | Text-to-video | generate | Pure prompt-driven |
1 | Image-to-video | image2video | The image serves as the starting frame |
2 | First-last frame | first_last_frame | First image = first frame, second image = last frame |
When
first_frame_image + last_frame_image are passed explicitly, they take precedence and override the first-last-frame routing from image_urls.viduq3 / viduq3-mix / viduq3-turbo) default to R2V reference-to-video mode, with image_urls providing 1-7 subject / style references.
Resource Limits
| Item | Limit |
|---|---|
| Reference images | Up to 7, each ≤ 30MB, JPG / PNG / WEBP |
| First/last-frame images | JPG / PNG / WEBP, public download URL required |
| Prompt length | ≤ 2000 characters recommended |
| Video duration | Any integer 1-16 sec (Mix supports 1-second short videos) |
| Output | MP4, link valid for 24 hours |
Related Docs
- Task System Reference — task state machine / polling cadence / async push
- Request & Response Format — common error codes / headers / rate limits
- Authentication — API key signup and usage
Vidu Q3 Series