Alibaba Tongyi Wanxiang image generation — text-to-image / image-to-image / multi-image reference / image groups / interactive editing
Alibaba Tongyi Wanxiang 2.7 unified interface for image generation and editing. Covers wan2.7-image-pro (professional, text-to-image up to 4K) and wan2.7-image (standard, up to 2K) tiers. The two tiers share the same interface via internal tier switching, and are exposed as a single model externally. Six call modes:Documentation Index
Fetch the complete documentation index at: https://docs.qingbo.dev/llms.txt
Use this file to discover all available pages before exploring further.
prompt only, with optional thinking modeenable_sequential for comic strips / storyboards, max n = 12bbox_list targeted region editing (exclusive)$0.02295 per image.
| Model | Price (per image) |
|---|---|
wan2.7-image | $0.02295 |
| Mode | Trigger Field | action |
|---|---|---|
| Text-to-image | prompt only (optional thinking_mode) | generate |
| Image-to-image | + image_urls (single) | image2image |
| Image edit | + image_urls (single) + edit-instruction prompt | edit |
| Multi-image reference fusion | + image_urls (2-9) | reference |
| Image group (comic / storyboard) | enable_sequential: true + n ≤ 12 | group |
| Interactive bbox editing | + image_urls + bbox_list | interactive_edit |
GET /v1/tasks/{task_id}. See Task System for details.
| Model ID | Description |
|---|---|
wan2.7-image | Alibaba Tongyi Wanxiang 2.7. Unified interface for text-to-image / image-to-image / editing / multi-image reference / image groups / interactive bbox editing. Up to 4K |
wan2.7-imagegenerate — Text-to-image (default)image2image — Image-to-image (use with single image_urls)edit — Image editing (inpainting / style transfer)reference — Multi-image reference fusion (up to 9)group — Image group generation (use with enable_sequential)interactive_edit — Interactive bbox editing (use with bbox_list)enable_sequential: true) allows up to 12-1 for random; a fixed value reproduces similar results1:1 — Square16:9 / 9:16 — Widescreen landscape / portrait4:3 / 3:4 — Standard landscape / portrait3:2 / 2:3 — Camera ratio landscape / portrait1K / 2K / 4K. Note: image-group mode caps at 2Kimage2image / edit / interactive_edit: 1 imagereference: 2-9 imagesenable_sequential not true) and without image input (image_urls not provided); ignored otherwisen cap raises to 12resolution capped at 2Kthinking_mode and color_palette have no effect{hex, ratio}, with ratio summing to 100%.Activation condition: only applies in non-group mode. See Composite Field Referenceimage_urls, max 2 boxes per image.Only applies when action: "interactive_edit". See Composite Field Referencecolor_palette — Custom Color Palette{hex, ratio}
| Field | Type | Description |
|---|---|---|
hex | string | Hex color, e.g. #0A2540, case-insensitive |
ratio | number | Percentage coverage in the image. All items must sum to 100 |
ratio: strictly 100 (integer percentages)enable_sequential is true)bbox_list — Interactive Editing Boxes[[[x1, y1, x2, y2], ...], ...]
image_urls, one-to-one (counts must match)[x1, y1, x2, y2], top-left and bottom-right corners (pixel coordinates)action: "interactive_edit"| Item | Limit |
|---|---|
Reference images (reference) | 2-9 |
Reference images (image2image / edit / interactive_edit) | 1 |
| Single image size | ≤ 30MB, supports JPG/PNG/WEBP |
| Output resolution | 1K / 2K / 4K (group mode caps at 2K) |
Group count n | Up to 12 (requires enable_sequential: true) |
bbox_list boxes | Max 2 per image |
color_palette items | 3-10, ratio sums to 100% |
| Output file | JPG URL, valid for 24 hours |