Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Generate images via OpenAI, Google, OpenRouter, DashScope, Jimeng, Seedream, and Replicate APIs with batch support.
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
references/providers/dashscope.md
1# DashScope (阿里通义万象)23Read when the user picks `--provider dashscope`, sets `default_model.dashscope`, or asks for Qwen-Image behavior. The SKILL.md only names the default — this file covers model families, sizing rules, and limits.45## Model Families67**`qwen-image-2.0*`** — recommended modern family. Members: `qwen-image-2.0-pro`, `qwen-image-2.0-pro-2026-03-03`, `qwen-image-2.0`, `qwen-image-2.0-2026-03-03`.89- Free-form `size` in `宽*高` format10- Total pixels must be between `512*512` and `2048*2048`11- Default ≈ `1024*1024`12- Best choice for custom ratios (e.g. `21:9`) and text-heavy Chinese/English layouts1314**Fixed-size family** — `qwen-image-max`, `qwen-image-max-2025-12-30`, `qwen-image-plus`, `qwen-image-plus-2026-01-09`, `qwen-image`.1516- Only five sizes allowed: `1664*928`, `1472*1104`, `1328*1328`, `1104*1472`, `928*1664`17- Default is `1664*928`18- `qwen-image` currently has the same capability as `qwen-image-plus`1920**`wan2.7-image*`** — multimodal Wan 2.7 family. Members: `wan2.7-image-pro`, `wan2.7-image`.2122- Free-form `size` in `宽*高` format, plus aspect-ratio inference23- `wan2.7-image-pro` text-to-image (no `--ref`): total pixels in `[768*768, 4096*4096]`, ratio in `[1:8, 8:1]`24- `wan2.7-image-pro` with reference images and `wan2.7-image` (all scenarios): total pixels in `[768*768, 2048*2048]`, ratio in `[1:8, 8:1]`25- Default: `1024*1024` (`--quality normal`) or `2048*2048` (`--quality 2k`); 4K requires explicit `--size`26- Supports up to 9 reference images in `--ref` (image editing / multi-image fusion)27- Reference images are sent inline as base64 (or passed through if the path is an `http(s)://` URL)28- API does NOT use `prompt_extend`; the skill omits it for this family29- The Wan 2.7 API defaults `n` to **4** in non-collage mode and bills per generated image. baoyu-image-gen forces `n: 1` and rejects `--n > 1` to avoid silently paying for and discarding extra images.3031**Legacy** — `z-image-turbo`, `z-image-ultra`, `wanx-v1`. Only use when the user explicitly asks for legacy behavior.3233## Size Resolution3435- `--size` wins over `--ar`36- For `qwen-image-2.0*`: prefer explicit `--size`; otherwise infer from `--ar` using the recommended table below37- For `qwen-image-max/plus/image`: only use the five fixed sizes; if the requested ratio doesn't fit, switch to `qwen-image-2.0-pro`38- For `wan2.7-image*`: explicit `--size` is validated against the per-mode pixel/ratio limits; otherwise the size is derived from `--ar` and `--quality` (`normal` ≈ 1K, `2k` ≈ 2K). To request 4K with `wan2.7-image-pro` text-to-image, pass `--size` explicitly (e.g. `4096*4096`, `3840*2160`)39- `--quality` is a baoyu-image-gen preset, not an official DashScope field. The mapping of `normal`/`2k` onto the `qwen-image-2.0*` and `wan2.7-image*` tables is an implementation choice, not an API guarantee4041### Recommended `qwen-image-2.0*` sizes4243| Ratio | `normal` | `2k` |44|-------|----------|------|45| `1:1` | `1024*1024` | `1536*1536` |46| `2:3` | `768*1152` | `1024*1536` |47| `3:2` | `1152*768` | `1536*1024` |48| `3:4` | `960*1280` | `1080*1440` |49| `4:3` | `1280*960` | `1440*1080` |50| `9:16` | `720*1280` | `1080*1920` |51| `16:9` | `1280*720` | `1920*1080` |52| `21:9` | `1344*576` | `2048*872` |5354## Reference Images5556- Only `wan2.7-image-pro` and `wan2.7-image` accept `--ref`. Other DashScope models (qwen-image-2.0*, qwen-image-max/plus/image, legacy) reject `--ref` and the user is steered to a different provider/model.57- Up to 9 reference images per request. Local files are inlined as base64 data URLs; `http(s)://` URLs are forwarded as-is.58- Supplying any `--ref` automatically clamps the wan2.7-image-pro pixel ceiling from 4K to 2K (the API only supports 4K for pure text-to-image with no image input).5960## Not Exposed6162DashScope APIs also support `negative_prompt`, `prompt_extend`, `watermark`, `thinking_mode`, `seed`, `bbox_list`, `enable_sequential`, and `color_palette`. `baoyu-image-gen` does not expose them as CLI flags today; the wan2.7 family relies on the API defaults (e.g. `thinking_mode=true`). The skill always sends `n=1` for wan2.7 — if you want grid/collage mode you currently need to call the API directly.6364## Official References6566- [Qwen-Image API](https://help.aliyun.com/zh/model-studio/qwen-image-api)67- [Text-to-image guide](https://help.aliyun.com/zh/model-studio/text-to-image)68- [Qwen-Image Edit API](https://help.aliyun.com/zh/model-studio/qwen-image-edit-api)69- [Wan 2.7 image generation & editing API](https://help.aliyun.com/zh/model-studio/wan-image-generation-and-editing-api-reference)70