Source from repo

Image Generation (AI SDK)

Generate images via OpenAI, Google, OpenRouter, DashScope, Jimeng, Seedream, and Replicate APIs with batch support.

jimliuGitHub jimliuSource repo Original GitHub link Publisher page

Files

Skill

n/a

Size

352.3 KB

Entrypoint

SKILL.md

Format

git-repo

Open file

references/config/first-time-setup.md

Syntax-highlighted preview of this file as included in the skill package.

Rendered Source

markdown375 linesFree

references/config/first-time-setup.md

1---
2name: first-time-setup
3description: First-time setup and default model selection flow for baoyu-image-gen
4---
5 
6# First-Time Setup
7 
8## Overview
9 
10Triggered when:
111. No EXTEND.md found → full setup (provider + model + preferences)
122. EXTEND.md found but `default_model.[provider]` is null → model selection only
13 
14## Setup Flow
15 
16```
17No EXTEND.md found          EXTEND.md found, model null
18        │                            │
19        ▼                            ▼
20┌─────────────────────┐    ┌──────────────────────┐
21│ AskUserQuestion     │    │ AskUserQuestion      │
22│ (full setup)        │    │ (model only)         │
23└─────────────────────┘    └──────────────────────┘
24        │                            │
25        ▼                            ▼
26┌─────────────────────┐    ┌──────────────────────┐
27│ Create EXTEND.md    │    │ Update EXTEND.md     │
28└─────────────────────┘    └──────────────────────┘
29        │                            │
30        ▼                            ▼
31    Continue                     Continue
32```
33 
34## Flow 1: No EXTEND.md (Full Setup)
35 
36**Language**: Use user's input language or saved language preference.
37 
38Use AskUserQuestion with ALL questions in ONE call:
39 
40### Question 1: Default Provider
41 
42```yaml
43header: "Provider"
44question: "Default image generation provider?"
45options:
46  - label: "Google (Recommended)"
47    description: "Gemini multimodal - high quality, reference images, flexible sizes"
48  - label: "OpenAI"
49    description: "GPT Image 2 - latest OpenAI image model, reference-image workflows"
50  - label: "Azure OpenAI"
51    description: "Azure-hosted GPT Image deployments with resource-specific routing"
52  - label: "OpenRouter"
53    description: "Router for Gemini/FLUX/OpenAI-compatible image models"
54  - label: "DashScope"
55    description: "Alibaba Cloud - Qwen-Image, strong Chinese/English text rendering"
56  - label: "Z.AI"
57    description: "GLM-image, strong poster and text-heavy image generation"
58  - label: "MiniMax"
59    description: "MiniMax image generation with subject-reference character workflows"
60  - label: "Replicate"
61    description: "Curated Replicate image families - nano-banana-2, Seedream, and Wan image models"
62  - label: "Agnes"
63    description: "Sapiens AI Agnes - optimized for high information density, complex layouts, reference-image support"
64```
65 
66### Question 2: Default Google Model
67 
68Only show if user selected Google or auto-detect (no explicit provider).
69 
70```yaml
71header: "Google Model"
72question: "Default Google image generation model?"
73options:
74  - label: "gemini-3-pro-image (Recommended)"
75    description: "Highest quality, best for production use"
76  - label: "gemini-3.1-flash-image"
77    description: "Fast generation, good quality, lower cost"
78  - label: "gemini-3-flash-preview"
79    description: "Fast generation, balanced quality and speed"
80```
81 
82### Question 2b: Default OpenRouter Model
83 
84Only show if user selected OpenRouter.
85 
86```yaml
87header: "OpenRouter Model"
88question: "Default OpenRouter image generation model?"
89options:
90  - label: "google/gemini-3.1-flash-image (Recommended)"
91    description: "Best general-purpose OpenRouter image model with reference-image workflows"
92  - label: "google/gemini-2.5-flash-image-preview"
93    description: "Fast Gemini preview model on OpenRouter"
94  - label: "black-forest-labs/flux.2-pro"
95    description: "Strong text-to-image quality through OpenRouter"
96```
97 
98### Question 2c: Default Azure Deployment
99 
100Only show if user selected Azure OpenAI.
101 
102```yaml
103header: "Azure Deploy"
104question: "Default Azure image deployment name?"
105options:
106  - label: "gpt-image-2 (Recommended)"
107    description: "Use if your Azure deployment uses the GPT Image 2 model name"
108  - label: "gpt-image-1.5"
109    description: "Previous GPT Image deployment name"
110  - label: "gpt-image-1"
111    description: "Earlier GPT Image deployment name"
112```
113 
114### Question 2d: Default MiniMax Model
115 
116Only show if user selected MiniMax.
117 
118```yaml
119header: "MiniMax Model"
120question: "Default MiniMax image generation model?"
121options:
122  - label: "image-01 (Recommended)"
123    description: "Best default, supports aspect ratios and custom width/height"
124  - label: "image-01-live"
125    description: "Faster variant, use aspect ratio instead of custom size"
126```
127 
128### Question 2e: Default Z.AI Model
129 
130Only show if user selected Z.AI.
131 
132```yaml
133header: "Z.AI Model"
134question: "Default Z.AI image generation model?"
135options:
136  - label: "glm-image (Recommended)"
137    description: "Best default for posters, diagrams, and text-heavy images"
138  - label: "cogview-4-250304"
139    description: "Legacy Z.AI image model on the same endpoint"
140```
141 
142### Question 3: Default Quality
143 
144```yaml
145header: "Quality"
146question: "Default image quality?"
147options:
148  - label: "2k (Recommended)"
149    description: "2048px - covers, illustrations, infographics"
150  - label: "normal"
151    description: "1024px - quick previews, drafts"
152```
153 
154### Question 4: Save Location
155 
156```yaml
157header: "Save"
158question: "Where to save preferences?"
159options:
160  - label: "Project (Recommended)"
161    description: ".baoyu-skills/ (this project only)"
162  - label: "User"
163    description: "~/.baoyu-skills/ (all projects)"
164```
165 
166### Save Locations
167 
168| Choice | Path | Scope |
169|--------|------|-------|
170| Project | `.baoyu-skills/baoyu-image-gen/EXTEND.md` | Current project |
171| User | `$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md` | All projects |
172 
173### EXTEND.md Template
174 
175```yaml
176---
177version: 1
178default_provider: [selected provider or null]
179default_quality: [selected quality]
180default_aspect_ratio: null
181default_image_size: null
182default_image_api_dialect: null
183default_model:
184  google: [selected google model or null]
185  openai: null
186  azure: [selected azure deployment or null]
187  openrouter: [selected openrouter model or null]
188  dashscope: null
189  zai: [selected Z.AI model or null]
190  minimax: [selected minimax model or null]
191  replicate: null
192  agnes: null
193---
194```
195 
196If the user selects `OpenAI` but says their endpoint is only OpenAI-compatible and fronts another image model family, save `default_image_api_dialect: ratio-metadata` when they explicitly confirm the gateway expects aspect-ratio `size` plus metadata-based resolution. Otherwise leave it `null` / `openai-native`.
197 
198## Flow 2: EXTEND.md Exists, Model Null
199 
200When EXTEND.md exists but `default_model.[current_provider]` is null, ask ONLY the model question for the current provider.
201 
202### Google Model Selection
203 
204```yaml
205header: "Google Model"
206question: "Choose a default Google image generation model?"
207options:
208  - label: "gemini-3-pro-image (Recommended)"
209    description: "Highest quality, best for production use"
210  - label: "gemini-3.1-flash-image"
211    description: "Fast generation, good quality, lower cost"
212  - label: "gemini-3-flash-preview"
213    description: "Fast generation, balanced quality and speed"
214```
215 
216### OpenAI Model Selection
217 
218```yaml
219header: "OpenAI Model"
220question: "Choose a default OpenAI image generation model?"
221options:
222  - label: "gpt-image-2 (Recommended)"
223    description: "Latest GPT Image model, flexible sizes up to 4K, high-fidelity image inputs"
224  - label: "gpt-image-1.5"
225    description: "Previous GPT Image model"
226  - label: "gpt-image-1"
227    description: "Earlier GPT Image model"
228```
229 
230### Azure Deployment Selection
231 
232```yaml
233header: "Azure Deploy"
234question: "Choose a default Azure image deployment name?"
235options:
236  - label: "gpt-image-2 (Recommended)"
237    description: "Use when your Azure deployment name matches the GPT Image 2 model"
238  - label: "gpt-image-1.5"
239    description: "Use when your Azure deployment name matches the GPT Image 1.5 model"
240  - label: "gpt-image-1"
241    description: "Use when your Azure deployment name matches GPT-image-1"
242```
243 
244Notes for Azure setup:
245 
246- In `baoyu-image-gen`, Azure `--model` / `default_model.azure` should be the Azure deployment name, not just the underlying model family.
247- If the deployment name is custom, save that exact deployment name in `default_model.azure`.
248 
249### OpenRouter Model Selection
250 
251```yaml
252header: "OpenRouter Model"
253question: "Choose a default OpenRouter image generation model?"
254options:
255  - label: "google/gemini-3.1-flash-image (Recommended)"
256    description: "Recommended for image output and reference-image edits"
257  - label: "google/gemini-2.5-flash-image-preview"
258    description: "Fast preview-oriented image generation"
259  - label: "black-forest-labs/flux.2-pro"
260    description: "High-quality text-to-image through OpenRouter"
261```
262 
263### DashScope Model Selection
264 
265```yaml
266header: "DashScope Model"
267question: "Choose a default DashScope image generation model?"
268options:
269  - label: "qwen-image-2.0-pro (Recommended)"
270    description: "Best DashScope model for text rendering and custom sizes"
271  - label: "qwen-image-2.0"
272    description: "Faster 2.0 variant with flexible output size"
273  - label: "qwen-image-max"
274    description: "Legacy Qwen model with five fixed output sizes"
275  - label: "qwen-image-plus"
276    description: "Legacy Qwen model, same current capability as qwen-image"
277  - label: "wan2.7-image-pro"
278    description: "Wan 2.7 Pro — supports up to 4K text-to-image and reference-image editing"
279  - label: "wan2.7-image"
280    description: "Wan 2.7 base — faster generation, up to 2K, supports reference-image editing"
281  - label: "z-image-turbo"
282    description: "Legacy DashScope model for compatibility"
283  - label: "z-image-ultra"
284    description: "Legacy DashScope model, higher quality but slower"
285```
286 
287Notes for DashScope setup:
288 
289- Prefer `qwen-image-2.0-pro` when the user needs custom `--size`, uncommon ratios like `21:9`, or strong Chinese/English text rendering.
290- `qwen-image-max` / `qwen-image-plus` / `qwen-image` only support five fixed sizes: `1664*928`, `1472*1104`, `1328*1328`, `1104*1472`, `928*1664`.
291- `wan2.7-image-pro` and `wan2.7-image` are the only DashScope models that accept `--ref`. Pick one of these when the user wants reference-image editing or multi-image fusion via DashScope.
292- In `baoyu-image-gen`, `quality` is a compatibility preset. It is not a native DashScope parameter.
293 
294### Z.AI Model Selection
295 
296```yaml
297header: "Z.AI Model"
298question: "Choose a default Z.AI image generation model?"
299options:
300  - label: "glm-image (Recommended)"
301    description: "Current flagship image model with better text rendering and poster layouts"
302  - label: "cogview-4-250304"
303    description: "Legacy model on the sync image endpoint"
304```
305 
306Notes for Z.AI setup:
307 
308- Prefer `glm-image` for posters, diagrams, and Chinese/English text-heavy layouts.
309- In `baoyu-image-gen`, Z.AI currently exposes text-to-image only; reference images are not wired for this provider.
310- The sync Z.AI image API returns a downloadable image URL, which the runtime saves locally after download.
311 
312### Replicate Model Selection
313 
314```yaml
315header: "Replicate Model"
316question: "Choose a default Replicate image generation model?"
317options:
318  - label: "google/nano-banana-2 (Recommended)"
319    description: "Current default for general Replicate image generation in baoyu-image-gen"
320  - label: "bytedance/seedream-4.5"
321    description: "Replicate Seedream 4.5 with validated local size/ref guardrails"
322  - label: "bytedance/seedream-5-lite"
323    description: "Replicate Seedream 5 Lite with validated local size/ref guardrails"
324  - label: "wan-video/wan-2.7-image-pro"
325    description: "Replicate Wan 2.7 Image Pro with 4K text-to-image support"
326```
327 
328### MiniMax Model Selection
329 
330```yaml
331header: "MiniMax Model"
332question: "Choose a default MiniMax image generation model?"
333options:
334  - label: "image-01 (Recommended)"
335    description: "Best general-purpose MiniMax image model with custom width/height support"
336  - label: "image-01-live"
337    description: "Lower-latency MiniMax image model using aspect ratios"
338```
339 
340Notes for MiniMax setup:
341 
342- `image-01` is the safest default. It supports official `aspect_ratio` values and documented custom `width` / `height` output sizes.
343- `image-01-live` is useful when the user prefers faster generation and can work with aspect-ratio-based sizing.
344- MiniMax subject reference currently uses `subject_reference[].type = character`; docs recommend front-facing portrait references in JPG/JPEG/PNG under 10MB.
345 
346### Update EXTEND.md
347 
348After user selects a model:
349 
3501. Read existing EXTEND.md
3512. If `default_model:` section exists → update the provider-specific key
3523. If `default_model:` section missing → add the full section:
353 
354```yaml
355default_model:
356  google: [value or null]
357  openai: [value or null]
358  azure: [value or null]
359  openrouter: [value or null]
360  dashscope: [value or null]
361  zai: [value or null]
362  minimax: [value or null]
363  replicate: [value or null]
364  agnes: [value or null]
365```
366 
367Only set the selected provider's model; leave others as their current value or null.
368 
369## After Setup
370 
3711. Create directory if needed
3722. Write/update EXTEND.md with frontmatter
3733. Confirm: "Preferences saved to [path]"
3744. Continue with image generation
375

Marketplace

Source from repo

Image Generation (AI SDK)

Generate images via OpenAI, Google, OpenRouter, DashScope, Jimeng, Seedream, and Replicate APIs with batch support.

jimliuGitHub jimliuSource repo Original GitHub link Publisher page

Files

Skill

n/a

Size

352.3 KB

Entrypoint

SKILL.md

Format

git-repo

Open file

references/config/first-time-setup.md

Syntax-highlighted preview of this file as included in the skill package.

Rendered Source

markdown375 linesFree

references/config/first-time-setup.md

1---
2name: first-time-setup
3description: First-time setup and default model selection flow for baoyu-image-gen
4---
5 
6# First-Time Setup
7 
8## Overview
9 
10Triggered when:
111. No EXTEND.md found → full setup (provider + model + preferences)
122. EXTEND.md found but `default_model.[provider]` is null → model selection only
13 
14## Setup Flow
15 
16```
17No EXTEND.md found          EXTEND.md found, model null
18        │                            │
19        ▼                            ▼
20┌─────────────────────┐    ┌──────────────────────┐
21│ AskUserQuestion     │    │ AskUserQuestion      │
22│ (full setup)        │    │ (model only)         │
23└─────────────────────┘    └──────────────────────┘
24        │                            │
25        ▼                            ▼
26┌─────────────────────┐    ┌──────────────────────┐
27│ Create EXTEND.md    │    │ Update EXTEND.md     │
28└─────────────────────┘    └──────────────────────┘
29        │                            │
30        ▼                            ▼
31    Continue                     Continue
32```
33 
34## Flow 1: No EXTEND.md (Full Setup)
35 
36**Language**: Use user's input language or saved language preference.
37 
38Use AskUserQuestion with ALL questions in ONE call:
39 
40### Question 1: Default Provider
41 
42```yaml
43header: "Provider"
44question: "Default image generation provider?"
45options:
46  - label: "Google (Recommended)"
47    description: "Gemini multimodal - high quality, reference images, flexible sizes"
48  - label: "OpenAI"
49    description: "GPT Image 2 - latest OpenAI image model, reference-image workflows"
50  - label: "Azure OpenAI"
51    description: "Azure-hosted GPT Image deployments with resource-specific routing"
52  - label: "OpenRouter"
53    description: "Router for Gemini/FLUX/OpenAI-compatible image models"
54  - label: "DashScope"
55    description: "Alibaba Cloud - Qwen-Image, strong Chinese/English text rendering"
56  - label: "Z.AI"
57    description: "GLM-image, strong poster and text-heavy image generation"
58  - label: "MiniMax"
59    description: "MiniMax image generation with subject-reference character workflows"
60  - label: "Replicate"
61    description: "Curated Replicate image families - nano-banana-2, Seedream, and Wan image models"
62  - label: "Agnes"
63    description: "Sapiens AI Agnes - optimized for high information density, complex layouts, reference-image support"
64```
65 
66### Question 2: Default Google Model
67 
68Only show if user selected Google or auto-detect (no explicit provider).
69 
70```yaml
71header: "Google Model"
72question: "Default Google image generation model?"
73options:
74  - label: "gemini-3-pro-image (Recommended)"
75    description: "Highest quality, best for production use"
76  - label: "gemini-3.1-flash-image"
77    description: "Fast generation, good quality, lower cost"
78  - label: "gemini-3-flash-preview"
79    description: "Fast generation, balanced quality and speed"
80```
81 
82### Question 2b: Default OpenRouter Model
83 
84Only show if user selected OpenRouter.
85 
86```yaml
87header: "OpenRouter Model"
88question: "Default OpenRouter image generation model?"
89options:
90  - label: "google/gemini-3.1-flash-image (Recommended)"
91    description: "Best general-purpose OpenRouter image model with reference-image workflows"
92  - label: "google/gemini-2.5-flash-image-preview"
93    description: "Fast Gemini preview model on OpenRouter"
94  - label: "black-forest-labs/flux.2-pro"
95    description: "Strong text-to-image quality through OpenRouter"
96```
97 
98### Question 2c: Default Azure Deployment
99 
100Only show if user selected Azure OpenAI.
101 
102```yaml
103header: "Azure Deploy"
104question: "Default Azure image deployment name?"
105options:
106  - label: "gpt-image-2 (Recommended)"
107    description: "Use if your Azure deployment uses the GPT Image 2 model name"
108  - label: "gpt-image-1.5"
109    description: "Previous GPT Image deployment name"
110  - label: "gpt-image-1"
111    description: "Earlier GPT Image deployment name"
112```
113 
114### Question 2d: Default MiniMax Model
115 
116Only show if user selected MiniMax.
117 
118```yaml
119header: "MiniMax Model"
120question: "Default MiniMax image generation model?"
121options:
122  - label: "image-01 (Recommended)"
123    description: "Best default, supports aspect ratios and custom width/height"
124  - label: "image-01-live"
125    description: "Faster variant, use aspect ratio instead of custom size"
126```
127 
128### Question 2e: Default Z.AI Model
129 
130Only show if user selected Z.AI.
131 
132```yaml
133header: "Z.AI Model"
134question: "Default Z.AI image generation model?"
135options:
136  - label: "glm-image (Recommended)"
137    description: "Best default for posters, diagrams, and text-heavy images"
138  - label: "cogview-4-250304"
139    description: "Legacy Z.AI image model on the same endpoint"
140```
141 
142### Question 3: Default Quality
143 
144```yaml
145header: "Quality"
146question: "Default image quality?"
147options:
148  - label: "2k (Recommended)"
149    description: "2048px - covers, illustrations, infographics"
150  - label: "normal"
151    description: "1024px - quick previews, drafts"
152```
153 
154### Question 4: Save Location
155 
156```yaml
157header: "Save"
158question: "Where to save preferences?"
159options:
160  - label: "Project (Recommended)"
161    description: ".baoyu-skills/ (this project only)"
162  - label: "User"
163    description: "~/.baoyu-skills/ (all projects)"
164```
165 
166### Save Locations
167 
168| Choice | Path | Scope |
169|--------|------|-------|
170| Project | `.baoyu-skills/baoyu-image-gen/EXTEND.md` | Current project |
171| User | `$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md` | All projects |
172 
173### EXTEND.md Template
174 
175```yaml
176---
177version: 1
178default_provider: [selected provider or null]
179default_quality: [selected quality]
180default_aspect_ratio: null
181default_image_size: null
182default_image_api_dialect: null
183default_model:
184  google: [selected google model or null]
185  openai: null
186  azure: [selected azure deployment or null]
187  openrouter: [selected openrouter model or null]
188  dashscope: null
189  zai: [selected Z.AI model or null]
190  minimax: [selected minimax model or null]
191  replicate: null
192  agnes: null
193---
194```
195 
196If the user selects `OpenAI` but says their endpoint is only OpenAI-compatible and fronts another image model family, save `default_image_api_dialect: ratio-metadata` when they explicitly confirm the gateway expects aspect-ratio `size` plus metadata-based resolution. Otherwise leave it `null` / `openai-native`.
197 
198## Flow 2: EXTEND.md Exists, Model Null
199 
200When EXTEND.md exists but `default_model.[current_provider]` is null, ask ONLY the model question for the current provider.
201 
202### Google Model Selection
203 
204```yaml
205header: "Google Model"
206question: "Choose a default Google image generation model?"
207options:
208  - label: "gemini-3-pro-image (Recommended)"
209    description: "Highest quality, best for production use"
210  - label: "gemini-3.1-flash-image"
211    description: "Fast generation, good quality, lower cost"
212  - label: "gemini-3-flash-preview"
213    description: "Fast generation, balanced quality and speed"
214```
215 
216### OpenAI Model Selection
217 
218```yaml
219header: "OpenAI Model"
220question: "Choose a default OpenAI image generation model?"
221options:
222  - label: "gpt-image-2 (Recommended)"
223    description: "Latest GPT Image model, flexible sizes up to 4K, high-fidelity image inputs"
224  - label: "gpt-image-1.5"
225    description: "Previous GPT Image model"
226  - label: "gpt-image-1"
227    description: "Earlier GPT Image model"
228```
229 
230### Azure Deployment Selection
231 
232```yaml
233header: "Azure Deploy"
234question: "Choose a default Azure image deployment name?"
235options:
236  - label: "gpt-image-2 (Recommended)"
237    description: "Use when your Azure deployment name matches the GPT Image 2 model"
238  - label: "gpt-image-1.5"
239    description: "Use when your Azure deployment name matches the GPT Image 1.5 model"
240  - label: "gpt-image-1"
241    description: "Use when your Azure deployment name matches GPT-image-1"
242```
243 
244Notes for Azure setup:
245 
246- In `baoyu-image-gen`, Azure `--model` / `default_model.azure` should be the Azure deployment name, not just the underlying model family.
247- If the deployment name is custom, save that exact deployment name in `default_model.azure`.
248 
249### OpenRouter Model Selection
250 
251```yaml
252header: "OpenRouter Model"
253question: "Choose a default OpenRouter image generation model?"
254options:
255  - label: "google/gemini-3.1-flash-image (Recommended)"
256    description: "Recommended for image output and reference-image edits"
257  - label: "google/gemini-2.5-flash-image-preview"
258    description: "Fast preview-oriented image generation"
259  - label: "black-forest-labs/flux.2-pro"
260    description: "High-quality text-to-image through OpenRouter"
261```
262 
263### DashScope Model Selection
264 
265```yaml
266header: "DashScope Model"
267question: "Choose a default DashScope image generation model?"
268options:
269  - label: "qwen-image-2.0-pro (Recommended)"
270    description: "Best DashScope model for text rendering and custom sizes"
271  - label: "qwen-image-2.0"
272    description: "Faster 2.0 variant with flexible output size"
273  - label: "qwen-image-max"
274    description: "Legacy Qwen model with five fixed output sizes"
275  - label: "qwen-image-plus"
276    description: "Legacy Qwen model, same current capability as qwen-image"
277  - label: "wan2.7-image-pro"
278    description: "Wan 2.7 Pro — supports up to 4K text-to-image and reference-image editing"
279  - label: "wan2.7-image"
280    description: "Wan 2.7 base — faster generation, up to 2K, supports reference-image editing"
281  - label: "z-image-turbo"
282    description: "Legacy DashScope model for compatibility"
283  - label: "z-image-ultra"
284    description: "Legacy DashScope model, higher quality but slower"
285```
286 
287Notes for DashScope setup:
288 
289- Prefer `qwen-image-2.0-pro` when the user needs custom `--size`, uncommon ratios like `21:9`, or strong Chinese/English text rendering.
290- `qwen-image-max` / `qwen-image-plus` / `qwen-image` only support five fixed sizes: `1664*928`, `1472*1104`, `1328*1328`, `1104*1472`, `928*1664`.
291- `wan2.7-image-pro` and `wan2.7-image` are the only DashScope models that accept `--ref`. Pick one of these when the user wants reference-image editing or multi-image fusion via DashScope.
292- In `baoyu-image-gen`, `quality` is a compatibility preset. It is not a native DashScope parameter.
293 
294### Z.AI Model Selection
295 
296```yaml
297header: "Z.AI Model"
298question: "Choose a default Z.AI image generation model?"
299options:
300  - label: "glm-image (Recommended)"
301    description: "Current flagship image model with better text rendering and poster layouts"
302  - label: "cogview-4-250304"
303    description: "Legacy model on the sync image endpoint"
304```
305 
306Notes for Z.AI setup:
307 
308- Prefer `glm-image` for posters, diagrams, and Chinese/English text-heavy layouts.
309- In `baoyu-image-gen`, Z.AI currently exposes text-to-image only; reference images are not wired for this provider.
310- The sync Z.AI image API returns a downloadable image URL, which the runtime saves locally after download.
311 
312### Replicate Model Selection
313 
314```yaml
315header: "Replicate Model"
316question: "Choose a default Replicate image generation model?"
317options:
318  - label: "google/nano-banana-2 (Recommended)"
319    description: "Current default for general Replicate image generation in baoyu-image-gen"
320  - label: "bytedance/seedream-4.5"
321    description: "Replicate Seedream 4.5 with validated local size/ref guardrails"
322  - label: "bytedance/seedream-5-lite"
323    description: "Replicate Seedream 5 Lite with validated local size/ref guardrails"
324  - label: "wan-video/wan-2.7-image-pro"
325    description: "Replicate Wan 2.7 Image Pro with 4K text-to-image support"
326```
327 
328### MiniMax Model Selection
329 
330```yaml
331header: "MiniMax Model"
332question: "Choose a default MiniMax image generation model?"
333options:
334  - label: "image-01 (Recommended)"
335    description: "Best general-purpose MiniMax image model with custom width/height support"
336  - label: "image-01-live"
337    description: "Lower-latency MiniMax image model using aspect ratios"
338```
339 
340Notes for MiniMax setup:
341 
342- `image-01` is the safest default. It supports official `aspect_ratio` values and documented custom `width` / `height` output sizes.
343- `image-01-live` is useful when the user prefers faster generation and can work with aspect-ratio-based sizing.
344- MiniMax subject reference currently uses `subject_reference[].type = character`; docs recommend front-facing portrait references in JPG/JPEG/PNG under 10MB.
345 
346### Update EXTEND.md
347 
348After user selects a model:
349 
3501. Read existing EXTEND.md
3512. If `default_model:` section exists → update the provider-specific key
3523. If `default_model:` section missing → add the full section:
353 
354```yaml
355default_model:
356  google: [value or null]
357  openai: [value or null]
358  azure: [value or null]
359  openrouter: [value or null]
360  dashscope: [value or null]
361  zai: [value or null]
362  minimax: [value or null]
363  replicate: [value or null]
364  agnes: [value or null]
365```
366 
367Only set the selected provider's model; leave others as their current value or null.
368 
369## After Setup
370 
3711. Create directory if needed
3722. Write/update EXTEND.md with frontmatter
3733. Confirm: "Preferences saved to [path]"
3744. Continue with image generation
375

Image Generation (AI SDK)

references/config/first-time-setup.md

Preparing the source view

Image Generation (AI SDK)

references/config/first-time-setup.md