Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Generate images via OpenAI, Google, OpenRouter, DashScope, Jimeng, Seedream, and Replicate APIs with batch support.
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
references/config/first-time-setup.md
1---2name: first-time-setup3description: First-time setup and default model selection flow for baoyu-image-gen4---56# First-Time Setup78## Overview910Triggered when:111. No EXTEND.md found → full setup (provider + model + preferences)122. EXTEND.md found but `default_model.[provider]` is null → model selection only1314## Setup Flow1516```17No EXTEND.md found EXTEND.md found, model null18│ │19▼ ▼20┌─────────────────────┐ ┌──────────────────────┐21│ AskUserQuestion │ │ AskUserQuestion │22│ (full setup) │ │ (model only) │23└─────────────────────┘ └──────────────────────┘24│ │25▼ ▼26┌─────────────────────┐ ┌──────────────────────┐27│ Create EXTEND.md │ │ Update EXTEND.md │28└─────────────────────┘ └──────────────────────┘29│ │30▼ ▼31Continue Continue32```3334## Flow 1: No EXTEND.md (Full Setup)3536**Language**: Use user's input language or saved language preference.3738Use AskUserQuestion with ALL questions in ONE call:3940### Question 1: Default Provider4142```yaml43header: "Provider"44question: "Default image generation provider?"45options:46- label: "Google (Recommended)"47description: "Gemini multimodal - high quality, reference images, flexible sizes"48- label: "OpenAI"49description: "GPT Image - consistent quality, reliable output"50- label: "Azure OpenAI"51description: "Azure-hosted GPT Image deployments with resource-specific routing"52- label: "OpenRouter"53description: "Router for Gemini/FLUX/OpenAI-compatible image models"54- label: "DashScope"55description: "Alibaba Cloud - Qwen-Image, strong Chinese/English text rendering"56- label: "MiniMax"57description: "MiniMax image generation with subject-reference character workflows"58- label: "Replicate"59description: "Community models - nano-banana-pro, flexible model selection"60- label: "Z.AI"61description: "GLM-Image - text-to-image with recommended aspect sizes"62```6364### Question 2: Default Google Model6566Only show if user selected Google or auto-detect (no explicit provider).6768```yaml69header: "Google Model"70question: "Default Google image generation model?"71options:72- label: "gemini-3-pro-image-preview (Recommended)"73description: "Highest quality, best for production use"74- label: "gemini-3.1-flash-image-preview"75description: "Fast generation, good quality, lower cost"76- label: "gemini-3-flash-preview"77description: "Fast generation, balanced quality and speed"78```7980### Question 2b: Default OpenRouter Model8182Only show if user selected OpenRouter.8384```yaml85header: "OpenRouter Model"86question: "Default OpenRouter image generation model?"87options:88- label: "google/gemini-3.1-flash-image-preview (Recommended)"89description: "Best general-purpose OpenRouter image model with reference-image workflows"90- label: "google/gemini-2.5-flash-image-preview"91description: "Fast Gemini preview model on OpenRouter"92- label: "black-forest-labs/flux.2-pro"93description: "Strong text-to-image quality through OpenRouter"94```9596### Question 2c: Default Azure Deployment9798Only show if user selected Azure OpenAI.99100```yaml101header: "Azure Deploy"102question: "Default Azure image deployment name?"103options:104- label: "gpt-image-1.5 (Recommended)"105description: "Best default if your Azure deployment uses the same name"106- label: "gpt-image-1"107description: "Previous GPT Image deployment name"108```109110### Question 2d: Default MiniMax Model111112Only show if user selected MiniMax.113114```yaml115header: "MiniMax Model"116question: "Default MiniMax image generation model?"117options:118- label: "image-01 (Recommended)"119description: "Best default, supports aspect ratios and custom width/height"120- label: "image-01-live"121description: "Faster variant, use aspect ratio instead of custom size"122```123124### Question 2e: Default Z.AI Model125126Only show if user selected Z.AI.127128```yaml129header: "Z.AI Model"130question: "Default Z.AI image generation model?"131options:132- label: "glm-image (Recommended)"133description: "Latest GLM-Image, best aspect-ratio coverage and text rendering"134- label: "cogview-4-250304"135description: "Legacy CogView-4 model with 16-pixel size stepping"136- label: "cogview-4"137description: "Previous CogView-4 snapshot for compatibility"138```139140### Question 3: Default Quality141142```yaml143header: "Quality"144question: "Default image quality?"145options:146- label: "2k (Recommended)"147description: "2048px - covers, illustrations, infographics"148- label: "normal"149description: "1024px - quick previews, drafts"150```151152### Question 4: Save Location153154```yaml155header: "Save"156question: "Where to save preferences?"157options:158- label: "Project (Recommended)"159description: ".baoyu-skills/ (this project only)"160- label: "User"161description: "~/.baoyu-skills/ (all projects)"162```163164### Save Locations165166| Choice | Path | Scope |167|--------|------|-------|168| Project | `.baoyu-skills/baoyu-image-gen/EXTEND.md` | Current project |169| User | `$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md` | All projects |170171### EXTEND.md Template172173```yaml174---175version: 1176default_provider: [selected provider or null]177default_quality: [selected quality]178default_aspect_ratio: null179default_image_size: null180default_model:181google: [selected google model or null]182openai: null183azure: [selected azure deployment or null]184openrouter: [selected openrouter model or null]185dashscope: null186minimax: [selected minimax model or null]187replicate: null188zai: [selected zai model or null]189---190```191192## Flow 2: EXTEND.md Exists, Model Null193194When EXTEND.md exists but `default_model.[current_provider]` is null, ask ONLY the model question for the current provider.195196### Google Model Selection197198```yaml199header: "Google Model"200question: "Choose a default Google image generation model?"201options:202- label: "gemini-3-pro-image-preview (Recommended)"203description: "Highest quality, best for production use"204- label: "gemini-3.1-flash-image-preview"205description: "Fast generation, good quality, lower cost"206- label: "gemini-3-flash-preview"207description: "Fast generation, balanced quality and speed"208```209210### OpenAI Model Selection211212```yaml213header: "OpenAI Model"214question: "Choose a default OpenAI image generation model?"215options:216- label: "gpt-image-1.5 (Recommended)"217description: "Latest GPT Image model, high quality"218- label: "gpt-image-1"219description: "Previous generation GPT Image model"220```221222### Azure Deployment Selection223224```yaml225header: "Azure Deploy"226question: "Choose a default Azure image deployment name?"227options:228- label: "gpt-image-1.5 (Recommended)"229description: "Use when your Azure deployment name matches the GPT-image-1.5 model"230- label: "gpt-image-1"231description: "Use when your Azure deployment name matches GPT-image-1"232```233234Notes for Azure setup:235236- In `baoyu-image-gen`, Azure `--model` / `default_model.azure` should be the Azure deployment name, not just the underlying model family.237- If the deployment name is custom, save that exact deployment name in `default_model.azure`.238239### OpenRouter Model Selection240241```yaml242header: "OpenRouter Model"243question: "Choose a default OpenRouter image generation model?"244options:245- label: "google/gemini-3.1-flash-image-preview (Recommended)"246description: "Recommended for image output and reference-image edits"247- label: "google/gemini-2.5-flash-image-preview"248description: "Fast preview-oriented image generation"249- label: "black-forest-labs/flux.2-pro"250description: "High-quality text-to-image through OpenRouter"251```252253### DashScope Model Selection254255```yaml256header: "DashScope Model"257question: "Choose a default DashScope image generation model?"258options:259- label: "qwen-image-2.0-pro (Recommended)"260description: "Best DashScope model for text rendering and custom sizes"261- label: "qwen-image-2.0"262description: "Faster 2.0 variant with flexible output size"263- label: "qwen-image-max"264description: "Legacy Qwen model with five fixed output sizes"265- label: "qwen-image-plus"266description: "Legacy Qwen model, same current capability as qwen-image"267- label: "z-image-turbo"268description: "Legacy DashScope model for compatibility"269- label: "z-image-ultra"270description: "Legacy DashScope model, higher quality but slower"271```272273Notes for DashScope setup:274275- Prefer `qwen-image-2.0-pro` when the user needs custom `--size`, uncommon ratios like `21:9`, or strong Chinese/English text rendering.276- `qwen-image-max` / `qwen-image-plus` / `qwen-image` only support five fixed sizes: `1664*928`, `1472*1104`, `1328*1328`, `1104*1472`, `928*1664`.277- In `baoyu-image-gen`, `quality` is a compatibility preset. It is not a native DashScope parameter.278279### Replicate Model Selection280281```yaml282header: "Replicate Model"283question: "Choose a default Replicate image generation model?"284options:285- label: "google/nano-banana-pro (Recommended)"286description: "Google's fast image model on Replicate"287- label: "google/nano-banana"288description: "Google's base image model on Replicate"289```290291### MiniMax Model Selection292293```yaml294header: "MiniMax Model"295question: "Choose a default MiniMax image generation model?"296options:297- label: "image-01 (Recommended)"298description: "Best general-purpose MiniMax image model with custom width/height support"299- label: "image-01-live"300description: "Lower-latency MiniMax image model using aspect ratios"301```302303Notes for MiniMax setup:304305- `image-01` is the safest default. It supports official `aspect_ratio` values and documented custom `width` / `height` output sizes.306- `image-01-live` is useful when the user prefers faster generation and can work with aspect-ratio-based sizing.307- MiniMax subject reference currently uses `subject_reference[].type = character`; docs recommend front-facing portrait references in JPG/JPEG/PNG under 10MB.308309### Z.AI Model Selection310311```yaml312header: "Z.AI Model"313question: "Choose a default Z.AI image generation model?"314options:315- label: "glm-image (Recommended)"316description: "Latest GLM-Image; pixels round to multiples of 32 and cap at 2^22"317- label: "cogview-4-250304"318description: "Legacy CogView-4 snapshot with 16-pixel size stepping"319- label: "cogview-4"320description: "Earlier CogView-4 snapshot for compatibility"321```322323Notes for Z.AI setup:324325- Set `ZAI_API_KEY` (or legacy `BIGMODEL_API_KEY`) from https://docs.z.ai/.326- `glm-image` supports recommended aspect sizes (1280x1280, 1728x960, 1568x1056, …); uncommon ratios auto-fit to the 2^22 pixel budget on multiples of 32.327- Legacy CogView models use 16-pixel stepping and cap at 2^21 pixels per image.328- Z.AI does not accept reference images or `n > 1` in `baoyu-image-gen`; use Google/OpenAI providers for those workflows.329330### Update EXTEND.md331332After user selects a model:3333341. Read existing EXTEND.md3352. If `default_model:` section exists → update the provider-specific key3363. If `default_model:` section missing → add the full section:337338```yaml339default_model:340google: [value or null]341openai: [value or null]342azure: [value or null]343openrouter: [value or null]344dashscope: [value or null]345minimax: [value or null]346replicate: [value or null]347zai: [value or null]348```349350Only set the selected provider's model; leave others as their current value or null.351352## After Setup3533541. Create directory if needed3552. Write/update EXTEND.md with frontmatter3563. Confirm: "Preferences saved to [path]"3574. Continue with image generation358