Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Build and deploy AI applications on Azure AI Foundry using Microsoft's model catalog and AI services
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
models/deploy-model/TEST_PROMPTS.md
1# Deploy Model — Test Prompts23Test prompts for the unified `deploy-model` skill with router, preset, customize, and capacity sub-skills.45## Preset Mode (Quick Deploy)67| # | Prompt | Expected |8|---|--------|----------|9| 1 | Deploy gpt-4o | Preset — confirm project, deploy with defaults |10| 2 | Set up o3-mini for me | Preset — pick latest version automatically |11| 3 | I need a text-embedding-ada-002 deployment | Preset — non-chat model |12| 4 | Deploy gpt-4o to the best region | Preset — region scan, no capacity target |1314## Customize Mode (Guided Flow)1516| # | Prompt | Expected |17|---|--------|----------|18| 5 | Deploy gpt-4o with custom settings | Customize — walk through version → SKU → capacity → RAI |19| 6 | I want to choose the version and SKU for my o3-mini deployment | Customize — explicit keywords |20| 7 | Set up a PTU deployment for gpt-4o | Customize — PTU requires SKU selection |21| 8 | Deploy gpt-4o with a specific content filter | Customize — RAI policy flow |2223## Capacity Discovery2425| # | Prompt | Expected |26|---|--------|----------|27| 9 | Where can I deploy gpt-4o? | Capacity — show regions, no deploy |28| 10 | Which regions have o3-mini available? | Capacity — run script, show table |29| 11 | Check if I have enough quota for gpt-4o with 500K TPM | Capacity — high target, some regions may not qualify |3031## Chained (Capacity → Deploy)3233| # | Prompt | Expected |34|---|--------|----------|35| 12 | Find me the best region and project to deploy gpt-4o with 10K capacity | Capacity → Preset |36| 13 | Deploy o3-mini with 200K TPM to whatever region has it | Capacity → Preset |37| 14 | I want to deploy gpt-4o with 50K capacity and choose my own settings | Capacity → Customize |3839## Negative / Edge Cases4041| # | Prompt | Expected |42|---|--------|----------|43| 15 | Deploy unicorn-model-9000 | Fail gracefully — model doesn't exist |44| 16 | Deploy gpt-4o with 999999K TPM | Capacity shows no region qualifies |45| 17 | Deploy gpt-4o (with az login expired) | Auth error caught early |46| 18 | Delete my gpt-4o deployment | Should NOT trigger deploy-model |47| 19 | List my current deployments | Should NOT trigger deploy-model |48| 20 | Deploy gpt-4o to mars-region-1 | Fail gracefully — invalid region |4950## Project Selection5152| # | Prompt | Expected |53|---|--------|----------|54| 21 | Deploy gpt-4o (with PROJECT_RESOURCE_ID set) | Show current project, confirm before deploying |55| 22 | Deploy gpt-4o (no PROJECT_RESOURCE_ID) | Ask user to pick a project |56| 23 | Deploy gpt-4o to project my-special-project | Use named project directly |5758## Ambiguous / Routing Stress5960| # | Prompt | Expected |61|---|--------|----------|62| 24 | Help me with model deployment | Preset (default) — vague, no keywords |63| 25 | I need gpt-4o deployed fast with good capacity | Preset — "fast" + vague capacity |64| 26 | Can you configure a deployment? | Customize — "configure" keyword, should ask which model |65| 27 | What's the best way to deploy gpt-4o with 100K? | Capacity → Preset |6667## Automated Test Results (2026-02-09)6869All 18 tests passed. Deployments created during testing were cleaned up.7071| Category | Tests | Result |72|----------|-------|--------|73| Preset | 3/3 | ✅ |74| Customize | 2/2 | ✅ |75| Capacity | 3/3 | ✅ |76| Chained | 1/1 | ✅ |77| Negative | 5/5 | ✅ |78| Ambiguous | 4/4 | ✅ |79