Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Deploy, evaluate, and manage AI agents end-to-end on Microsoft Azure AI Foundry
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
models/deploy-model/TEST_PROMPTS.md
1# Deploy Model — Test Prompts23Test prompts for the unified `deploy-model` skill with router, preset, customize, and capacity sub-skills.45## Preset Mode (Quick Deploy)67| # | Prompt | Expected |8|---|--------|----------|9| 1 | Deploy gpt-4o | Preset — confirm project, deploy with defaults |10| 2 | Set up o3-mini for me | Preset — pick latest version automatically |11| 3 | I need a text-embedding-ada-002 deployment | Preset — non-chat model |12| 4 | Deploy gpt-4o to the best region | Preset — region scan, no capacity target |1314## Customize Mode (Guided Flow)1516| # | Prompt | Expected |17|---|--------|----------|18| 5 | Deploy gpt-4o with custom settings | Customize — walk through version → SKU → capacity → RAI |19| 6 | I want to choose the version and SKU for my o3-mini deployment | Customize — explicit keywords |20| 7 | Set up a PTU deployment for gpt-4o | Customize — PTU requires SKU selection |21| 8 | Deploy gpt-4o with a specific content filter | Customize — RAI policy flow |2223## Capacity Discovery2425| # | Prompt | Expected |26|---|--------|----------|27| 9 | Where can I deploy gpt-4o? | Capacity — show regions, no deploy |28| 10 | Which regions have o3-mini available? | Capacity — run script, show table |29| 11 | Check if I have enough quota for gpt-4o with 500K TPM | Capacity — high target, some regions may not qualify |3031## Chained (Capacity → Deploy)3233| # | Prompt | Expected |34|---|--------|----------|35| 12 | Find me the best region and project to deploy gpt-4o with 10K capacity | Capacity → Preset |36| 13 | Deploy o3-mini with 200K TPM to whatever region has it | Capacity → Preset |37| 14 | I want to deploy gpt-4o with 50K capacity and choose my own settings | Capacity → Customize |3839## Negative / Edge Cases4041| # | Prompt | Expected |42|---|--------|----------|43| 15 | Deploy unicorn-model-9000 | Fail gracefully — model doesn't exist |44| 16 | Deploy gpt-4o with 999999K TPM | Capacity shows no region qualifies |45| 17 | Deploy gpt-4o (with az login expired) | Auth error caught early |46| 18 | Delete my gpt-4o deployment | Should NOT trigger deploy-model |47| 19 | List my current deployments | Should NOT trigger deploy-model |48| 20 | Deploy gpt-4o to mars-region-1 | Fail gracefully — invalid region |4950## Project Selection5152| # | Prompt | Expected |53|---|--------|----------|54| 21 | Deploy gpt-4o (with PROJECT_RESOURCE_ID set) | Show current project, confirm before deploying |55| 22 | Deploy gpt-4o (no PROJECT_RESOURCE_ID) | Ask user to pick a project |56| 23 | Deploy gpt-4o to project my-special-project | Use named project directly |5758## Ambiguous / Routing Stress5960| # | Prompt | Expected |61|---|--------|----------|62| 24 | Help me with model deployment | Preset (default) — vague, no keywords |63| 25 | I need gpt-4o deployed fast with good capacity | Preset — "fast" + vague capacity |64| 26 | Can you configure a deployment? | Customize — "configure" keyword, should ask which model |65| 27 | What's the best way to deploy gpt-4o with 100K? | Capacity → Preset |6667## Automated Test Results (2026-02-09)6869All 18 tests passed. Deployments created during testing were cleaned up.7071| Category | Tests | Result |72|----------|-------|--------|73| Preset | 3/3 | ✅ |74| Customize | 2/2 | ✅ |75| Capacity | 3/3 | ✅ |76| Chained | 1/1 | ✅ |77| Negative | 5/5 | ✅ |78| Ambiguous | 4/4 | ✅ |79