Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Build and deploy AI applications on Azure AI Foundry using Microsoft's model catalog and AI services
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
quota/references/error-resolution.md
1# Error Resolution Workflows23**Table of Contents:** [Workflow 7: Quota Exhausted Recovery](#workflow-7-quota-exhausted-recovery) · [Workflow 8: Resolve 429 Rate Limit Errors](#workflow-8-resolve-429-rate-limit-errors) · [Workflow 9: Resolve DeploymentLimitReached](#workflow-9-resolve-deploymentlimitreached) · [Workflow 10: Resolve InsufficientQuota](#workflow-10-resolve-insufficientquota) · [Workflow 11: Resolve QuotaExceeded](#workflow-11-resolve-quotaexceeded)45## Workflow 7: Quota Exhausted Recovery67**A. Deploy to Different Region**8```bash9subId=$(az account show --query id -o tsv)10for region in eastus westus eastus2 westus2 swedencentral uksouth; do11az rest --method get --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \12--query "value[?name.value=='OpenAI.Standard.gpt-4o'].{Model:name.value, Used:currentValue, Limit:limit, Available:(limit-currentValue)}" -o table &13done; wait14```1516**B. Delete Unused Deployments**17```bash18az cognitiveservices account deployment delete --name <resource> --resource-group <rg> --deployment-name <deployment>19```2021**C. Request Quota Increase (3-5 days)**2223**D. Migrate to PTU** - See capacity-planning.md2425---2627## Workflow 8: Resolve 429 Rate Limit Errors2829**Identify Deployment:**30```bash31az cognitiveservices account deployment list --name <resource> --resource-group <rg> \32--query "[].{Name:name,Model:properties.model.name,TPM:sku.capacity*1000}" -o table33```3435**Solutions:**3637**A. Increase Capacity**38```bash39az cognitiveservices account deployment update --name <resource> --resource-group <rg> --deployment-name <deployment> --sku-capacity 10040```4142**B. Add Retry Logic** - Exponential backoff in code4344**C. Load Balance**45```bash46az cognitiveservices account deployment create --name <resource> --resource-group <rg> --deployment-name gpt-4o-2 \47--model-name gpt-4o --model-version "2024-05-13" --model-format OpenAI --sku-name Standard --sku-capacity 10048```4950**D. Migrate to PTU** - No rate limits5152---5354## Workflow 9: Resolve DeploymentLimitReached5556**Root Cause:** 10-20 slots per resource.5758**Check Count:**59```bash60deployment_count=$(az cognitiveservices account deployment list --name <resource> --resource-group <rg> --query "length(@)")61echo "Deployments: $deployment_count / ~20 slots"62```6364**Find Test Deployments:**65```bash66az cognitiveservices account deployment list --name <resource> --resource-group <rg> \67--query "[?contains(name,'test') || contains(name,'demo')].{Name:name}" -o table68```6970**Delete:**71```bash72az cognitiveservices account deployment delete --name <resource> --resource-group <rg> --deployment-name <deployment>73```7475**Or Create New Resource (fresh 10-20 slots):**76```bash77az cognitiveservices account create --name "my-foundry-2" --resource-group <rg> --location eastus --kind AIServices --sku S0 --yes78```7980---8182## Workflow 10: Resolve InsufficientQuota8384**Root Cause:** Requested capacity exceeds available quota.8586**Check Quota:**87```bash88subId=$(az account show --query id -o tsv)89az rest --method get --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/eastus/usages?api-version=2023-05-01" \90--query "value[?contains(name.value,'OpenAI')].{Model:name.value, Used:currentValue, Limit:limit, Available:(limit-currentValue)}" -o table91```9293**Solutions:**9495**A. Reduce Capacity**96```bash97az cognitiveservices account deployment create --name <resource> --resource-group <rg> --deployment-name gpt-4o \98--model-name gpt-4o --model-version "2024-05-13" --model-format OpenAI --sku-name Standard --sku-capacity 2099```100101**B. Delete Unused Deployments**102```bash103az cognitiveservices account deployment delete --name <resource> --resource-group <rg> --deployment-name <unused>104```105106**C. Different Region** - Check quota with multi-region script (Workflow 7)107108**D. Request Increase (3-5 days)**109110---111112## Workflow 11: Resolve QuotaExceeded113114**Root Cause:** Deployment exceeds regional quota.115116**Check Quota:**117```bash118subId=$(az account show --query id -o tsv)119az rest --method get --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/eastus/usages?api-version=2023-05-01" \120--query "value[?contains(name.value,'OpenAI')]" -o table121```122123**Multi-Region Check:** (Use Workflow 7 script)124125**Solutions:**126127**A. Delete Unused Deployments**128```bash129az cognitiveservices account deployment delete --name <resource> --resource-group <rg> --deployment-name <unused>130```131132**B. Different Region**133```bash134az cognitiveservices account deployment create --name <resource> --resource-group <rg> --deployment-name gpt-4o \135--model-name gpt-4o --model-version "2024-05-13" --model-format OpenAI --sku-name Standard --sku-capacity 50136```137138**C. Request Increase (3-5 days)**139140**D. Reduce Capacity**141142**Decision:** Available < 10% → Different region; 10-50% → Delete/reduce; > 50% → Delete one deployment143144---145146