Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Deploy, evaluate, and manage AI agents end-to-end on Microsoft Azure AI Foundry
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
quota/references/troubleshooting.md
1# Troubleshooting Quota Errors23**Table of Contents:** [Common Quota Errors](#common-quota-errors) · [Detailed Error Resolution](#detailed-error-resolution) · [Request Quota Increase Process](#request-quota-increase-process) · [Diagnostic Commands](#diagnostic-commands) · [External Resources](#external-resources)45## Common Quota Errors67| Error | Cause | Quick Fix |8|-------|-------|-----------|9| `QuotaExceeded` | Regional quota consumed (TPM or PTU) | Delete unused deployments or request increase |10| `InsufficientQuota` | Not enough available for requested capacity | Reduce deployment capacity or free quota |11| `DeploymentLimitReached` | Too many deployment slots used | Delete unused deployments to free slots |12| `429 Rate Limit` | TPM capacity too low for traffic (Standard only) | Increase TPM capacity or migrate to PTU |13| `PTU capacity unavailable` | No PTU quota in region | Request PTU quota or try different region |14| `SKU not supported` | PTU not available for model/region | Check model availability or use Standard TPM |1516## Detailed Error Resolution1718### QuotaExceeded Error1920All available TPM or PTU quota consumed in the region.2122**Resolution:**23241. **Check current quota usage:**25```bash26subId=$(az account show --query id -o tsv)27region="eastus"28az rest --method get \29--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \30--query "value[?contains(name.value,'OpenAI')].{Model:name.value, Used:currentValue, Limit:limit}" -o table31```32332. **Choose resolution:**34- **Option A**: Delete unused deployments to free quota35- **Option B**: Reduce requested deployment capacity36- **Option C**: Deploy to different region with available quota37- **Option D**: Request quota increase through Azure Portal3839### InsufficientQuota Error4041Available quota less than requested capacity.4243**Resolution:**44451. **Check available quota:**46```bash47# Calculate available: limit - currentValue48subId=$(az account show --query id -o tsv)49region="eastus"50az rest --method get \51--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \52--query "value[?name.value=='OpenAI.Standard.gpt-4o'].{Model:name.value, Used:currentValue, Limit:limit, Available:(limit-currentValue)}" -o table53```54552. **Options:**56- Reduce deployment capacity to fit available quota57- Delete existing deployments to free capacity58- Try different region with more available quota59- Request quota increase6061### DeploymentLimitReached Error6263Resource reached maximum deployment slot limit (10-20 slots).6465**Resolution:**66671. **List existing deployments:**68```bash69az cognitiveservices account deployment list \70--name <resource-name> \71--resource-group <rg> \72--query '[].{Name:name, Model:properties.model.name, Capacity:sku.capacity}' \73--output table74```75762. **Delete unused deployments:**77```bash78az cognitiveservices account deployment delete \79--name <resource-name> \80--resource-group <rg> \81--deployment-name <unused-deployment-name>82```83843. **Verify slot freed:**85```bash86az cognitiveservices account deployment list \87--name <resource-name> \88--resource-group <rg> \89--query 'length([])'90```9192### 429 Rate Limit Errors9394TPM capacity insufficient for traffic volume (Standard TPM only).9596**Resolution:**97981. **Check deployment capacity:**99```bash100az cognitiveservices account deployment show \101--name <resource-name> \102--resource-group <rg> \103--deployment-name <deployment-name> \104--query '{Name:name, Model:properties.model.name, Capacity:sku.capacity, SKU:sku.name}'105```1061072. **Options:**108- **Option A**: Increase TPM capacity on existing deployment109```bash110az cognitiveservices account deployment update \111--name <resource-name> \112--resource-group <rg> \113--deployment-name <deployment-name> \114--sku-capacity <higher-capacity>115```116- **Option B**: Migrate to PTU for guaranteed throughput (no rate limits)117- **Option C**: Implement retry logic with exponential backoff in application118119### PTU Capacity Unavailable Error120121No PTU quota allocated in region, or PTU not available for model/region.122123**Resolution:**1241251. **Check PTU quota:**126```bash127subId=$(az account show --query id -o tsv)128region="eastus"129az rest --method get \130--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \131--query "value[?contains(name.value,'ProvisionedManaged')].{Model:name.value, Used:currentValue, Limit:limit}" -o table132```1331342. **Options:**135- Request PTU quota increase through Azure Portal (include capacity calculator results)136- Try different region where PTU is available137- Use Standard TPM instead138139### SKU Not Supported Error140141PTU not available for specific model or region combination.142143**Resolution:**1441451. **Check model availability:**146- Review [PTU model availability by region](https://learn.microsoft.com/azure/ai-services/openai/concepts/models#provisioned-deployment-model-availability)1471482. **Options:**149- Deploy with Standard TPM SKU instead150- Choose different region where PTU is supported151- Use alternative model that supports PTU in your region152153## Request Quota Increase Process154155### For Standard TPM Quota1561571. Navigate to Azure Portal → Your Foundry resource → **Quotas**1582. Identify model needing increase (e.g., "GPT-4o Standard")1593. Click **Request quota increase**1604. Fill form:161- Model name162- Requested quota (in TPM)163- Business justification (required)1645. Submit and monitor status165166**Processing Time:** Typically 1-2 business days167168### For PTU Quota1691701. Navigate to Azure Portal → Your Foundry resource → **Quotas**1712. Select **Provisioned throughput unit** tab1723. Identify model needing PTU increase1734. Click **Request quota increase**1745. Fill form:175- Model name176- Requested PTU quota177- Include capacity calculator results178- Detailed business justification (workload characteristics)1796. Submit and monitor status180181**Processing Time:** Typically 3-5 business days (requires stronger justification)182183## Diagnostic Commands184185```bash186# Check deployment status187az cognitiveservices account deployment show \188--name <resource-name> \189--resource-group <rg> \190--deployment-name <deployment-name>191192# Verify available quota193subId=$(az account show --query id -o tsv)194az rest --method get \195--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/eastus/usages?api-version=2023-05-01" \196--query "value[?contains(name.value,'OpenAI')].{Model:name.value, Used:currentValue, Limit:limit, Available:(limit-currentValue)}" \197--output table198199# List all deployments200az cognitiveservices account deployment list \201--name <resource-name> \202--resource-group <rg> \203--query '[].{Name:name, Model:properties.model.name, Capacity:sku.capacity, SKU:sku.name}' \204--output table205```206207## External Resources208209- [Quota Management Documentation](https://learn.microsoft.com/azure/ai-services/openai/how-to/quota)210- [Rate Limits Documentation](https://learn.microsoft.com/azure/ai-services/openai/quotas-limits)211- [Troubleshooting Guide](https://learn.microsoft.com/azure/ai-services/openai/troubleshooting)212