Build and deploy AI applications on Azure AI Foundry using Microsoft's model catalog and AI services
quota/references/troubleshooting.md
# Troubleshooting Quota Errors

**Table of Contents:** [Common Quota Errors](#common-quota-errors) · [Detailed Error Resolution](#detailed-error-resolution) · [Request Quota Increase Process](#request-quota-increase-process) · [Diagnostic Commands](#diagnostic-commands) · [External Resources](#external-resources)

## Common Quota Errors

| Error | Cause | Quick Fix |
|-------|-------|-----------|
| `QuotaExceeded` | Regional quota consumed (TPM or PTU) | Delete unused deployments or request increase |
| `InsufficientQuota` | Not enough available for requested capacity | Reduce deployment capacity or free quota |
| `DeploymentLimitReached` | Too many deployment slots used | Delete unused deployments to free slots |
| `429 Rate Limit` | TPM capacity too low for traffic (Standard only) | Increase TPM capacity or migrate to PTU |
| `PTU capacity unavailable` | No PTU quota in region | Request PTU quota or try different region |
| `SKU not supported` | PTU not available for model/region | Check model availability or use Standard TPM |

## Detailed Error Resolution

### QuotaExceeded Error

All available TPM or PTU quota in the region has been consumed.

**Resolution:**

1. **Check current quota usage:**
   ```bash
   subId=$(az account show --query id -o tsv)
   region="eastus"
   az rest --method get \
     --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \
     --query "value[?contains(name.value,'OpenAI')].{Model:name.value, Used:currentValue, Limit:limit}" -o table
   ```

2. **Choose resolution:**
   - **Option A**: Delete unused deployments to free quota
   - **Option B**: Reduce requested deployment capacity
   - **Option C**: Deploy to a different region with available quota
   - **Option D**: Request a quota increase through the Azure Portal

### InsufficientQuota Error

The available quota is less than the requested capacity.

**Resolution:**

1. **Check available quota:**
   ```bash
   # Available = limit - currentValue.
   # JMESPath has no arithmetic operators, so the subtraction is done in awk.
   subId=$(az account show --query id -o tsv)
   region="eastus"
   az rest --method get \
     --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \
     --query "value[?name.value=='OpenAI.Standard.gpt-4o'].[name.value, currentValue, limit]" -o tsv |
     awk -F'\t' '{ printf "%s\tUsed=%s\tLimit=%s\tAvailable=%s\n", $1, $2, $3, $3 - $2 }'
   ```

2. **Options:**
   - Reduce deployment capacity to fit available quota
   - Delete existing deployments to free capacity
   - Try a different region with more available quota
   - Request a quota increase

### DeploymentLimitReached Error

The resource has reached its maximum number of deployment slots (10-20 slots).

**Resolution:**

1. **List existing deployments:**
   ```bash
   az cognitiveservices account deployment list \
     --name <resource-name> \
     --resource-group <rg> \
     --query '[].{Name:name, Model:properties.model.name, Capacity:sku.capacity}' \
     --output table
   ```

2. **Delete unused deployments:**
   ```bash
   az cognitiveservices account deployment delete \
     --name <resource-name> \
     --resource-group <rg> \
     --deployment-name <unused-deployment-name>
   ```

3. **Verify slot freed:**
   ```bash
   az cognitiveservices account deployment list \
     --name <resource-name> \
     --resource-group <rg> \
     --query 'length([])'
   ```

### 429 Rate Limit Errors

TPM capacity is insufficient for the traffic volume (Standard TPM only).

**Resolution:**

1. **Check deployment capacity:**
   ```bash
   az cognitiveservices account deployment show \
     --name <resource-name> \
     --resource-group <rg> \
     --deployment-name <deployment-name> \
     --query '{Name:name, Model:properties.model.name, Capacity:sku.capacity, SKU:sku.name}'
   ```

2. **Options:**
   - **Option A**: Increase TPM capacity on the existing deployment
     ```bash
     az cognitiveservices account deployment update \
       --name <resource-name> \
       --resource-group <rg> \
       --deployment-name <deployment-name> \
       --sku-capacity <higher-capacity>
     ```
   - **Option B**: Migrate to PTU for reserved throughput (no TPM-based rate limits)
   - **Option C**: Implement retry logic with exponential backoff in the application

### PTU Capacity Unavailable Error

No PTU quota is allocated in the region, or PTU is not available for the model/region.

**Resolution:**

1. **Check PTU quota:**
   ```bash
   subId=$(az account show --query id -o tsv)
   region="eastus"
   az rest --method get \
     --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \
     --query "value[?contains(name.value,'ProvisionedManaged')].{Model:name.value, Used:currentValue, Limit:limit}" -o table
   ```

2. **Options:**
   - Request a PTU quota increase through the Azure Portal (include capacity calculator results)
   - Try a different region where PTU is available
   - Use Standard TPM instead

### SKU Not Supported Error

PTU is not available for the specific model and region combination.

**Resolution:**

1. **Check model availability:**
   - Review [PTU model availability by region](https://learn.microsoft.com/azure/ai-services/openai/concepts/models#provisioned-deployment-model-availability)

2. **Options:**
   - Deploy with the Standard TPM SKU instead
   - Choose a different region where PTU is supported
   - Use an alternative model that supports PTU in your region

## Request Quota Increase Process

### For Standard TPM Quota

1. Navigate to Azure Portal → Your Foundry resource → **Quotas**
2. Identify the model needing an increase (e.g., "GPT-4o Standard")
3. Click **Request quota increase**
4. Fill in the form:
   - Model name
   - Requested quota (in TPM)
   - Business justification (required)
5. Submit and monitor status

**Processing Time:** Typically 1-2 business days

### For PTU Quota

1. Navigate to Azure Portal → Your Foundry resource → **Quotas**
2. Select the **Provisioned throughput unit** tab
3. Identify the model needing a PTU increase
4. Click **Request quota increase**
5. Fill in the form:
   - Model name
   - Requested PTU quota
   - Include capacity calculator results
   - Detailed business justification (workload characteristics)
6. Submit and monitor status

**Processing Time:** Typically 3-5 business days (requires stronger justification)

## Diagnostic Commands

```bash
# Check deployment status
az cognitiveservices account deployment show \
  --name <resource-name> \
  --resource-group <rg> \
  --deployment-name <deployment-name>

# Verify available quota (Available = Limit - Used, computed in awk
# because JMESPath has no arithmetic operators)
subId=$(az account show --query id -o tsv)
az rest --method get \
  --url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/eastus/usages?api-version=2023-05-01" \
  --query "value[?contains(name.value,'OpenAI')].[name.value, currentValue, limit]" \
  --output tsv |
  awk -F'\t' '{ printf "%s\tUsed=%s\tLimit=%s\tAvailable=%s\n", $1, $2, $3, $3 - $2 }'

# List all deployments
az cognitiveservices account deployment list \
  --name <resource-name> \
  --resource-group <rg> \
  --query '[].{Name:name, Model:properties.model.name, Capacity:sku.capacity, SKU:sku.name}' \
  --output table
```

## External Resources

- [Quota Management Documentation](https://learn.microsoft.com/azure/ai-services/openai/how-to/quota)
- [Rate Limits Documentation](https://learn.microsoft.com/azure/ai-services/openai/quotas-limits)
- [Troubleshooting Guide](https://learn.microsoft.com/azure/ai-services/openai/troubleshooting)
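
The "retry logic with exponential backoff" option listed under 429 Rate Limit Errors can be sketched in shell as follows. This is a minimal illustration, not a recommended production client: `retry_with_backoff` is a hypothetical helper name, the delays are whole seconds that double after each failure, and it retries on any non-zero exit status rather than inspecting the HTTP response for a 429.

```bash
#!/usr/bin/env bash

# Hypothetical helper (not part of the Azure CLI): run a command and retry it
# with exponentially increasing delays until it succeeds or attempts run out.
# Usage: retry_with_backoff <max_attempts> <base_delay_seconds> <command> [args...]
retry_with_backoff() {
  local max_attempts=$1
  local delay=$2
  local attempt=1
  local status=0
  shift 2

  while true; do
    status=0
    "$@" || status=$?          # capture the command's exit status
    if (( status == 0 )); then
      return 0                 # success: stop retrying
    fi

    if (( attempt >= max_attempts )); then
      echo "Giving up after $attempt attempts (exit status $status)" >&2
      return "$status"
    fi

    echo "Attempt $attempt failed; retrying in ${delay}s" >&2
    sleep "$delay"
    delay=$(( delay * 2 ))     # exponential backoff: double the wait each time
    attempt=$(( attempt + 1 ))
  done
}

# Example (assumes ENDPOINT_URL and API_KEY are set by the caller):
# retry_with_backoff 5 2 curl --fail --silent -H "api-key: $API_KEY" "$ENDPOINT_URL"
```

With `curl --fail`, an HTTP 429 produces a non-zero exit status, so the call is retried after 2, 4, 8, and 16 seconds before giving up. A real application would likely also add jitter and honor any `Retry-After` header in the response.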