Troubleshooting Quota Errors
Table of Contents: Common Quota Errors · Detailed Error Resolution · Request Quota Increase Process · Diagnostic Commands · External Resources
Common Quota Errors
| Error | Cause | Quick Fix |
|---|---|---|
| QuotaExceeded | Regional quota consumed (TPM or PTU) | Delete unused deployments or request increase |
| InsufficientQuota | Not enough available for requested capacity | Reduce deployment capacity or free quota |
| DeploymentLimitReached | Too many deployment slots used | Delete unused deployments to free slots |
| 429 Rate Limit | TPM capacity too low for traffic (Standard only) | Increase TPM capacity or migrate to PTU |
| PTU capacity unavailable | No PTU quota in region | Request PTU quota or try different region |
| SKU not supported | PTU not available for model/region | Check model availability or use Standard TPM |
Detailed Error Resolution
QuotaExceeded Error
All available TPM or PTU quota in the region has been consumed.
Resolution:
- Check current quota usage:
subId=$(az account show --query id -o tsv)
region="eastus"
az rest --method get \
--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \
--query "value[?contains(name.value,'OpenAI')].{Model:name.value, Used:currentValue, Limit:limit}" -o table
- Choose resolution:
- Option A: Delete unused deployments to free quota
- Option B: Reduce requested deployment capacity
- Option C: Deploy to different region with available quota
- Option D: Request quota increase through Azure Portal
InsufficientQuota Error
The available quota is less than the requested capacity.
Resolution:
- Check available quota:
# Calculate available: limit - currentValue
# (JMESPath has no arithmetic operators, so the subtraction is done in awk)
subId=$(az account show --query id -o tsv)
region="eastus"
az rest --method get \
--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \
--query "value[?name.value=='OpenAI.Standard.gpt-4o'].[name.value, currentValue, limit]" -o tsv \
| awk -F'\t' '{ printf "%s\tUsed=%s\tLimit=%s\tAvailable=%s\n", $1, $2, $3, $3 - $2 }'
- Options:
- Reduce deployment capacity to fit available quota
- Delete existing deployments to free capacity
- Try different region with more available quota
- Request quota increase
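The options above all come down to comparing the requested capacity with what is left (limit minus current usage). A minimal sketch of that check follows. The helper names `available_quota` and `fits_quota` are hypothetical, and the arguments are assumed to be the `limit` and `currentValue` numbers reported by the usages query, in whatever units that query returns.

```shell
# Hypothetical helpers (not part of the Azure CLI).
# awk handles the arithmetic so that decimal values such as 30.5 also work.

# available_quota <limit> <currentValue>  -> prints limit - currentValue
available_quota() {
  awk -v l="$1" -v u="$2" 'BEGIN { print l - u }'
}

# fits_quota <limit> <currentValue> <requested>
# Exit status 0 if the requested capacity fits in the remaining quota, 1 otherwise.
fits_quota() {
  awk -v l="$1" -v u="$2" -v r="$3" 'BEGIN { exit !(r <= l - u) }'
}
```

For example, `fits_quota 100 30 50 && echo "fits"` prints `fits`, while a request of 80 against the same numbers returns a non-zero status, which suggests reducing capacity, freeing quota, or requesting an increase.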
DeploymentLimitReached Error
The resource has reached its maximum number of deployment slots (10-20 slots).
Resolution:
- List existing deployments:
az cognitiveservices account deployment list \
--name <resource-name> \
--resource-group <rg> \
--query '[].{Name:name, Model:properties.model.name, Capacity:sku.capacity}' \
--output table
- Delete unused deployments:
az cognitiveservices account deployment delete \
--name <resource-name> \
--resource-group <rg> \
--deployment-name <unused-deployment-name>
- Verify slot freed:
az cognitiveservices account deployment list \
--name <resource-name> \
--resource-group <rg> \
--query 'length([])'
429 Rate Limit Errors
TPM capacity insufficient for traffic volume (Standard TPM only).
Resolution:
- Check deployment capacity:
az cognitiveservices account deployment show \
--name <resource-name> \
--resource-group <rg> \
--deployment-name <deployment-name> \
--query '{Name:name, Model:properties.model.name, Capacity:sku.capacity, SKU:sku.name}'
- Options:
- Option A: Increase TPM capacity on existing deployment
az cognitiveservices account deployment update \
--name <resource-name> \
--resource-group <rg> \
--deployment-name <deployment-name> \
--sku-capacity <higher-capacity>
- Option B: Migrate to PTU for reserved, predictable throughput (429s then occur only when the provisioned capacity is fully utilized)
- Option C: Implement retry logic with exponential backoff in application
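Option C can be sketched in shell as follows. This is an illustration only, not an official client pattern: `retry_with_backoff` is a hypothetical helper, and it assumes the wrapped command exits non-zero when the request is rate limited.

```shell
# Hypothetical helper (not part of the Azure CLI): retry a command,
# doubling the wait between attempts.
# Usage: retry_with_backoff <max_attempts> <initial_delay_seconds> <command> [args...]
retry_with_backoff() {
  local max_attempts="$1"
  local delay="$2"
  local attempt=1
  shift 2
  # Run the command until it succeeds or the attempt budget is spent.
  until "$@"; do
    if [ "$attempt" -ge "$max_attempts" ]; then
      echo "giving up after $attempt attempts" >&2
      return 1
    fi
    echo "attempt $attempt failed; retrying in ${delay}s" >&2
    sleep "$delay"
    delay=$((delay * 2))       # exponential growth: 2s, 4s, 8s, ...
    attempt=$((attempt + 1))
  done
}
```

A call such as `retry_with_backoff 5 2 ./call_model.sh` (where `call_model.sh` is a placeholder for your own request script) would try up to five times, waiting 2, 4, 8, and 16 seconds between attempts. Production clients usually also add random jitter so that many callers do not retry at the same moment.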
PTU Capacity Unavailable Error
No PTU quota allocated in region, or PTU not available for model/region.
Resolution:
- Check PTU quota:
subId=$(az account show --query id -o tsv)
region="eastus"
az rest --method get \
--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/$region/usages?api-version=2023-05-01" \
--query "value[?contains(name.value,'ProvisionedManaged')].{Model:name.value, Used:currentValue, Limit:limit}" -o table
- Options:
- Request PTU quota increase through Azure Portal (include capacity calculator results)
- Try different region where PTU is available
- Use Standard TPM instead
SKU Not Supported Error
PTU not available for specific model or region combination.
Resolution:
- Check model availability:
- Options:
- Deploy with Standard TPM SKU instead
- Choose different region where PTU is supported
- Use alternative model that supports PTU in your region
Request Quota Increase Process
For Standard TPM Quota
- Navigate to Azure Portal → Your Foundry resource → Quotas
- Identify model needing increase (e.g., "GPT-4o Standard")
- Click Request quota increase
- Fill form:
- Model name
- Requested quota (in TPM)
- Business justification (required)
- Submit and monitor status
Processing Time: Typically 1-2 business days
For PTU Quota
- Navigate to Azure Portal → Your Foundry resource → Quotas
- Select Provisioned throughput unit tab
- Identify model needing PTU increase
- Click Request quota increase
- Fill form:
- Model name
- Requested PTU quota
- Include capacity calculator results
- Detailed business justification (workload characteristics)
- Submit and monitor status
Processing Time: Typically 3-5 business days (requires stronger justification)
Diagnostic Commands
# Check deployment status
az cognitiveservices account deployment show \
--name <resource-name> \
--resource-group <rg> \
--deployment-name <deployment-name>
# Verify available quota (JMESPath has no arithmetic, so awk computes limit - currentValue)
subId=$(az account show --query id -o tsv)
az rest --method get \
--url "https://management.azure.com/subscriptions/$subId/providers/Microsoft.CognitiveServices/locations/eastus/usages?api-version=2023-05-01" \
--query "value[?contains(name.value,'OpenAI')].[name.value, currentValue, limit]" \
--output tsv \
| awk -F'\t' '{ printf "%s\tUsed=%s\tLimit=%s\tAvailable=%s\n", $1, $2, $3, $3 - $2 }'
# List all deployments
az cognitiveservices account deployment list \
--name <resource-name> \
--resource-group <rg> \
--query '[].{Name:name, Model:properties.model.name, Capacity:sku.capacity, SKU:sku.name}' \
--output table