An Azure service that integrates speech processing into apps and services.
Hello Trinanjan Majumder,
Welcome to Microsoft Q&A and Thank you for reaching out.
I understand that your Custom Avatar training has been stuck in “Processing – 1 hour left” for 8+ hours, that’s understandably frustrating. Below is a consolidated troubleshooting guide combining expected behavior, common causes, and recommended actions.
1. Understand Expected Training Time
Custom Avatar training (under Azure AI Speech) is compute-intensive.
Typical timelines:
- Small/standard datasets → 2–6 hours
Larger datasets → can take 20–40 compute hours
ETA may fluctuate during processing
However, if The ETA remains fixed (e.g., “1 hour left”) for many hours
There is no visible stage progression
That may indicate backend queuing or job orchestration delay.
Please refer this How to create a custom video avatar
2. Validate Service Configuration
Please confirm The resource is a Speech resource (not OpenAI)
- SKU is Standard (S0) Custom Avatar training does NOT support Free (F0)
- Region (West US 2) supports Avatar training
3. Check Training Data Quality
Training can stall if the uploaded videos have issues such as:
- Motion blur
Poor or inconsistent lighting
Face partially obstructed
Inconsistent framing
Audio noise or overlapping voices
Even if initial validation passed, borderline-quality datasets can slow processing significantly.
4. Region Capacity or Backend Queue
Occasionally, a region (e.g., West US 2) may experience:
Training queue congestion
Backend capacity throttling
Orchestration delays
Recommended steps:
Check Azure Service Health for West US 2
If feasible (for testing), create a Speech resource in:
- East US
- West Europe And attempt training there
If it completes normally in another region, this suggests regional capacity constraints.
5. Check Azure Portal Logs & Quota
Go to Azure Portal → Speech Resource → Activity Log
Look for Failed operations, Quota errors, Throttling messages
Also verify:
- You have not hit concurrent training limits
- Subscription quota is sufficient
6. Cancel and Retry
If cancellation is available in Speech Studio:
Cancel the training job
Wait ~15–30 minutes
- Restart training
If it immediately returns to the same stuck state, that indicates backend pipeline delay rather than dataset issues.
7. Force Delete
If Training exceeds 24–48 hours, No stage change, No error surfaced
You can delete the model via REST API if the UI does not allow it:
DELETE https://<region>.api.cognitive.microsoft.com/speechtotext/v3.2/models/<model-id>
You will need:
Region name
Model ID from Speech Studio
Proper authentication
After deletion, recreate and retrain.
Please refer this Train model guide
I Hope this helps. Do let me know if you have any further queries.
Thank you!