
Custom Avatar Model Training Showing as Processing After 16 Hours

Trinanjan Majumder 0 Reputation points Microsoft Employee
2026-02-25T02:16:06.2066667+00:00

I created an Azure AI Services resource in West US 2 (Test Avatar), went to Speech Studio, uploaded all the required training data, and started the model training. However, it has been showing an estimated 1 hour left for the last 8 hours.

Azure AI Speech

An Azure service that integrates speech processing into apps and services.


1 answer

  1. SRILAKSHMI C 15,030 Reputation points Microsoft External Staff Moderator
    2026-02-25T03:38:09.5266667+00:00

    Hello Trinanjan Majumder,

    Welcome to Microsoft Q&A, and thank you for reaching out.

    I understand that your Custom Avatar training has been stuck at “Processing – 1 hour left” for 8+ hours, which is understandably frustrating. Below is a consolidated troubleshooting guide covering expected behavior, common causes, and recommended actions.

    1. Understand Expected Training Time

    Custom Avatar training (under Azure AI Speech) is compute-intensive.

    Typical timelines:

    • Small/standard datasets → 2–6 hours
    • Larger datasets → can take 20–40 compute hours
    • ETA may fluctuate during processing

    However, if:

    • The ETA remains fixed (e.g., “1 hour left”) for many hours
    • There is no visible stage progression

    that may indicate backend queuing or a job orchestration delay.
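As a rough illustration of the "fixed ETA, no stage progression" signal described above, here is a minimal sketch that checks a list of status readings you noted by hand from Speech Studio. The function name, the tuple layout, and the 4-hour threshold are all assumptions made for this example; they are not part of any Azure API.

```python
from datetime import datetime, timedelta


def eta_is_stalled(observations, max_unchanged=timedelta(hours=4)):
    """Return True if the latest (ETA, stage) pair has not changed for
    at least `max_unchanged`.

    observations: list of (timestamp, eta_text, stage) tuples, oldest first.
    The 4-hour default is an arbitrary illustrative threshold.
    """
    if not observations:
        return False
    last_time, last_eta, last_stage = observations[-1]
    unchanged_since = last_time
    # Walk backwards until the ETA text or the stage differs from the latest reading.
    for ts, eta, stage in reversed(observations):
        if (eta, stage) != (last_eta, last_stage):
            break
        unchanged_since = ts
    return last_time - unchanged_since >= max_unchanged


start = datetime(2026, 2, 25, 0, 0)
# Nine hourly readings that all say the same thing, as in the question.
stuck = [(start + timedelta(hours=h), "1 hour left", "Processing") for h in range(9)]
print(eta_is_stalled(stuck))
```

If the readings show the ETA or stage still changing, the function returns False and waiting longer is probably the right call.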

    Please refer to the “How to create a custom video avatar” guide.

    2. Validate Service Configuration

    Please confirm:

    • The resource is a Speech resource (not OpenAI)
    • The SKU is Standard (S0); Custom Avatar training does NOT support Free (F0)
    • The region (West US 2) supports Avatar training
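The three checks above can be written down as a small local helper, sketched below. It makes no Azure calls: you pass in the values shown on the resource's Overview page. The function name is hypothetical, the kind string "OpenAI" is assumed to be how an Azure OpenAI resource is labeled, and the set of avatar-capable regions must be supplied by you from the current documentation rather than taken from this example.

```python
def check_avatar_resource(kind, sku, region, avatar_regions):
    """Return a list of configuration problems (empty list means none found).

    kind, sku, region: values copied from the resource's Overview page.
    avatar_regions: set of normalized region names (e.g. {"westus2"}) that
    the current documentation lists as supporting avatar training.
    """
    problems = []
    # Assumption: Azure OpenAI resources report their kind as "OpenAI".
    if kind.strip().lower() == "openai":
        problems.append("Resource is an OpenAI resource, not a Speech resource")
    if sku.strip().upper() != "S0":
        problems.append(f"SKU is {sku}; Custom Avatar training needs Standard (S0)")
    # Normalize display names such as "West US 2" to "westus2".
    normalized = region.replace(" ", "").lower()
    if normalized not in avatar_regions:
        problems.append(f"Region {region} is not in the supplied avatar region list")
    return problems


# Placeholder region set for illustration only; fill it in from the docs.
print(check_avatar_resource("SpeechServices", "S0", "West US 2", {"westus2"}))
```

An empty list only means these three settings look fine; it says nothing about the training job itself.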

    3. Check Training Data Quality

    Training can stall if the uploaded videos have issues such as:

    • Motion blur
    • Poor or inconsistent lighting
    • Face partially obstructed
    • Inconsistent framing
    • Audio noise or overlapping voices

    Even if initial validation passed, borderline-quality datasets can slow processing significantly.

    4. Region Capacity or Backend Queue

    Occasionally, a region (e.g., West US 2) may experience:

    • Training queue congestion
    • Backend capacity throttling
    • Orchestration delays

    Recommended steps:

    • Check Azure Service Health for West US 2
    • If feasible (for testing), create a Speech resource in one of the following regions and attempt training there:

      • East US
      • West Europe

    If it completes normally in another region, this suggests regional capacity constraints.

    5. Check Azure Portal Logs & Quota

    Go to Azure Portal → Speech Resource → Activity Log

    Look for failed operations, quota errors, and throttling messages.

    Also verify:

    • You have not hit concurrent training limits
    • Subscription quota is sufficient
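If you export the Activity Log as JSON, the scan for failed operations, quota errors, and throttling messages can be sketched as below. The entry shape used here (a `status` object with a `value` field) is an assumption about the export format, and the keyword list is only a guess at how such messages are worded, so adjust both to match what your export actually contains.

```python
import json

# Assumed keywords; "throttl" matches both "throttled" and "throttling".
KEYWORDS = ("quota", "throttl")


def suspicious_entries(entries):
    """Return entries that failed or that mention quota/throttling anywhere.

    entries: list of dicts parsed from an exported Activity Log.
    Assumption: each entry may carry {"status": {"value": "Failed"}}.
    """
    hits = []
    for entry in entries:
        status = (entry.get("status") or {}).get("value", "")
        # Search the whole serialized entry so nested messages are covered too.
        text = json.dumps(entry).lower()
        if status.lower() == "failed" or any(k in text for k in KEYWORDS):
            hits.append(entry)
    return hits


sample = [
    {"status": {"value": "Succeeded"}, "operationName": {"value": "write"}},
    {"status": {"value": "Failed"}, "operationName": {"value": "write"}},
    {"status": {"value": "Succeeded"}, "properties": {"message": "Request was throttled"}},
]
print(len(suspicious_entries(sample)))
```

The sample entries are made up for the example and do not represent real log output.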

    6. Cancel and Retry

    If cancellation is available in Speech Studio:

    1. Cancel the training job
    2. Wait ~15–30 minutes
    3. Restart training

    If it immediately returns to the same stuck state, that indicates backend pipeline delay rather than dataset issues.

    7. Force Delete

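    If training exceeds 24–48 hours with no stage change and no error surfaced, you can delete the model via the REST API if the UI does not allow it. Note that the path below belongs to the Speech to text REST API (v3.2) model endpoint, so confirm in the current REST reference that it applies to custom avatar models before relying on it: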

    DELETE https://<region>.api.cognitive.microsoft.com/speechtotext/v3.2/models/<model-id>
    

    You will need:

    • Region name
    • Model ID from Speech Studio
    • Proper authentication

    After deletion, recreate and retrain.
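For illustration, the DELETE call can be prepared with Python's standard library as sketched below. The URL is copied exactly from the answer, and whether that path accepts custom avatar model IDs is not confirmed here. The function name is hypothetical, and authenticating with the resource key in the `Ocp-Apim-Subscription-Key` header is an assumption about what "proper authentication" means for your setup. The sketch only builds the request; nothing is sent.

```python
import urllib.request


def build_delete_request(region, model_id, subscription_key):
    """Build (but do not send) the DELETE request described in the answer.

    region: resource region, e.g. "westus2".
    model_id: model ID copied from Speech Studio.
    subscription_key: resource key (assumed authentication method).
    """
    url = (
        f"https://{region}.api.cognitive.microsoft.com"
        f"/speechtotext/v3.2/models/{model_id}"
    )
    return urllib.request.Request(
        url,
        method="DELETE",
        headers={"Ocp-Apim-Subscription-Key": subscription_key},
    )


# Placeholder values for illustration only.
req = build_delete_request("westus2", "00000000-0000-0000-0000-000000000000", "YOUR_KEY")
print(req.get_method(), req.full_url)
# To actually send it, uncomment the next line after double-checking the model ID:
# urllib.request.urlopen(req)
```

Keeping the send step separate makes it easier to review the URL and model ID before an irreversible delete.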

    Please refer to the Train model guide.

    I hope this helps. Do let me know if you have any further questions.

    Thank you!
