An Azure service that integrates speech processing into apps and services.
CRITICAL ISSUE: Azure AI Speech SDK – Numbers are added, deleted, and substituted, and recognition sometimes takes too long, when using the Microsoft real-time speech-to-text Cognitive Services API
We are using Azure Speech Service with the browser Speech SDK for real-time speech-to-text transcription. We are observing an issue when users speak continuous digits. The recognizer sometimes returns a significantly different number of digits than were…
Azure AI Speech
Speech Studio Text to Speech - Silence Not Working
I'm trying to add silence in my text to speech files (see image below), but the silence tags will not actually generate the specified silence where I input them when I preview the audio. The silence doesn't show up in the exported output either. Am I…
Azure AI Speech
Cohere Rerank v4.0 Fast returns 500 error in Azure AI Foundry when using DefaultAzureCredential
Hi, I am experiencing persistent 500 Internal Server Error issues when using Cohere Rerank v4.0 Fast in Azure AI Foundry. I have carefully followed the guidance provided in similar threads: Verified the exact deployment name:…
Azure AI Speech
Deploy Azure AI Speech with CognitiveResource or Microsoft Foundry
Does anyone have information on whether we should deploy AI Speech via microsoft.cognitiveservices/accounts or the new way via MicrosoftFoundry? Will microsoft.cognitiveservices/accounts be deprecated? The old Speech Studio at least doesn't seem to support…
Azure AI Speech
Special character ampersand (“&”) breaks word boundaries in Azure Text-to-Speech
Hello, I’m encountering an issue with word boundary events in Azure Text-to-Speech when the input text contains the ampersand character (&). Context Locale: fr-FR Neural French voice (e.g. fr-FR-Remy:DragonHDLatestNeural) Batch synthesis API …
Azure AI Speech
Pronunciation Assessment with Language en-GB- Phoneme symbols
I am using the pronunciation assessment API for language en-GB, doing the assessment at phoneme level. The documentation does mention this: AccuracyScore: Phoneme level, Syllable level (en-US only), Word level, Full Text level. I get a response with empty…
Azure AI Speech
Issue Creating Azure AI Language Resource in Custom Question Answering Lab
Hello, I am currently working on the Microsoft Applied Skills lab for Custom Question Answering. When attempting to create the Azure AI Language resource, the deployment fails with the following error: RequestDisallowedByPolicy – The resource was…
Azure AI Speech
Please clarify the conflicting information regarding permission to use the free tier of Azure Speech for commercial purposes, such as narration of a YouTube video.
Hello everyone, I had previously asked a question on this forum regarding whether the Free Tier F0 of Azure Speech can be used for commercial purposes such as narration of a YouTube video:…
Azure AI Speech
Can the audio generated by Azure Speech Studio's free tier (monthly limit of 500,000 characters) be used for commercial purposes, such as narration of a YouTube video?
Hello! I've searched this Q&A site extensively but found conflicting answers, so I thought I should ask directly. I have Azure Speech Studio's free tier (monthly limit of 500,000 characters). Can I use the audio generated by it for…
Azure AI Speech
Custom Neural Voice (CNV Pro) model in East US and East US 2 is failing to train the model
Training a Custom Neural Voice (CNV Pro) model in East US and East US 2 consistently fails after several hours with an internal/unknown error. The dataset uploads successfully and passes validation, but the training job never completes. It fails…
Azure AI Speech
Azure AI Foundry agents intermittently failing with JSON parsing error (empty response, no schema changes)
We are experiencing intermittent but increasingly frequent failures when running agents on Azure AI Foundry. Agents that were working correctly in the same environment, with no code or schema changes, suddenly started failing with the following…
Azure AI Speech
Python code to generate an ephemeral token for gpt-4o-mini-transcribe or gpt-4o-transcribe
Hi Team, We're unable to find a way, or Python code, to generate an ephemeral token for gpt-4o-mini-transcribe or gpt-4o-transcribe. We searched online and found some references for generating such tokens for the realtime API. But none for…
Azure AI Speech
Can someone help with how to configure the Voice Live SDK to receive animation blendshapes and viseme_id?
I tried adding this, but no animation data is received: modalities: ["text", "audio", 'animation'], outputAudioTimestampYypes: ["word"], animation: { modelName: "default", outputs:…
Azure AI Speech
Has MS abandoned human tech support?
Reading some of the nightmare scenarios on these forums and realizing that human tech support is a thing of the past really alarms me. It's obvious that since companies such as MS are pouring so much into AI, they've abandoned tech support from humans.…
Azure AI Speech
Issues with Azure Speech Services: Incorrect transcription of "draft" as "draught" and "£" as "lbs" in UK English
I'm using Azure Speech Services with the language set to UK English, and I've noticed two recurring transcription issues: When I dictate the word "draft", it consistently transcribes as "draught", even when the context clearly favors…
Azure AI Speech
High Initial Latency with Multi-Language Detection (3+ Languages)
Hello Azure Speech Team, We're experiencing significant initial latency when using Continuous Language Identification with 2+ languages in production. Configuration: Languages: 3 languages (en-IN, te-IN, hi-IN) Mode:…
Azure AI Speech
gpt-4o-transcribe for real-time speech-to-text transcription: slow speed
When I try to use gpt-4o-transcribe for real-time speech-to-text transcription, it takes about 1.5-2 seconds for a 2s mp3 file from sending the request to receiving the first token. Are there improved methods or other model options? Additionally,…
Azure AI Speech
Custom Avatar Model Training Showing as Processing After 16 Hours
I created an Azure AI Services resource in West US 2 (Test Avatar), then went to Speech Studio, uploaded all the required training data, and started the model training. But it has been showing an estimated 1 hour left for the last 8 hours.
Azure AI Speech
Transcription using gpt-4o-transcribe with gpt-realtime is failing in useast2
Hello, I am trying to use gpt-4o-transcribe with gpt-realtime in useast2, and it is consistently failing. I am using gpt-realtime with websockets as per the documentation. I am seeing the following event:…
Azure AI Speech
Pricing for Azure Voice Live API
We are evaluating the Azure Voice Live API for our contact center use case, automating with AI. However, we could not find the latest pricing for the Azure Voice Live API. We want to use the Pro version with Azure Speech and GPT 5.2 Chat (or suitable chat models).…
Azure AI Speech