2,121 questions with Azure AI Speech tags

Sort by: Updated
1 answer

Analyzer deploys successfully but silently ignores key schema fields - Azure AI Content Understanding (API version 2025-05-01-preview) in the West US region

An issue with Azure AI Content Understanding (API version 2025-05-01-preview) in the West US region. Deploying a custom analyzer intended for verbatim transcription, including filler word detection, speaker diarization, and word-level timestamps. While…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-23T20:14:06.34+00:00
Dennis 10 Reputation points
edited a comment 2025-08-29T17:39:07.2266667+00:00
Dennis 10 Reputation points
1 answer

Not able to access the custom neural voice trained model through API

Hi, Im pretty new to the world of Azure, and I have been trying to understand the capabilities of Custom neural voice. After multiple trial and error, I was able to train a model and even deploy it, but I'm fully stuck in trying to generate an audio…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-27T10:20:27.3033333+00:00
Vivek P N 0 Reputation points
edited an answer 2025-08-29T17:19:12.93+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator
0 answers

Error creating Speech Service resource with Azure for Students subscription (RequestDisallowedByAzure)

Hello, I’m using an Azure for Students subscription and I cannot create a Speech Service (Cognitive Services) resource. Details: Resource name: speech-lab-alice Regions attempted: East US, West Europe, West US 2, North Europe Error…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-29T12:02:59.3766667+00:00
Alice Candido Martins 0 Reputation points
1 answer

When will Azure AI Service Voice Live API transition from Public Preview to General Availability?

I am closely evaluating the Voice Live API for a Service. Currently, it is available in public preview as of July 31, 2025. To plan for production rollout, I would highly appreciate clarification on the roadmap: Is there an expected timeline or…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-29T08:16:53.01+00:00
Benjamin Meyer 0 Reputation points
commented 2025-08-29T10:23:13.9933333+00:00
Manas Mohanty 9,655 Reputation points Microsoft External Staff Moderator
1 answer

Urgent please: Custom Avatar Sample Code: How to set temperature?

Hi folks Have a look at https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/js/browser/avatar There's a place where we can call the OpenAI endpoint with a prompt. We'd like to also send a temperature for this. Where do we…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-26T17:48:43.0233333+00:00
It is VMS 170 Reputation points
edited an answer 2025-08-29T07:23:58.5233333+00:00
Manas Mohanty 9,655 Reputation points Microsoft External Staff Moderator
0 answers

Azure Fast Transcription API: InvalidAudioStream Error for M4A Files Over 10MB

Azure Fast Transcription API: InvalidAudioStream Error for M4A Files Over 10MB Issue Summary Azure Speech Service Fast Transcription API returns “InvalidAudioStream” error for M4A audio files larger than ~10MB, while smaller M4A files from the same…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-21T13:15:32.0633333+00:00
Ali Uthuman 0 Reputation points
commented 2025-08-29T00:25:02.43+00:00
Ravada Shivaprasad 1,115 Reputation points Microsoft External Staff Moderator
1 answer

UK South AI Speech endpoint does not return expected Latin script for KMR language code. Works fine in US.

Submit transcription request in US region endpoint for kmr language code (Kurdish, Latin script), you get Kurdish with Latin Script Submit transcription request in UK South Region endpoint for kmr language code (Kurdish, Latin Script) you get Kurdish…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-19T16:57:25.1133333+00:00
Justin Zimmer 0 Reputation points
commented 2025-08-28T17:20:48.83+00:00
Ravada Shivaprasad 1,115 Reputation points Microsoft External Staff Moderator
0 answers

What specific outbound and inbound ports does a client have to have open for Azure Speech to Text in Canada Central?

I have customer that's been using my application which sends audio to my Azure Speech to Text service. Everything has been working without any issues for quite some time when we began to see the following error in the logs: Line 118: 2025-08-25…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-26T14:22:56.87+00:00
Billy Mandilaras 0 Reputation points
commented 2025-08-28T13:06:53.59+00:00
Billy Mandilaras 0 Reputation points
0 answers

Does live chat avatar synthesis support WordBoundary events?

I was trying to set up the WordBoundary event callback for my live chat avatar synthesis but the callback is never run (the avatar speaks in the front-end but I get no events). That brings me to the question - are these events even supported for live…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-07T13:03:07.6866667+00:00
Mindaugas Giedraitis 0 Reputation points
edited a comment 2025-08-28T07:35:28.5166667+00:00
Mindaugas Giedraitis 0 Reputation points
0 answers

Issue with digits recognition in speech to text tool

Hi, I have an issue with number recognition in streaming API. The problem is that some of the following digits of the same value in number are skipped or multiplied. Here is the example: "81118" recognized "811118." This is…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-26T09:10:19.53+00:00
Artur Bondek 0 Reputation points
commented 2025-08-28T05:12:03.59+00:00
Manas Mohanty 9,655 Reputation points Microsoft External Staff Moderator
1 answer

Missing Azure TTS voices with Genesys Cloud CX Connector

The Genesys Cloud CX contact center platform has a Microsoft Azure TTS connector available in their AppFoundry which we are using – the connector itself is provided by Microsoft and is called “Microsoft Azure Cognitive Services Text To Speech”. We have…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-07-11T15:33:17.8766667+00:00
Rich Bartolucci 0 Reputation points
answered 2025-08-26T19:04:05.0933333+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator
1 answer

Speech SDK – Inconsistent Speech-to-Text Recognition Accuracy

We are currently facing issues with Azure Speech Service using the Speech SDK for voice-to-text recognition. The service is not providing consistent and accurate transcriptions. Issues observed: Some words are being completely missed during…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-26T12:53:11.0533333+00:00
Anandu K B 0 Reputation points
answered 2025-08-26T15:39:21.7033333+00:00
Sina Salam 24,096 Reputation points Volunteer Moderator
0 answers

Failed to upload data cnv_training_package_root_flat_TAB.zip. Error: Status: 400. We cannot pair your audio files with the transcripts. Make sure in the transcript file you have included the name of your audios correctly. Try again in a few moments.

Hello. I have been trying for two hours to try and upload some wav files and the .txt files to accompany them. the descriptions match exactly. I have tried as many variations as I can think of. It is very frustrating having had the voice talent do the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-26T15:06:58.62+00:00
Craig Lowther 0 Reputation points
1 answer

Azure Voice LIve API sound issues

I am using Azure voice Live Api Fot Realtime communication. using Reactjs and Fastapi. The application is working fine but the sound quality is not good. there is scratching vioce and some noise when there is respons. How to solve this voice…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-26T03:04:14.37+00:00
Intikhab Hussain 0 Reputation points
edited an answer 2025-08-26T06:38:57.49+00:00
Gowtham CP 6,030 Reputation points Volunteer Moderator
1 answer

Azure Real-Time diarization

Hi! I am working on a project in Python, in which I use Azure AI Speech Service. More specifically, I implemented real-time dairization using the azure.cognitiveservices.speech.transcription.ConversationTranscriber class. And now I am working on speaker…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-07-11T10:34:16.3133333+00:00
Karyna Khinevich 0 Reputation points
answered 2025-08-25T18:59:29.0133333+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator
1 answer

Speech services - audio content creation - lexicons

HI All, I was working on text to speech (audio content creation). I created a text file, apply my customized lexicon (that used to work), but when I was previewing the file, meaning listening to it before file export, lexicon wasnt applied at all. Rules…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-13T09:24:04.4433333+00:00
Gruber, Lukáš 0 Reputation points
answered 2025-08-25T18:14:36.4566667+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator
1 answer

Azure en-US neural voices reads large numbers incorrectly

We are having a problem with the way Azure TTS reads out numbers for United States English neural voices. The Azure TTS engine reads 32,768 as "thirty-two thousand seven hundred and sixty-eight" for all English locales, it seems. This is the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-21T18:17:54.5133333+00:00
Jason Horner 0 Reputation points
answered 2025-08-25T16:10:10.78+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator
1 answer

Training speech translation

I am currently working with the Azure Speech Translation service. I understand that it is possible to upload data to improve the performance of the written translation services, and I would like to confirm whether a similar capability exists for speech…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-19T16:03:10.57+00:00
AART AART 0 Reputation points
answered 2025-08-25T11:04:53.86+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator
1 answer

Latency Issue In Speech To Text Realtime API

We are using Azure Speech-to-Text (STT) streaming API from the Central India region and experiencing consistent latency of 1.5 to 2 seconds from audio input to transcription result. Our setup: SDK: JavaScript/Node SDK using SpeechRecognizer with…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-07T12:47:06.6733333+00:00
Nidoos Solutions 20 Reputation points
answered 2025-08-22T09:04:16.3+00:00
Aryan Parashar 530 Reputation points Microsoft External Staff Moderator
1 answer

DirectLine Speech service on Android emulators gets "Connection failed (no connection to the remote host). Internal error: 1. Error details: Failed with error: WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED". No errors on Android physical devices.

About 3 or 4 weeks ago, my MAUI chatbot client, when running on Android emulators, calling DirectLine speech service, started getting Connector_canceled WS_OPEN_ERROR_UNDERLYING_IO_OPEN_FAILED errors. This bug reproduces using Microsoft's "Sample:…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,121 questions
asked 2025-08-17T17:51:06.9433333+00:00
Bruce Haley 95 Reputation points
answered 2025-08-22T08:41:01.47+00:00
Amira Bedhiafi 36,716 Reputation points Volunteer Moderator