Hello Anandu K B,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
I understand that you are having inconsistent Speech-to-Text Recognition Accuracy in Speech SDK for your AI Speech.
Since the problem is intermittent, it confirms that while the service itself is functional, your specific usage pattern or input data is causing the unpredictability.
Therefore, you must first eliminate client-side variables by testing with identical audio files to isolate service-side issues. Then, enforce consistency by ensuring your audio input always meets optimal format requirements and implement detailed logging to capture confidence scores for each recognition result. The most critical step is to quantify the error rate objectively using Speech Studio's accuracy testing.
For better resolution:
- Validate audio format compliance: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-use-audio-input-streams
- Implement detailed logging with word-level timestamps:
config.SetServiceProperty("wordLevelTimestamps", "true", ServicePropertyChannel.UriQueryParameter);
config.OutputFormat = OutputFormat.Detailed;
- Quantify errors with Speech Studio testing: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-evaluate-data
- For persistent issues, contact support with specific audio samples and timestamps from failed sessions correlated with Azure status history - https://status.azure.com/status via your Azure Portal.
I hope this is helpful! Do not hesitate to let me know if you have any other questions or clarifications.
Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.