Real-time Audio in Meetings Using a Bot

Kiet Chau 20 Reputation points
2025-02-20T10:17:39.0033333+00:00

Using Bot, can I get audio in Real Time Meeting?

If yes, how can I get it. Thanks.

Azure AI Bot Service
Azure AI Bot Service
An Azure service that provides an integrated environment for bot development.
0 comments No comments
{count} votes

Accepted answer
  1. Divyesh Govaerdhanan 8,590 Reputation points
    2025-02-20T17:06:04.13+00:00

    Hello,

    Welcome to Microsoft Q&A,

    Step 1: Use Microsoft Graph API for Teams Call Integration

    Example: Using Microsoft Graph API to Join a Call

    var
    

    Step 2: Enable Real-Time Media Streaming

    Example: Handling Audio Streams in a Bot

    public
    

    With this you will get the Audio stream of the team meeting, If you would like to convert this to text for transcription, You could use the Azure Speech service.

    Step 3: Stream Audio to Azure Speech Service

    • Route the extracted media stream to Azure Speech Service for transcription.
    • Reference: Azure Speech-to-Text

    Example: Sending Audio Stream to Speech-to-Text API

    using
    

    Step 4: Use Direct Line Speech for Enhanced Integration

    • If required, consider using Direct Line Speech to improve communication between the bot and Azure Speech Service.
    • Reference: Direct Line Speech Integration

    Example: Enabling Direct Line Speech for a Bot

    {
        "type": "directlinespeech",
        "serviceEndpoint": "https://directline.botframework.com/v3/directline"
    }
    

    As part of the next steps, to get your bot fully functional, deploy it using Azure Bot Service (Deploy a Teams Bot), ensure you grant the necessary Graph API permissions for calls and meetings (Graph API Permissions), and optimize audio processing for real-time transcription using Azure Speech Streaming API (Azure Speech Streaming API).

    Please Upvote and accept the answer if it helps!

    0 comments No comments

2 additional answers

Sort by: Most helpful
  1. Mark Thomas 0 Reputation points
    2025-08-12T18:06:11.4466667+00:00

    You can get real-time audio from a Microsoft Teams meeting building your own bot, but it requires setting up the full Graph API + Teams Calling Bot flow.

    At a high level:

    1. Register a Teams calling bot in Azure and enable it for media streaming. ◦ Requires the Calls.AccessMedia.All Graph API permission for app-hosted media.
    2. Have the bot join the meeting as a participant via the Graph API.
      ◦ The bot receives the RTP audio packets in real time.
    3. Route the audio into your transcription or processing service. ◦ You can use Azure Cognitive Services Speech-to-Text, or you can stream it to another service.

    If you don’t want to handle the media pipeline, hosting, and permissions work yourself, there are APIs like Recall.ai’s Meeting Bot API that provide real-time audio, video, and transcripts from Teams (and other meeting platforms) through a single integration. That way you can skip the bot infrastructure and focus directly on what you want to build with the meeting data.

    Feel free to DM me if you have questions

    0 comments No comments

  2. Jeeva 0 Reputation points
    2025-08-18T11:43:00.0333333+00:00

    Can I capture live audio and video separately from a Microsoft Teams meeting in real time? If yes, are there any concepts or APIs that developers should be aware of, especially those that may be deprecated soon?

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.