Share via

bot joining Teams meetings and receiving real-time audio- python

Nathan V 45 Reputation points
2026-03-04T13:15:42.3766667+00:00

We are building a Teams bot that acts as a notetaker in meetings. We need the bot to join meetings and receive the real-time audio stream so we can send it to a speech-to-text service and show live transcription/notes.

Our entire backend is in Python (Bot Framework, aiohttp, Graph API for post-meeting transcripts). We see that:

  • Joining calls and receiving real-time media (raw audio) is only documented for C# / .NET via the Graph Communications SDK (Microsoft.Graph.Communications.Calls.Media).
  • There does not appear to be a Python (or Node.js) API for the same real-time media/call-join flow.

Questions

  1. Is there an official or supported way for a Python bot to join a Teams meeting and receive the real-time audio stream (e.g. Python SDK, REST/WebSocket API, or recommended bridge)?
  2. If not, are there roadmap plans to provide Python support for the same capabilities (join meeting + receive live audio) that the C# Graph Communications SDK offers?
  3. Is there any alternative, supported approach (e.g. different API or product) for building a Python-only bot that joins meetings and gets real-time audio for transcription/notetaking?

Use case

  • Bot is invited to a Teams meeting.
  • Bot joins and receives the meeting audio in real time.
  • We process the audio in Python (e.g. with Azure Speech or another service) to generate live captions/notes.
  • We want to implement this without maintaining a separate C# media bot if possible.

Environment

  • Bot: Python, Bot Framework, Microsoft Graph API.
  • Goal: Python-only solution for joining meetings and receiving live audio, or clear guidance on the recommended approach.We are building a Teams bot that acts as a notetaker in meetings. We need the bot to join meetings and receive the real-time audio stream so we can send it to a speech-to-text service and show live transcription/notes. Our entire backend is in Python (Bot Framework, aiohttp, Graph API for post-meeting transcripts). We see that:
    • Joining calls and receiving real-time media (raw audio) is only documented for C# / .NET via the Graph Communications SDK (Microsoft.Graph.Communications.Calls.Media).
    • There does not appear to be a Python (or Node.js) API for the same real-time media/call-join flow.
    Questions
    1. Is there an official or supported way for a Python bot to join a Teams meeting and receive the real-time audio stream (e.g. Python SDK, REST/WebSocket API, or recommended bridge)?
    2. If not, are there roadmap plans to provide Python support for the same capabilities (join meeting + receive live audio) that the C# Graph Communications SDK offers?
    3. Is there any alternative, supported approach (e.g. different API or product) for building a Python-only bot that joins meetings and gets real-time audio for transcription/notetaking?
    Use case
    • Bot is invited to a Teams meeting.
    • Bot joins and receives the meeting audio in real time.
    • We process the audio in Python (e.g. with Azure Speech or another service) to generate live captions/notes.
    • We want to implement this without maintaining a separate C# media bot if possible.
    Environment
    • Bot: Python, Bot Framework, Microsoft Graph API.
    • Goal: Python-only solution for joining meetings and receiving live audio, or clear guidance on the recommended approach.
Microsoft Teams | Development
Microsoft Teams | Development

Building, integrating, or customizing apps and workflows within Microsoft Teams using developer tools and APIs

0 comments No comments
{count} votes

Answer accepted by question author
  1. Kudos-Ng 14,515 Reputation points Microsoft External Staff Moderator
    2026-03-04T14:28:09.4866667+00:00

    Hi Nathan V,

    Thank you for posting your question in the Microsoft Q&A forum.

    After reviewing the related documentations and verifying the information, I am very sorry to say that: currently, there is no Python SDK, REST, or WebSocket interface for joining Teams meetings and receiving real-time audio. This capability is exclusive to application-hosted media bots built on the Microsoft.Graph.Communications.Calls.Media .NET library, running on Windows. No alternative language support has been announced on the roadmap in the near future.

    For a Python-only notetaker, if raw real-time audio is not a hard requirement, Graph Meeting Transcripts is the recommended official path. If real-time is mandatory and you want to stay Python-only, the .NET media bot remains the only fully documented Microsoft-native option for raw audio.

    That said, there have been several users who also looked for other supported SDK languages to handle real‑time media processing. Because of this, you may consider raising feedback on the Feedback Portal so that Microsoft can review it and potentially include it in the near future.

    I hope the information above helpful.


    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.