Here, I am utilizing the getDisplayMedia and getUserMedia APIs to capture both audio and video from the microphone and system in separate streams. Additionally, I am utilizing APIs for real-time transcription with diarization.
However, the challenge lies in recording the screen without displaying a screen sharing dialog box for user consent.
I have observed certain Chrome extensions, such as Tactiq and Airgram, that enable screen sharing without prompting the screen sharing dialog box in the Chrome extension when on google meet.
I have utilized two APIs for capturing audio and video from both the system and microphone, combining them into a single stream. This combined stream is then sent to the Google Cloud Speech-to-Text API for real-time transcription with diarization and other functionalities.
However, my requirement is to record the screen without triggering the 'screen sharing dialog box'. Specifically, I aim to capture the screen while simultaneously capturing microphone audio, all within a Chrome extension using Manifest V3, without the need for user interaction to grant screen sharing permissions.