How to use Gradio interface to auto submit the audio when recording is done?

Question

I am using the following Gradio sample code to transcribe my audio:

from transformers import pipeline
p = pipeline("automatic-speech-recognition")

import gradio as gr

def transcribe(audio):
    text = p(audio)["text"]
    return text

gr.Interface(
    fn=transcribe, 
    inputs=gr.Audio(source="microphone", type="filepath"), 
    outputs="text").launch()

However, the user has to start recording, stop recording, and then submit the audio. Can I auto-submit the audio when the user presses stop?

score 2 · Answer 1 · answered Dec 02 '22 at 19:40

I found the solution. I am posting it here for others' reference.

import gradio as gr

from transformers import pipeline

p = pipeline("automatic-speech-recognition")

def transcribe(audio):
    text = p(audio)["text"]
    return text

gr.Interface(
    fn=transcribe, 
    inputs=gr.Audio(source="microphone", type="filepath"), 
    outputs="text",
    live=True,  # rerun transcribe automatically whenever the audio input changes
).launch()

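Adding live=True does the job: the interface reruns transcribe automatically whenever the audio input changes, so the transcription appears as soon as recording stops, without pressing Submit.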
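
One thing to watch for with a live interface: the function may also be called when the recording is cleared, in which case the audio argument can be None instead of a file path. Below is a minimal sketch of a guard for that case. The names make_transcriber and build_demo are hypothetical helpers introduced here for illustration, not part of Gradio or transformers, and source="microphone" assumes Gradio 3.x as in the question (Gradio 4 and later use sources=["microphone"] instead).

```python
def make_transcriber(asr):
    """Wrap an ASR callable so it tolerates the empty input a live interface can send.

    `asr` is any callable that takes an audio file path and returns {"text": ...},
    such as the transformers pipeline from the question.
    """
    def transcribe(audio):
        # With live=True the function may run when the recording is cleared;
        # `audio` is then None rather than a file path.
        if audio is None:
            return ""
        return asr(audio)["text"]
    return transcribe


def build_demo():
    """Build the live interface (hypothetical helper; not called automatically)."""
    # Imported here so make_transcriber can be used without these packages installed.
    import gradio as gr
    from transformers import pipeline

    p = pipeline("automatic-speech-recognition")
    return gr.Interface(
        fn=make_transcriber(p),
        # Gradio 3.x argument, as in the question
        inputs=gr.Audio(source="microphone", type="filepath"),
        outputs="text",
        live=True,  # no Submit button; reruns when the input changes
    )


# To start the app: build_demo().launch()
```

Passing the pipeline in as an argument keeps the None check separate from the model, so the guard can be tried out with a stand-in callable before loading the real one.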

score 0 · Answer 2 · answered Dec 02 '22 at 19:33

You could try an auto-submit option. Something like this should work:

# auto-submit after 5 seconds
gr.Interface(
    fn=transcribe,
    inputs=gr.Audio(source="microphone", type="filepath"),
    outputs="text",
    auto_submit=True,
    auto_submit_duration=5).launch()
