How to convert a audio-segment file into bytes type?

Asked Aug 11 '22 at 08:08

Active Aug 11 '22 at 08:08

Viewed 1,051 times

I want to make a speech recognition from a wav. To do that, I have a wav that I split into multiple chunks, export them, and then use the SpeechRecognition library.

from pydub import AudioSegment
import speech_recognition as sr

r = sr.Recognizer()
for i in range(5):
    audio = AudioSegment.from_wav("some_wav.wav")
    audio_chunk=audio[int(i*1000):int(i*3000)]
    audio_chunk.export('test.wav', format='wav')
    detection = sr.AudioFile('test.wav')

    with detection as source:
        audio = r.record(source)

    word = r.recognize_google(audio, language = 'ro-RO')

The problem is that this is not very optimal. I want to get rid of the export wav part. I want to transform the audio_chunk into bytes and then use it in the speechRecognition.AudioFile() with in-memory bytes.

Is there a way to convert the audio-segment type into bytes?

asked Aug 11 '22 at 08:08

TheGainadl

This helped me to convert an AudioSegment to bytes https://stackoverflow.com/q/67631465/17524305 – Cara Duf Nov 25 '22 at 13:29

How to convert a audio-segment file into bytes type?

0 Answers0

Linked