Use case: I have put my phone on speaker, dialled customer care ,recorded automated voice message on my laptop , now I want convert that into text.
What I did until now ?
I tried with speech recognition library in python
import speech_recognition as sr
import time
r=sr.Recognizer()
#This block will help in recognising audio file harvard = sr.AudioFile('voice1.wav') with harvard as source:
r.adjust_for_ambient_noise(source)
audio = r.record(source,offset=0, duration=5)
print r.recognize_google(audio, show_all=True)
However, no text is returned.