The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.
Questions tagged [speech-to-text]
2372 questions
0
votes
2 answers
How do I modify speech contexts in Dialogflow fulfillment
An issue I am seeing is that when I ask in dialogflow for the user to spell out their user id like joesmith2014, there are a large number of errors. The follow post suggested that I can fix this by using speech context to tell the speech to text…

Gavin Siu
- 81
- 1
- 9
0
votes
0 answers
google-cloud-storage library cannot be installed in Python 3.7
For a project I need to import google-cloud-storage library in python to use speech to text service. While trying to install it via pip, I keep getting the below error:
" Looking in indexes:…

Niladri
- 29
- 4
0
votes
0 answers
Is there any Google Speech to Text API health check endpoint
Is there any endpoint that could be called to check if google speech to text api is working?
I need to check in time intervals if
us-speech.googleapis.com and eu-speech.googleapis.com
are not down

justasking
- 3
- 2
0
votes
0 answers
How do I programmatically enable a microphone in a web app, without any user tapping?
I'm building a basic web app using Firebase. My goal is to have the user interact with my app only using their voice, without any tapping whatsoever.
I'm using GCP's DialogFlow to register the intent (e.g. "Hey web app"), however I need to…
0
votes
1 answer
Deactivate speech to text service
I am using IBM's speech-to-text service and it says active. How do I deactivate it so I don't use all of my minutes? I have looked everywhere and cant find a way to deactivate it.

Michael
- 3
- 3
0
votes
1 answer
Python speech_recognition using Google Web Speech API not working
I am trying to use the Google Web Speech API in Python. I just tried the following code:
import speech_recognition as sr import pyaudio r = sr.Recognizer() with sr.Microphone() as source:
# read the audio data from the default microphone
…

uk_butterfly
- 93
- 1
- 2
- 8
0
votes
1 answer
Cannot import module torchaudio.prototype
I wanted to make ctc_decoder using torchaudio ctc_decoder module. According to this tutorial ASR INFERENCE WITH CTC DECODER
it should have been easy to import as usual but I am unable to do so in google colab even after installing torchaudio.
It…

Eripsa
- 1
- 2
0
votes
0 answers
Cross-browser speech recognition?
I'm trying to build a web search engine with Speech Recognition support, just like google.
So far, it works on Chrome but not on Firefox. I read from Mozilla's site that it doesn't properly support web speech API but how for instance google search…
user16361980
0
votes
1 answer
How do I get my text from speech rendered to Angular Html?
I am trying to render a variable called "this.text" to my app.component.html. I am able to change "this.text" through a function called "recognition.onresult" but it doesn't render or change to the screen. The "recognition.onresult" function was…

tg marcus
- 157
- 2
- 13
0
votes
1 answer
How to get the speech from a video into a text file in text?
Can anyone guide me on how to get the speech from a video into a text file in text? I tried but I am getting this error- "raise RequestError("recognition request failed: {}".format(e.reason))
speech_recognition.RequestError: recognition request…

Laxmi
- 21
- 8
0
votes
0 answers
Getting Live Audio from a webcam to PyAudio?
I'm working on a project where I'm streaming live audio from an old Android phone to a server (Ubuntu) for python speech recognition. I can listen to the live audio stream with VLC fine but can't figure out how to get it into Python.
I'm using this…

James Watson
- 35
- 1
- 4
0
votes
1 answer
How can i optimize my python code for speech recognition?
I'm trying to implement a code that opens links from speech recognition. How can i write easily that if for example I said "google" it will go on specific branch and computer will ask me to dictate the link that it should follow to open…

Geek97M
- 1
0
votes
0 answers
Microsoft speech-to-text restful api return “Empty reply from server”
the question link is
Currently I am using speech-to-text restful api ,the ogg container format is written by myself ,this ogg file can be played directly ,but i don't know why the microsoft engine return “Empty reply from server” , the retCode is…

zpzhuang
- 61
- 1
- 3
0
votes
1 answer
Web speech api closes after some seconds
I'm using web speech api
https://www.google.com/intl/en/chrome/demos/speech.html
but mic automatically closes after some seconds but i have to close mix only when the user clicks on close buttons.
Any solution to resolve this issue.
Thanks

user3653474
- 3,393
- 6
- 49
- 135
0
votes
1 answer
Improving accuracy of speech recognition using Vosk (Kaldi) running on Android
I am developing an application to collect data in the field on Android devices using speech recognition. There are five "target words", as well as several numbers (zero, one, ten, one-hundred, etc) that are recognized.
I have improved accuracy of…

portsample
- 1,986
- 4
- 19
- 35