Questions tagged [speech-to-text]

The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.

2372 questions
0
votes
2 answers

How do I modify speech contexts in Dialogflow fulfillment

An issue I am seeing is that when I ask in dialogflow for the user to spell out their user id like joesmith2014, there are a large number of errors. The follow post suggested that I can fix this by using speech context to tell the speech to text…
Gavin Siu
  • 81
  • 1
  • 9
0
votes
0 answers

google-cloud-storage library cannot be installed in Python 3.7

For a project I need to import google-cloud-storage library in python to use speech to text service. While trying to install it via pip, I keep getting the below error: " Looking in indexes:…
0
votes
0 answers

Is there any Google Speech to Text API health check endpoint

Is there any endpoint that could be called to check if google speech to text api is working? I need to check in time intervals if us-speech.googleapis.com and eu-speech.googleapis.com are not down
0
votes
0 answers

How do I programmatically enable a microphone in a web app, without any user tapping?

I'm building a basic web app using Firebase. My goal is to have the user interact with my app only using their voice, without any tapping whatsoever. I'm using GCP's DialogFlow to register the intent (e.g. "Hey web app"), however I need to…
0
votes
1 answer

Deactivate speech to text service

I am using IBM's speech-to-text service and it says active. How do I deactivate it so I don't use all of my minutes? I have looked everywhere and cant find a way to deactivate it.
Michael
  • 3
  • 3
0
votes
1 answer

Python speech_recognition using Google Web Speech API not working

I am trying to use the Google Web Speech API in Python. I just tried the following code: import speech_recognition as sr import pyaudio r = sr.Recognizer() with sr.Microphone() as source: # read the audio data from the default microphone …
0
votes
1 answer

Cannot import module torchaudio.prototype

I wanted to make ctc_decoder using torchaudio ctc_decoder module. According to this tutorial ASR INFERENCE WITH CTC DECODER it should have been easy to import as usual but I am unable to do so in google colab even after installing torchaudio. It…
0
votes
0 answers

Cross-browser speech recognition?

I'm trying to build a web search engine with Speech Recognition support, just like google. So far, it works on Chrome but not on Firefox. I read from Mozilla's site that it doesn't properly support web speech API but how for instance google search…
user16361980
0
votes
1 answer

How do I get my text from speech rendered to Angular Html?

I am trying to render a variable called "this.text" to my app.component.html. I am able to change "this.text" through a function called "recognition.onresult" but it doesn't render or change to the screen. The "recognition.onresult" function was…
tg marcus
  • 157
  • 2
  • 13
0
votes
1 answer

How to get the speech from a video into a text file in text?

Can anyone guide me on how to get the speech from a video into a text file in text? I tried but I am getting this error- "raise RequestError("recognition request failed: {}".format(e.reason)) speech_recognition.RequestError: recognition request…
0
votes
0 answers

Getting Live Audio from a webcam to PyAudio?

I'm working on a project where I'm streaming live audio from an old Android phone to a server (Ubuntu) for python speech recognition. I can listen to the live audio stream with VLC fine but can't figure out how to get it into Python. I'm using this…
James Watson
  • 35
  • 1
  • 4
0
votes
1 answer

How can i optimize my python code for speech recognition?

I'm trying to implement a code that opens links from speech recognition. How can i write easily that if for example I said "google" it will go on specific branch and computer will ask me to dictate the link that it should follow to open…
0
votes
0 answers

Microsoft speech-to-text restful api return “Empty reply from server”

the question link is Currently I am using speech-to-text restful api ,the ogg container format is written by myself ,this ogg file can be played directly ,but i don't know why the microsoft engine return “Empty reply from server” , the retCode is…
zpzhuang
  • 61
  • 1
  • 3
0
votes
1 answer

Web speech api closes after some seconds

I'm using web speech api https://www.google.com/intl/en/chrome/demos/speech.html but mic automatically closes after some seconds but i have to close mix only when the user clicks on close buttons. Any solution to resolve this issue. Thanks
user3653474
  • 3,393
  • 6
  • 49
  • 135
0
votes
1 answer

Improving accuracy of speech recognition using Vosk (Kaldi) running on Android

I am developing an application to collect data in the field on Android devices using speech recognition. There are five "target words", as well as several numbers (zero, one, ten, one-hundred, etc) that are recognized. I have improved accuracy of…
portsample
  • 1,986
  • 4
  • 19
  • 35
1 2 3
99
100