Highest Voted 'speech-to-text' Questions

0

votes

2 answers

How do I modify speech contexts in Dialogflow fulfillment

One issue I am seeing is that when I ask the user in Dialogflow to spell out their user ID, such as joesmith2014, there are a large number of errors. The following post suggested that I can fix this by using speech context to tell the speech to text…

google-app-engine dialogflow-es speech-to-text

asked Mar 10 '22 at 14:09

Gavin Siu

81
1
9

0

votes

0 answers

google-cloud-storage library cannot be installed in Python 3.7

For a project, I need to import the google-cloud-storage library in Python to use the speech-to-text service. While trying to install it via pip, I keep getting the error below: "Looking in indexes:…

python google-cloud-platform google-cloud-storage speech-to-text

asked Mar 03 '22 at 13:57

Niladri

29
4

0

votes

0 answers

Is there any Google Speech to Text API health check endpoint

Is there an endpoint that can be called to check whether the Google Speech-to-Text API is working? I need to check at regular intervals that us-speech.googleapis.com and eu-speech.googleapis.com are not down.

google-api speech-to-text google-speech-to-text-api

asked Mar 01 '22 at 08:35

justasking

3
2

0

votes

0 answers

How do I programmatically enable a microphone in a web app, without any user tapping?

I'm building a basic web app using Firebase. My goal is to have the user interact with my app using only their voice, without any tapping whatsoever. I'm using GCP's Dialogflow to register the intent (e.g. "Hey web app"), however I need to…

dialogflow-es speech-to-text microphone

asked Feb 28 '22 at 23:48

Tanker245859

1

0

votes

1 answer

Deactivate speech to text service

I am using IBM's speech-to-text service and it shows as active. How do I deactivate it so I don't use all of my minutes? I have looked everywhere and can't find a way to deactivate it.

ibm-cloud ibm-watson speech-to-text

asked Feb 28 '22 at 02:41

Michael

3
3

0

votes

1 answer

Python speech_recognition using Google Web Speech API not working

I am trying to use the Google Web Speech API in Python. I just tried the following code: import speech_recognition as sr import pyaudio r = sr.Recognizer() with sr.Microphone() as source: # read the audio data from the default microphone …

python python-3.x speech-recognition speech-to-text

asked Feb 24 '22 at 19:56

uk_butterfly

93
1
2
8

0

votes

1 answer

Cannot import module torchaudio.prototype

I wanted to build a CTC decoder using torchaudio's ctc_decoder module. According to the tutorial "ASR Inference with CTC Decoder", it should be easy to import as usual, but I am unable to do so in Google Colab even after installing torchaudio. It…

pytorch google-colaboratory speech-to-text ctc torchaudio

asked Feb 13 '22 at 16:19

Eripsa

1
2

0

votes

0 answers

Cross-browser speech recognition?

I'm trying to build a web search engine with speech recognition support, just like Google. So far, it works on Chrome but not on Firefox. I read on Mozilla's site that it doesn't properly support the Web Speech API, but how, for instance, Google search…

javascript speech-recognition speech-to-text

asked Feb 07 '22 at 20:34

user16361980

0

votes

1 answer

How do I get my text from speech rendered to Angular Html?

I am trying to render a variable called "this.text" in my app.component.html. I am able to change "this.text" through a function called "recognition.onresult", but it doesn't render or change on the screen. The "recognition.onresult" function was…

javascript angular speech-recognition speech-to-text

asked Jan 29 '22 at 14:38

tg marcus

157
2
13

0

votes

1 answer

How to get the speech from a video into a text file?

Can anyone guide me on how to get the speech from a video into a text file? I tried, but I am getting this error: "raise RequestError("recognition request failed: {}".format(e.reason)) speech_recognition.RequestError: recognition request…

python python-3.x speech-recognition speech-to-text moviepy

asked Jan 29 '22 at 14:04

Laxmi

21
8

0

votes

0 answers

Getting Live Audio from a webcam to PyAudio?

I'm working on a project where I'm streaming live audio from an old Android phone to a server (Ubuntu) for Python speech recognition. I can listen to the live audio stream with VLC fine, but can't figure out how to get it into Python. I'm using this…

python ubuntu audio speech-to-text pyaudio

asked Jan 23 '22 at 01:47

James Watson

35
1
4

0

votes

1 answer

How can I optimize my Python code for speech recognition?

I'm trying to write code that opens links using speech recognition. How can I easily write it so that if, for example, I say "google", it goes down a specific branch and the computer asks me to dictate the link that it should follow to open…

python speech-recognition speech-to-text google-text-to-speech

asked Jan 18 '22 at 18:48

Geek97M

1

0

votes

0 answers

Microsoft speech-to-text RESTful API returns “Empty reply from server”

Currently I am using the speech-to-text RESTful API. I wrote the Ogg container format myself, and the Ogg file can be played directly, but I don't know why the Microsoft engine returns “Empty reply from server”. The retCode is…

speech-to-text

asked Jan 13 '22 at 07:24

zpzhuang

61
1
3

0

votes

1 answer

Web Speech API closes after a few seconds

I'm using the Web Speech API (https://www.google.com/intl/en/chrome/demos/speech.html), but the mic automatically closes after a few seconds. I need the mic to close only when the user clicks the close button. Is there any solution to this issue? Thanks

javascript speech-recognition speech-to-text webspeech-api

asked Jan 10 '22 at 14:18

user3653474

3,393
6
49
135

0

votes

1 answer

Improving accuracy of speech recognition using Vosk (Kaldi) running on Android

I am developing an application to collect data in the field on Android devices using speech recognition. There are five "target words", as well as several numbers (zero, one, ten, one-hundred, etc.) that are recognized. I have improved accuracy of…

speech-to-text kaldi vosk

asked Jan 04 '22 at 21:04

portsample

1,986
4
19
35
