The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.
Questions tagged [speech-to-text]
2372 questions
0
votes
1 answer
Is there a way for me to combine DialogFlow with HTML Web Speech API?
I am creating a speech recognition + synthesis platform through the browser using webkit speech API. I was wondering if there was a way to capture Dialog flow's agent response and then run it through web speech API.
For example, I've built a chatbot…

spin99
- 25
- 4
0
votes
1 answer
Microsoft Azure Speech Synthesizer stops talking even when it returns SynthesizingAudioCompleted (Python)
I'm working on a chatbot that uses Azure SpeechRecognizer and SpeechSynthesizer to talk with users. After 5-10 turns, the chatbot stops talking mid-utterance, even though I get a ResultReason.SynthesizingAudioCompleted status on the entire…

user13924150
- 11
- 1
0
votes
1 answer
Nodejs/c++ addon - getting error "undefined symbol: speech_config_from_subscription" from Microsoft speech SDK on ubuntu 18.4 server
Actually, I want to do speech transcription with passing MULAW (g711) audio format to microsoft-speech-sdk (Nodejs), but MULAW streaming audio format is not supported to microsoft-speech-sdk (Nodejs).
So, for this required GStreamer with C++.
So,…

Tushar Kadam
- 173
- 1
- 1
- 9
0
votes
1 answer
Teach iOS SFSpeechRecognizer custom words (medication names, chemistry terms, etc)
I have a question regarding the integration of the speech to text iOS library called SFSpeechRecognizer.
I need SFSpeechRecognizer to recognize terms that are not present in the iOS dictionary like medication names, chemistry terms, etc.
I would…

Bruno Coelho
- 926
- 13
- 25
0
votes
2 answers
Google Speech to Text the environment variable GOOGLE_APPLICATION_CREDENTIALS must be defined pointing to a file defining the credentials
When I use Windows system to set the environment variable program can run normally, change to Linux test, appeared
2022-06-27 12:10:27,814 [pool-6-thread-1] ERROR InfiniteStreamRecognize -
java.io.IOException: The Application Default Credentials are…
0
votes
1 answer
How to use a regional endpoint in Google Speech-to-Text C# API?
I would like to use the European endpoint eu-speech.googleapis.com for the Google Cloud speech recognition C# API. An example is given for Python at
https://cloud.google.com/speech-to-text/docs/endpoints#speech-sync-recognize-python
My credentials…

Thomas853
- 517
- 1
- 7
- 12
0
votes
0 answers
Rewriting an argparse argument from a .py file to a .ipynb, works but not as expected
I am in a need to measure the word error rate (WER) given an Automatic Speech Recognition (ASR) transcript and a ground truth transcript.
Searching for a program to help me with this need I found this repo on github that I tried and its calculates…

Ale Sosa
- 93
- 1
- 8
0
votes
2 answers
Speech Recognition issue in Android 12
I've an app which uses speech recognition. Target SDK is 31. It was working fine but after updating device to Android 12 it is not giving any result.
speechRecognizerIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
…

Sunil Shreepal
- 53
- 1
- 8
0
votes
1 answer
azure speect to text service - what is continuous recognition doing with the audio on file
We are comparing two speech to text services to present pros/cons of each service - with one service we upload a file and check status via a get request - downloading scripts when status returned is done. This allows us to 'fire and forget', frees…

Cheeter_P
- 71
- 8
0
votes
0 answers
Whenever I kill my app, the service is running but speech to text not working
I created a speech to text app and it's working in the background but whenever I kill my app, the service is running but speech to text not working I want my app service to work properly after killing my app

Abhishek Vashishtha_IBS
- 651
- 1
- 5
- 18
0
votes
3 answers
@react-native-community/voice or react-native-voice / voice gives exception java.lang.NullPointerException at com.wenkesj.voice.VoiceModule.onResults
@react-native-community/voice was working smoothly but recently it gives following error
java.lang.NullPointerException: Attempt to invoke virtual method 'java.util.Iterator java.util.ArrayList.iterator()' on a null object reference
at…

Thanhal P A
- 4,097
- 3
- 18
- 38
0
votes
1 answer
Google Speech-To-Text engine returns RESOURCE_EXHAUSTED
We have setup Google Speech-To-Text as a service, all the instances of our servers use the same Google cloud account for Speech-To-Text service.
However, one of the instances is constantly getting reply "RESOURCE_EXHAUSTED: Resource has been…

Amir Deutel
- 36
- 3
0
votes
1 answer
How do you speed up Dialogflow CX speech recognition on single word responses?
I have a Dialogflow CX agent working in polish [pl] language as an audio bot using AudioCodes.
I want it to respond to yes/no answers (pl: "tak"/"nie"), yet it takes usually 15 seconds or more to detect the end of utterance. Enabling advanced speech…

Rhantolq
- 3
- 1
0
votes
0 answers
s_sec, s_millisec = divmod(float(entry['start']), 1) KeyError: 'start' for max_word= 1
I worked on code published on GitHub https://github.com/jrterven/audio-visual-dataset/blob/master/extract_detailed_text_watson.py the code was design to use 5 words but I want to change it to 1 word so I try do that in the code but there is an error…

SHAIIM
- 15
- 3
0
votes
0 answers
Pyttsx3 to tkinter text while you are speaking
I want pyttsx3 and tkinter-text go synchronically. I mean, I want to see what I told in a text like the subtitles
However I could not manage that. It is possible when it is not in a while loop.... but when I use it in while loop it shows the the…

Ozzz
- 29
- 5