Questions tagged [speech-to-text]

The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.

2372 questions
0
votes
1 answer

Is there a way for me to combine DialogFlow with HTML Web Speech API?

I am creating a speech recognition + synthesis platform through the browser using webkit speech API. I was wondering if there was a way to capture Dialog flow's agent response and then run it through web speech API. For example, I've built a chatbot…
0
votes
1 answer

Microsoft Azure Speech Synthesizer stops talking even when it returns SynthesizingAudioCompleted (Python)

I'm working on a chatbot that uses Azure SpeechRecognizer and SpeechSynthesizer to talk with users. After 5-10 turns, the chatbot stops talking mid-utterance, even though I get a ResultReason.SynthesizingAudioCompleted status on the entire…
0
votes
1 answer

Nodejs/c++ addon - getting error "undefined symbol: speech_config_from_subscription" from Microsoft speech SDK on ubuntu 18.4 server

Actually, I want to do speech transcription with passing MULAW (g711) audio format to microsoft-speech-sdk (Nodejs), but MULAW streaming audio format is not supported to microsoft-speech-sdk (Nodejs). So, for this required GStreamer with C++. So,…
Tushar Kadam
  • 173
  • 1
  • 1
  • 9
0
votes
1 answer

Teach iOS SFSpeechRecognizer custom words (medication names, chemistry terms, etc)

I have a question regarding the integration of the speech to text iOS library called SFSpeechRecognizer. I need SFSpeechRecognizer to recognize terms that are not present in the iOS dictionary like medication names, chemistry terms, etc. I would…
0
votes
2 answers

Google Speech to Text the environment variable GOOGLE_APPLICATION_CREDENTIALS must be defined pointing to a file defining the credentials

When I use Windows system to set the environment variable program can run normally, change to Linux test, appeared 2022-06-27 12:10:27,814 [pool-6-thread-1] ERROR InfiniteStreamRecognize - java.io.IOException: The Application Default Credentials are…
0
votes
1 answer

How to use a regional endpoint in Google Speech-to-Text C# API?

I would like to use the European endpoint eu-speech.googleapis.com for the Google Cloud speech recognition C# API. An example is given for Python at https://cloud.google.com/speech-to-text/docs/endpoints#speech-sync-recognize-python My credentials…
Thomas853
  • 517
  • 1
  • 7
  • 12
0
votes
0 answers

Rewriting an argparse argument from a .py file to a .ipynb, works but not as expected

I am in a need to measure the word error rate (WER) given an Automatic Speech Recognition (ASR) transcript and a ground truth transcript. Searching for a program to help me with this need I found this repo on github that I tried and its calculates…
Ale Sosa
  • 93
  • 1
  • 8
0
votes
2 answers

Speech Recognition issue in Android 12

I've an app which uses speech recognition. Target SDK is 31. It was working fine but after updating device to Android 12 it is not giving any result. speechRecognizerIntent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH); …
0
votes
1 answer

azure speect to text service - what is continuous recognition doing with the audio on file

We are comparing two speech to text services to present pros/cons of each service - with one service we upload a file and check status via a get request - downloading scripts when status returned is done. This allows us to 'fire and forget', frees…
Cheeter_P
  • 71
  • 8
0
votes
0 answers

Whenever I kill my app, the service is running but speech to text not working

I created a speech to text app and it's working in the background but whenever I kill my app, the service is running but speech to text not working I want my app service to work properly after killing my app
0
votes
3 answers

@react-native-community/voice or react-native-voice / voice gives exception java.lang.NullPointerException at com.wenkesj.voice.VoiceModule.onResults

@react-native-community/voice was working smoothly but recently it gives following error java.lang.NullPointerException: Attempt to invoke virtual method 'java.util.Iterator java.util.ArrayList.iterator()' on a null object reference at…
0
votes
1 answer

Google Speech-To-Text engine returns RESOURCE_EXHAUSTED

We have setup Google Speech-To-Text as a service, all the instances of our servers use the same Google cloud account for Speech-To-Text service. However, one of the instances is constantly getting reply "RESOURCE_EXHAUSTED: Resource has been…
0
votes
1 answer

How do you speed up Dialogflow CX speech recognition on single word responses?

I have a Dialogflow CX agent working in polish [pl] language as an audio bot using AudioCodes. I want it to respond to yes/no answers (pl: "tak"/"nie"), yet it takes usually 15 seconds or more to detect the end of utterance. Enabling advanced speech…
Rhantolq
  • 3
  • 1
0
votes
0 answers

s_sec, s_millisec = divmod(float(entry['start']), 1) KeyError: 'start' for max_word= 1

I worked on code published on GitHub https://github.com/jrterven/audio-visual-dataset/blob/master/extract_detailed_text_watson.py the code was design to use 5 words but I want to change it to 1 word so I try do that in the code but there is an error…
SHAIIM
  • 15
  • 3
0
votes
0 answers

Pyttsx3 to tkinter text while you are speaking

I want pyttsx3 and tkinter-text go synchronically. I mean, I want to see what I told in a text like the subtitles However I could not manage that. It is possible when it is not in a while loop.... but when I use it in while loop it shows the the…