Questions tagged [speech-to-text]

The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.

2372 questions
0
votes
1 answer

Speech to text conversion on a single click

I am trying to implement voice-to-text functionality in my Flutter app by following the YouTube tutorial https://www.youtube.com/watch?v=wDWoD1AaLu8. However, for the app to listen again, I have to press the button twice (to stop the…
simsim
0
votes
1 answer

Microsoft Cognitive SpeechRecognizer Stuck

I'm trying to do speech-to-text on some wave files using the MS cognitive Speech SDK. It works well enough for some files but it gets stuck for others. By stuck, I mean that it doesn't stop until cancelled manually. I tried first with the…
0
votes
2 answers

Azure Speech to text 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND)

I've been trying to start a project involving Azure speech to text, and for testing purposes I wanted to verify that it works using the demo code found in this…
SaltyCode
0
votes
1 answer

Detect inquiry sentence in Wav2Vec 2.0 result

I am studying ASR (Automatic Speech Recognition) using Wav2Vec 2.0. When I run Wav2Vec 2.0, I get the result without a period ("."), question mark ("?"), etc. Therefore, the result comes out as one whole sentence. I know that I removed regex while making…
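A minimal sketch of one naive way to flag question-like output when the transcript carries no punctuation, assuming English text and assuming that a question tends to start with an interrogative or auxiliary word. The word list and the function name `looks_like_question` are hypothetical and are not part of Wav2Vec 2.0 or any library; a trained punctuation-restoration model would likely be more reliable.

```python
import re

# Hypothetical, non-exhaustive list of English words that often open a question.
QUESTION_OPENERS = {
    "who", "what", "when", "where", "why", "how", "which", "whose",
    "is", "are", "was", "were", "do", "does", "did",
    "can", "could", "will", "would", "should",
}


def looks_like_question(transcript: str) -> bool:
    """Return True if the first word of an unpunctuated transcript
    is one of the assumed question openers."""
    # Lowercase first, because CTC-style transcripts are often all caps.
    words = re.findall(r"[a-z']+", transcript.lower())
    return bool(words) and words[0] in QUESTION_OPENERS
```

For example, `looks_like_question("WHERE IS THE NEAREST STATION")` is true under this heuristic, while a declarative transcript such as `"THE STATION IS NEARBY"` is not. Questions that do not begin with one of the listed words are missed.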
0
votes
1 answer

How to Start Continuous Recognition on a Web Server using the 'use_default_microphone' parameter (Azure Cognitive Services Speech)

I am developing a web app (Python) in which I use Azure Cognitive Services speech translation. I used PyWebIO to create an interface and start a server to run my code. I am using the 'use_default_microphone' parameter in the audio config to speak to…
0
votes
1 answer

p5.speech.js: "Cannot set properties of undefined (setting 'Speech')"

See below for the p5.speech.js library from the p5.js website: /*! p5.speech.js v0.0.1 2015-06-12 */ /* updated v0.0.2 2017-10-17 */ /* updated v0.0.3 2022.1.7 */ /** * @module p5.speech * @submodule p5.speech * @for p5.speech * @main */ …
sb2021
0
votes
1 answer

How to make Speech To Text stop when the user stops talking?

I'm trying to make an icon in the text field that is supposed to stop and change to mic_none when the user has finished speaking, but that doesn't happen. What happens is that the text reception stops, but the icon does not return to…
John
0
votes
0 answers

java.lang.UnsatisfiedLinkError: 'void com.microsoft.cognitiveservices.speech.SpeechConfig.setTempDirectory(java.lang.String)' - Java CentOS 7

My application is a Spring Boot Java project that connects to Azure services for converting speech to text, analyzing sentiment, and translating. I am trying to deploy my application to a CentOS 7 VM, but I get the error in the title when calling my…
0
votes
1 answer

Speech Recognition (IBM) username and password

I hope to use the IBM speech recognition service without curl or the ibm_watson module. My attempt is below: import speech_recognition as sr r = sr.Recognizer() text = r.recognize_ibm(audio,username='',password='') Even though I have 'Service…
Yong
0
votes
0 answers

How to solve the error: ValueError: Audio file could not be read as PCM WAV, AIFF/AIFF-C?

I have a WAV file that I need to convert to text using SpeechRecognition. import speech_recognition as sr r = sr.Recognizer() sample_audio = sr.AudioFile('voice.wav') print(type(sample_audio)) with sample_audio as audio_file: …
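A rough diagnostic sketch for this kind of error, assuming the cause is that the file is not really PCM-encoded WAV despite its `.wav` extension (for example, compressed audio that was only renamed). It uses only Python's standard `wave` module, which reads PCM WAV headers; the helper name `describe_wav` is hypothetical and is not part of SpeechRecognition.

```python
import wave


def describe_wav(path):
    """Return basic header facts for a PCM WAV file,
    or a message string if the file cannot be parsed as one."""
    try:
        with wave.open(path, "rb") as wav_file:
            return {
                "channels": wav_file.getnchannels(),
                "sample_width_bytes": wav_file.getsampwidth(),
                "frame_rate_hz": wav_file.getframerate(),
                "frames": wav_file.getnframes(),
            }
    except (wave.Error, EOFError) as exc:
        # wave.Error covers a missing RIFF/WAVE header and non-PCM formats;
        # EOFError covers files too short to contain a header.
        return f"not readable as PCM WAV: {exc}"
```

Calling `describe_wav("voice.wav")` on the file from the question would show either its header fields or the reason parsing failed. If parsing fails, re-encoding the audio as PCM WAV with an external tool is a common next step.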
0
votes
1 answer

How to train a custom model for speech to text Cognitive Services?

We are building a speech-to-text application. The conversations are always in Dutch, but in some cases English and Dutch words are the same. How can I train my model to handle those cases?
0
votes
2 answers

Microsoft Azure Speech Translation not working [JavaScript SDK]

This example is not working anymore: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/js/node/translation.js And the demonstration on the Azure website is not working…
0
votes
1 answer

I want to create a C# web API for an Azure speech-to-text console application

I have the following C# console application, which uses the Azure speech-to-text service to convert speech from microphone input into text. I want to create a web API (using the endpoint id, subscription key and service region). Can anyone tell…
0
votes
1 answer

http.client.RemoteDisconnected: Remote end closed connection without response speech recognition

I am trying to read the text from an audio file: import speech_recognition as sr # obtain path to "english.wav" in the same folder as this script from os import path AUDIO_FILE = path.join(path.dirname(path.realpath(__file__)), "09.04.01 KMCTC.wav") #…
Alex
0
votes
0 answers

Python speech-to-text model integration with Flutter

I have a complete speech-to-text and text-to-speech model in Python, but I have to integrate it with Flutter. Is that possible? Or should I use Flutter's own speech-to-text/text-to-speech, which would be a waste of time for me?