Questions tagged [speech-to-text]

The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.

2372 questions
4 votes, 1 answer

YouTube's auto captioning produces better results than Google Speech to Text API (Model: video, UseEnhanced: true). How is this possible?

Here are my settings for Google Speech to Text AI. Here is the output file of Speech to Text AI: https://justpaste.it/speechtotext2 Here is the output file of YouTube's auto caption: https://justpaste.it/ytautotranslate This is the video link:…
4 votes, 2 answers

Google Speech to text available offline?

I would like to leverage Google's Speech to text service for a desktop app, but I would like it to be offline. Is this possible? They have on-prem solutions but can it be offline so no data is…
4 votes, 0 answers

Is it possible to run multiple instances of SFSpeechRecognizer?

I've implemented Apple's SpeechRecognizer to convert speech to text. I have multiple audio recordings, so I'm creating multiple SFSpeechRecognizer instances so that all of them are converted in parallel, and I've also used DispatchGroup so that I can…
Rajat Mishra • 995 • 7 • 18
4 votes, 2 answers

Flutter Speech to text not listening continuously

I am working on Flutter speech to text conversion using the speech to text library, and added the code below for listening: speech.listen( onResult: resultListener, listenFor: Duration(minutes: 10), localeId: _currentLocaleId, …
Lavii • 63 • 1 • 4
4 votes, 1 answer

Where can I see current usage data for Google Cloud Speech To Text?

I need to know how many minutes I have used. I can't find it anywhere in the dashboard. (By the way, Google Speech to Text API has a 60 minutes/month free trial.) Update: @mk_sta's answer seems correct. But after more testing, (upload a 55…
NamNamNam • 1,190 • 3 • 13 • 22
4 votes, 0 answers

Long audio files to text real quick in Python

I want to have audio files converted to text. For this purpose I am using the speechrecognition library and then the Google Cloud speech to text API to convert them. I tried doing so by splitting the audio into chunks. However, it takes much more time than expected.…
4 votes, 2 answers

How to enable word-level confidence for the MS Azure Speech to Text service

How can I get word-level confidence for each word using the MS Azure speech to text service? Currently, I am getting a sentence-level confidence value, and I need word-level confidence for further processing.
4 votes, 0 answers

How can I make the speechRecognition and speechSynthesis APIs work together?

I am currently working on an experiment using the Web Speech API. The idea is to use speech recognition and synthesis to improve the user's experience with HTML forms. Keep in mind that the code below is just a concept and has many issues. However,…
4 votes, 1 answer

Amazon Transcribe/Lex with feedback during the call

I am new to AWS services, and we want to build a simple demo that detects a special word and: [1] triggers an action, [2] responds (as speech during the call). For example, if the user says "Help", I want to reply "OK" and perform an operation (AWS…
AsfK • 3,328 • 4 • 36 • 73
4 votes, 3 answers

Convert Audio File to text using System.Speech

I am looking to convert a .wav file recorded through an Android phone at 16000 to text using C#, namely the System.Speech namespace. My code is mentioned…
4 votes, 2 answers

@google-cloud/speech - Error: ENOENT: no such file or directory, open 'protos.json'

I'm trying to use the Google Cloud speech to text API. I'm using Google's sample code, and when I create the client object I get this error. { "errno":-2, "syscall":"open", "code":"ENOENT", "path":"protos.json", "stack":"Error:…
4 votes, 0 answers

OSError: No Default Input Device Available using Colab

Unable to use the SpeechRecognition lib in Colab. Code: import speech_recognition as sr r = sr.Recognizer() with sr.Microphone() as source: print("Speak Anything: ") audio = r.listen(source) try: text =…
4 votes, 1 answer

How to fix "There is no default audio device configured" speech-to-text

I am trying to call Google's speech-to-text API from Node.js, and it apparently works, but when I try to run the example provided here (node MicrophoneStream.js), after doing the installation as described here, I am having the…
theboshy • 378 • 6 • 17
4 votes, 3 answers

How to get Word Level Timestamps using Azure Speech to Text and the Python SDK?

My code currently is able to read an audio file and transcribe it using Azure Speech to Text, with help from an example that I found on GitHub. However, I need to include the timestamps for all the words in the transcription. According to the…
4 votes, 2 answers

Pyttsx isn’t showing installed languages on Windows 10

I am trying to use pyttsx3 to say French text. However, only English is available. Following the advice of How to change the voice in pyttsx3?, I tried to install the French speech pack as explained here…
Silver Duck • 581 • 1 • 5 • 18