Highest Voted 'speech-to-text' Questions

4

votes

1 answer

YouTube's auto captioning produces better results than Google Speech to Text API (Model: video, UseEnhanced: true). How is this possible?

Here are my settings for Google Speech to Text AI. Here is the output file of Speech to Text AI: https://justpaste.it/speechtotext2 Here is the output file of YouTube's auto caption: https://justpaste.it/ytautotranslate This is the video link:…

asked Oct 12 '20 at 17:29

Furkan Gözükara

22,964
77
205
342

4

votes

2 answers

Google Speech to text available offline?

I would like to leverage Google's Speech to text service for a desktop app, but I would like it to be offline. Is this possible? They have on-prem solutions but can it be offline so no data is…

google-cloud-platform speech-recognition offline speech-to-text google-cloud-speech

asked Sep 17 '20 at 03:43

Mody

41
1
2

4

votes

0 answers

Is it possible to run multiple instances of SFSpeechRecognizer?

I've implemented Apple's SpeechRecognizer to convert speech to text. I have multiple audio recordings, so I'm creating multiple SFSpeechRecognizer instances so that all of them are converted in parallel, and I've also used DispatchGroup so that I can…

ios swift speech-to-text speech sfspeechrecognizer

asked Jun 12 '20 at 12:44

Rajat Mishra

995
7
18

4

votes

2 answers

Flutter Speech to text not listening continuously

I'm working on Flutter speech to text conversion using the speech to text library, and I added the code below for listening: speech.listen( onResult: resultListener, listenFor: Duration(minutes: 10), localeId: _currentLocaleId, …

flutter speech-recognition speech-to-text

asked May 12 '20 at 09:01

Lavii

63
1
4

4

votes

1 answer

Where can I see current usage data for Google Cloud Speech To Text?

I need to know how many minutes I have used. I can't find it anywhere in the dashboard. (By the way, the Google Speech to Text API has a 60 minutes/month free trial.) Update: @mk_sta's answer seems correct: But after more testing, (upload a 55…

google-cloud-platform speech-to-text

asked Apr 10 '20 at 06:11

NamNamNam

1,190
3
13
22

4

votes

0 answers

Long audio files to text real quick in Python

I want to convert audio files to text. For this purpose I am using the speechrecognition library and then the Google Cloud Speech to Text API to convert them. I tried doing so by splitting the audio into chunks. However, it takes much more time than expected.…

python speech-recognition speech-to-text google-cloud-speech

asked Apr 04 '20 at 14:35

Naazneen Jatu

526
9
19

4

votes

2 answers

How to enable word level Confidence for MS Azure Speech to Text Service

How can I get a confidence value for each word using the MS Azure Speech to Text service? Currently, I am getting a sentence-level confidence value, and I need word-level confidence for further processing.

azure speech-recognition speech-to-text azure-speech

asked Feb 14 '20 at 16:06

Manoj Deshpande

311
2
18

4

votes

0 answers

How can I make the speechRecognition and speechSynthesis APIs work together?

I am currently working on an experiment using the Web Speech API. The idea is to use speech recognition and synthesis to improve the user's experience with HTML forms. Keep in mind that the code below is just a concept and has many issues. However,…

javascript speech-recognition text-to-speech speech-to-text

asked Jan 16 '20 at 08:58

James Perrone

97
1
8

4

votes

1 answer

Amazon Transcribe/Lex with feedback during the call

I am new to AWS services, and we want to build a simple demo that detects a special word and then [1] triggers an action and [2] responds (as speech during the call). For example, if the user says "Help", I want to reply "OK" and perform an operation (AWS…

amazon-web-services twilio speech-recognition speech-to-text

asked Jan 12 '20 at 08:12

AsfK

3,328
4
36
73

4

votes

3 answers

Convert Audio File to text using System.Speech

I am looking to convert a .wav file, recorded on an Android phone at 16000, to text using C#, namely the System.Speech namespace. My code is mentioned…

c# speech-recognition speech-to-text system.speech.recognition

asked May 10 '11 at 07:27

Kushal Kalambi

125
1
5
13

4

votes

2 answers

@google-cloud/speech - Error: ENOENT: no such file or directory, open 'protos.json'

I'm trying to use the Google Cloud Speech to Text API. I'm using the sample Google code, and when I create the client object I get this error. { "errno":-2, "syscall":"open", "code":"ENOENT", "path":"protos.json", "stack":"Error:…

node.js google-cloud-platform speech-to-text

asked Dec 17 '19 at 16:07

Ettore Panini

87
1
7

4

votes

0 answers

OSError: No Default Input Device Available using Colab

Unable to use the SpeechRecognition library in Colab. Code: import speech_recognition as sr r = sr.Recognizer() with sr.Microphone() as source: print("Speak Anything: ") audio = r.listen(source) try: text =…

python nlp speech-to-text pyaudio

asked Nov 25 '19 at 16:13

Sarbajit Roy

41
3

4

votes

1 answer

How to fix "Here is no default audio device configured" speech-to-text

I am trying to call the speech-to-text API of the Google console for Node.js, and it apparently works, but when I try to run the example provided here (node MicrophoneStream.js), after doing the installation as described here, I am having the…

node.js windows speech-to-text google-speech-api sox

asked Jul 26 '19 at 04:25

theboshy

378
6
17

4

votes

3 answers

How to get Word Level Timestamps using Azure Speech to Text and the Python SDK?

My code currently is able to read an audio file and transcribe it using Azure Speech to Text, with help from an example that I found on GitHub. However, I need to include the timestamps for all the words in the transcription. According to the…

python azure speech-to-text

asked Jul 01 '19 at 20:58

Jonathan Kvist

41
1
3

4

votes

2 answers

Pyttsx isn’t showing installed languages on windows 10

I am trying to use pyttsx3 to say French text. However, only English is available. Following the advice of How to change the voice in pyttsx3?, I tried to install the French speech pack as explained here…

python windows-10 speech-to-text pyttsx

asked Jun 24 '19 at 06:26

Silver Duck

581
1
5
18
