Highest Voted 'speech-to-text' Questions

18

votes

6 answers

Pocketsphinx - Adding words and Improving accuracy

I've managed to finally build and run pocketsphinx (pocketsphinx_continuous). The problem I'm running into, is how to a improve accuracy. From what I understand, you can specify a dictionary file (-dict test.dic). So I took the default dictionary…

sphinx speech-recognition speech-to-text

asked Dec 26 '10 at 20:02

Mike6679

5,547
19
63
108

17

votes

5 answers

How can I do real-time voice activity detection in Python?

I am performing a voice activity detection on the recorded audio file to detect speech vs non-speech portions in the waveform. The output of the classifier looks like (highlighted green regions indicate speech): The only issue I face here is making…

python speech-recognition speech-to-text speech pyaudio

asked Mar 24 '20 at 13:38

Nickil Maveli

29,155
8
82
85

17

votes

4 answers

Pitch detection in Python

The concept of the program I'm working on is a Python module which detects certain frequencies (human speech frequency 80-300hz) and by checking from a database shows the intonation of the sentence. I use SciPy to plot frequency of the sound files,…

python signal-processing speech-recognition speech-to-text speech

asked Sep 15 '15 at 20:52

Andrew Ravus

451
1
7
14

17

votes

0 answers

SpeechRecognizer on Android Wear

The app I'm currently working on requires simple speech recognition of single words. However, I don't want to use: startActivityForResult() using the ACTION_RECOGNIZE_SPEECH because I need to display other stuff while the user is speaking. So I…

android speech-recognition speech-to-text wear-os

asked Sep 24 '14 at 09:34

Simon

171
6

16

votes

3 answers

Speech to Text from own sound file

As you probably know, implementing speech-to-text is pretty easy with the Android API. All you have to do is just call up the API's intent and it will return text for you. My case is a bit different, I have a prerecorded 3GPP sound file that I've…

android file audio speech-to-text

asked Aug 08 '11 at 23:59

Brian

7,955
16
66
107

16

votes

1 answer

Is there a way to convert speech directly into SSML?

Just as one is able to use various speech-to-text 'dictation' tools to convert spoken word into its corresponding text, I would like to know if there are similar such tools for converting spoken word into its corresponding SSML. That is, it will…

text-to-speech speech-to-text speech-synthesis alexa-voice-service ssml

asked Sep 08 '17 at 04:59

Tristannica

161
6

15

votes

4 answers

iPhone App › Add voice recognition?

I'd like to build an app that uses voice recognition. I've seen big companies like Google etc implement this feature, but I'm curious about doing it on a start-up level. Anyone looked into this? Are there any tools out there for us to do this?

iphone speech-recognition voice-recording speech-to-text

asked Jun 02 '09 at 22:50

aaron

15

votes

3 answers

Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

Is there a comprehensive list of known APIs for desktop or browser environments?

speech-recognition text-to-speech speech-to-text speech-synthesis

asked Jun 14 '11 at 19:13

Vladimir Keleshev

13,753
17
64
93

15

votes

6 answers

Android RecognitionListener: onResults being called twice

I have a project using RecognitionListener written in Kotlin. The speech-to-text function was always a success and never presented any problems. Since last week, it's onResult function started to be called twice. No changes were made on the project.…

android kotlin speech-to-text voice-recognition

asked Mar 25 '20 at 16:39

Pedro Henrique Flores

181
1
7

15

votes

4 answers

Speech to Text on Android

I am looking to create an app which has Speech to text. I am aware of this kind of ability using the RecognizerIntent: http://android-developers.blogspot.com/search/label/Speech%20Input However - I do not want a new Intent to be popped up, I want to…

java android speech-recognition speech speech-to-text

asked May 06 '11 at 15:42

RenegadeAndy

5,440
18
70
130

15

votes

1 answer

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

I'm using for the first time the GCS Speech API for a project to convert a series of audio files to text. Each file has around 60 minutes and is a person talking continuously during the whole time. I've installed the GC SDK and I'm using it to…

speech-recognition speech-to-text google-speech-api

asked Jul 30 '18 at 20:17

CIRCLE

4,501
5
37
56

15

votes

2 answers

How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?

I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…

php speech-recognition speech-to-text webspeech-api mozilla-deepspeech

asked May 29 '18 at 10:56

Priyesh

501
3
12

15

votes

1 answer

How to implement speech-to-text via the Speech framework in Objective-C?

I want to do speech recognition in my Objective-C app using the iOS Speech framework. I found some Swift examples but haven't been able to find anything in Objective-C. Is it possible to access this framework from Objective-C? If so, how?

ios objective-c speech-recognition speech-to-text mobile-application

asked May 07 '17 at 16:50

Boris

11,373
2
33
35

15

votes

3 answers

Comparison of Speech Recognition use in Android: by Intent or on-thread?

Introduction Android provides two ways for me to use speech recognition. The first way is by an Intent, as in this question: Intent example. A new Activity is pushed onto the top of the stack which listens to the user, hears some speech, attempts…

android android-intent speech-recognition speech-to-text

asked Aug 11 '12 at 10:00

hcarver

7,126
4
41
67

14

votes

1 answer

Live speech recognition

I have a Python script using the speech_recognition package to recognize speech and return the text of what was spoken. The transcription has a few seconds delay, however. Is there another way to write this script to return each word as it is…

python speech-recognition speech-to-text cmusphinx pocketsphinx

asked Oct 29 '17 at 20:32

Christopher Costello

1,186
2
16
30

Questions tagged [speech-to-text]