The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.
Questions tagged [speech-to-text]
2372 questions
18
votes
6 answers
Pocketsphinx - Adding words and Improving accuracy
I've finally managed to build and run pocketsphinx (pocketsphinx_continuous). The problem I'm running into is how to improve accuracy. From what I understand, you can specify a dictionary file (-dict test.dic). So I took the default dictionary…

Mike6679
- 5,547
- 19
- 63
- 108
17
votes
5 answers
How can I do real-time voice activity detection in Python?
I am performing voice activity detection on a recorded audio file to detect speech vs. non-speech portions in the waveform.
The output of the classifier looks like (highlighted green regions indicate speech):
The only issue I face here is making…

Nickil Maveli
- 29,155
- 8
- 82
- 85
17
votes
4 answers
Pitch detection in Python
The concept of the program I'm working on is a Python module which detects certain frequencies (human speech frequency, 80-300 Hz) and, by checking against a database, shows the intonation of the sentence. I use SciPy to plot frequency of the sound files,…

Andrew Ravus
- 451
- 1
- 7
- 14
17
votes
0 answers
SpeechRecognizer on Android Wear
The app I'm currently working on requires simple speech recognition of single words. However, I don't want to use:
startActivityForResult() using the ACTION_RECOGNIZE_SPEECH
because I need to display other stuff while the user is speaking.
So I…

Simon
- 171
- 6
16
votes
3 answers
Speech to Text from own sound file
As you probably know, implementing speech-to-text is pretty easy with the Android API. All you have to do is call the API's intent and it will return text for you. My case is a bit different: I have a prerecorded 3GPP sound file that I've…

Brian
- 7,955
- 16
- 66
- 107
16
votes
1 answer
Is there a way to convert speech directly into SSML?
Just as one is able to use various speech-to-text 'dictation' tools to convert spoken word into its corresponding text, I would like to know if there are similar such tools for converting spoken word into its corresponding SSML. That is, it will…

Tristannica
- 161
- 6
15
votes
4 answers
iPhone App › Add voice recognition?
I'd like to build an app that uses voice recognition. I've seen big companies like Google, etc., implement this feature, but I'm curious about doing it on a start-up level. Has anyone looked into this? Are there any tools out there for us to do this?
aaron
15
votes
3 answers
Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?
Is there a comprehensive list of known APIs for desktop or browser environments?

Vladimir Keleshev
- 13,753
- 17
- 64
- 93
15
votes
6 answers
Android RecognitionListener: onResults being called twice
I have a project using RecognitionListener written in Kotlin. The speech-to-text function was always a success and never presented any problems.
Since last week, its onResults function started to be called twice. No changes were made to the project.…

Pedro Henrique Flores
- 181
- 1
- 7
15
votes
4 answers
Speech to Text on Android
I am looking to create an app which has speech-to-text.
I am aware of this kind of ability using the RecognizerIntent: http://android-developers.blogspot.com/search/label/Speech%20Input
However - I do not want a new Intent to be popped up, I want to…

RenegadeAndy
- 5,440
- 18
- 70
- 130
15
votes
1 answer
INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes
I'm using the GCS Speech API for the first time, for a project to convert a series of audio files to text. Each file is around 60 minutes long and consists of one person talking continuously the whole time.
I've installed the GC SDK and I'm using it to…

CIRCLE
- 4,501
- 5
- 37
- 56
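A rough back-of-the-envelope check for the payload-size question above: 10485760 bytes is exactly 10 MiB, and an hour of uncompressed audio is far larger than that. The sketch below assumes 16 kHz, 16-bit, mono PCM (the excerpt does not state the actual audio format), and the helper names are hypothetical, not part of any SDK.

```python
# Limit quoted in the error message: 10485760 bytes (10 * 1024 * 1024).
PAYLOAD_LIMIT_BYTES = 10485760


def raw_pcm_size_bytes(duration_s, sample_rate_hz=16000,
                       bytes_per_sample=2, channels=1):
    """Size of uncompressed PCM audio of the given duration.

    The defaults (16 kHz, 16-bit, mono) are assumptions for illustration.
    """
    return int(duration_s * sample_rate_hz * bytes_per_sample * channels)


def max_duration_under_limit_s(limit_bytes=PAYLOAD_LIMIT_BYTES,
                               sample_rate_hz=16000,
                               bytes_per_sample=2, channels=1):
    """Longest uncompressed clip (in seconds) that stays within the limit."""
    return limit_bytes / (sample_rate_hz * bytes_per_sample * channels)


if __name__ == "__main__":
    one_hour = raw_pcm_size_bytes(60 * 60)
    print("60 min of audio:", one_hour, "bytes")
    print("exceeds limit:", one_hour > PAYLOAD_LIMIT_BYTES)
    print("max clip length:", max_duration_under_limit_s(), "seconds")
```

Under these assumptions only a few minutes of audio fit in a single request body, so a 60-minute file exceeds the limit by roughly an order of magnitude regardless of the exact encoding.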
15
votes
2 answers
How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?
I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…

Priyesh
- 501
- 3
- 12
15
votes
1 answer
How to implement speech-to-text via the Speech framework in Objective-C?
I want to do speech recognition in my Objective-C app using the iOS Speech framework.
I found some Swift examples but haven't been able to find anything in Objective-C.
Is it possible to access this framework from Objective-C? If so, how?

Boris
- 11,373
- 2
- 33
- 35
15
votes
3 answers
Comparison of Speech Recognition use in Android: by Intent or on-thread?
Introduction
Android provides two ways for me to use speech recognition.
The first way is by an Intent, as in this question: Intent example. A new Activity is pushed onto the top of the stack which listens to the user, hears some speech, attempts…

hcarver
- 7,126
- 4
- 41
- 67
14
votes
1 answer
Live speech recognition
I have a Python script using the speech_recognition package to recognize speech and return the text of what was spoken. The transcription has a delay of a few seconds, however. Is there another way to write this script to return each word as it is…

Christopher Costello
- 1,186
- 2
- 16
- 30