Questions tagged [speech-to-text]

The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.

2372 questions
18
votes
6 answers

Pocketsphinx - Adding words and Improving accuracy

I've managed to finally build and run pocketsphinx (pocketsphinx_continuous). The problem I'm running into, is how to a improve accuracy. From what I understand, you can specify a dictionary file (-dict test.dic). So I took the default dictionary…
Mike6679
  • 5,547
  • 19
  • 63
  • 108
17
votes
5 answers

How can I do real-time voice activity detection in Python?

I am performing a voice activity detection on the recorded audio file to detect speech vs non-speech portions in the waveform. The output of the classifier looks like (highlighted green regions indicate speech): The only issue I face here is making…
Nickil Maveli
  • 29,155
  • 8
  • 82
  • 85
17
votes
4 answers

Pitch detection in Python

The concept of the program I'm working on is a Python module which detects certain frequencies (human speech frequency 80-300hz) and by checking from a database shows the intonation of the sentence. I use SciPy to plot frequency of the sound files,…
17
votes
0 answers

SpeechRecognizer on Android Wear

The app I'm currently working on requires simple speech recognition of single words. However, I don't want to use: startActivityForResult() using the ACTION_RECOGNIZE_SPEECH because I need to display other stuff while the user is speaking. So I…
Simon
  • 171
  • 6
16
votes
3 answers

Speech to Text from own sound file

As you probably know, implementing speech-to-text is pretty easy with the Android API. All you have to do is just call up the API's intent and it will return text for you. My case is a bit different, I have a prerecorded 3GPP sound file that I've…
Brian
  • 7,955
  • 16
  • 66
  • 107
16
votes
1 answer

Is there a way to convert speech directly into SSML?

Just as one is able to use various speech-to-text 'dictation' tools to convert spoken word into its corresponding text, I would like to know if there are similar such tools for converting spoken word into its corresponding SSML. That is, it will…
15
votes
4 answers

iPhone App › Add voice recognition?

I'd like to build an app that uses voice recognition. I've seen big companies like Google etc implement this feature, but I'm curious about doing it on a start-up level. Anyone looked into this? Are there any tools out there for us to do this?
aaron
15
votes
3 answers

Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?

Is there a comprehensive list of known APIs for desktop or browser environments?
15
votes
6 answers

Android RecognitionListener: onResults being called twice

I have a project using RecognitionListener written in Kotlin. The speech-to-text function was always a success and never presented any problems. Since last week, it's onResult function started to be called twice. No changes were made on the project.…
15
votes
4 answers

Speech to Text on Android

I am looking to create an app which has Speech to text. I am aware of this kind of ability using the RecognizerIntent: http://android-developers.blogspot.com/search/label/Speech%20Input However - I do not want a new Intent to be popped up, I want to…
RenegadeAndy
  • 5,440
  • 18
  • 70
  • 130
15
votes
1 answer

INVALID_ARGUMENT: Request payload size exceeds the limit: 10485760 bytes

I'm using for the first time the GCS Speech API for a project to convert a series of audio files to text. Each file has around 60 minutes and is a person talking continuously during the whole time. I've installed the GC SDK and I'm using it to…
CIRCLE
  • 4,501
  • 5
  • 37
  • 56
15
votes
2 answers

How to implement Mozilla DeepSpeech into PHP web app to convert Speech-to-text?

I have a PHP web application and am looking for an open source, high-accuracy speech-to-text recognition implementation that will take voice commands to open web pages from users. Examples: "Make Sales" (this will open Create Sales PHP page), "Make…
15
votes
1 answer

How to implement speech-to-text via the Speech framework in Objective-C?

I want to do speech recognition in my Objective-C app using the iOS Speech framework. I found some Swift examples but haven't been able to find anything in Objective-C. Is it possible to access this framework from Objective-C? If so, how?
Boris
  • 11,373
  • 2
  • 33
  • 35
15
votes
3 answers

Comparison of Speech Recognition use in Android: by Intent or on-thread?

Introduction Android provides two ways for me to use speech recognition. The first way is by an Intent, as in this question: Intent example. A new Activity is pushed onto the top of the stack which listens to the user, hears some speech, attempts…
hcarver
  • 7,126
  • 4
  • 41
  • 67
14
votes
1 answer

Live speech recognition

I have a Python script using the speech_recognition package to recognize speech and return the text of what was spoken. The transcription has a few seconds delay, however. Is there another way to write this script to return each word as it is…