Questions tagged [speech-recognition]

Speech recognition (SR) is the interdisciplinary sub-field of computational linguistics that draws on knowledge and research in linguistics, computer science, and electrical engineering to develop methodologies and technologies that enable computers and computerized devices, such as smart devices and robots, to recognize spoken language and translate it into text.

Speech recognition describes the process of analyzing an audio signal stream and converting it to text. This includes splitting the input into meaningful bits of information (tokenization) and judging the relevance of a signal. The latter means distinguishing between irrelevant signals (e.g. noise in the background) and words to be recognized.
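
As a rough illustration of judging the relevance of a signal, the sketch below flags fixed-size frames as speech-like when their RMS energy exceeds a threshold. This is only one simple heuristic, not how any particular recognizer works; the function names, the frame size of 160 samples, the threshold of 0.1, and the synthetic input are all assumptions chosen for the example.

```python
import math

def frame_energies(samples, frame_size):
    """Return the RMS energy of each consecutive, non-overlapping frame."""
    energies = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        energies.append(math.sqrt(sum(x * x for x in frame) / frame_size))
    return energies

def speech_frames(samples, frame_size=160, threshold=0.1):
    """Flag each frame as speech-like (True) or background (False).

    The threshold is a hypothetical value; real input needs tuning.
    """
    return [e > threshold for e in frame_energies(samples, frame_size)]

# Synthetic input (assumed 8 kHz): quiet hum, a louder tone, quiet hum again.
rate = 8000
quiet = [0.01 * math.sin(2 * math.pi * 50 * n / rate) for n in range(320)]
loud = [0.5 * math.sin(2 * math.pi * 200 * n / rate) for n in range(320)]
flags = speech_frames(quiet + loud + quiet)
print(flags)
```

Only the two frames covering the louder tone are flagged. A fixed threshold like this tends to break down when the background level changes, so it is best read as a starting point.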

Ideally, speech recognition or speech-to-text mechanisms should not be biased towards a specific voice. They should be capable of recognizing arbitrary voices.

5380 questions
41
votes
5 answers

portaudio.h: No such file or directory

I got the following error while trying to install pyaudio using pip3 on Ubuntu 16.04: Collecting pyaudio Downloading PyAudio-0.2.11.tar.gz Installing collected packages: pyaudio Running setup.py install for pyaudio ... error Complete output…
Monojit Sarkar
  • 657
  • 1
  • 5
  • 15
39
votes
3 answers

Saving audio input of Android Stock speech recognition engine

I am trying to save to a file the audio data heard by Android's speech recognition service. Actually I implement RecognitionListener as explained here: Speech to Text on Android, and save the data into a buffer as illustrated here: Capturing audio…
mmmx
  • 521
  • 4
  • 7
39
votes
2 answers

Can I write SQL using speech recognition?

I have wrist pain when I type and I would like to start writing SQL statements, stored procedures, and views using speech recognition.
37
votes
3 answers

Voice recognition on android with recorded sound clip?

I've used the voice recognition feature on Android and I love it. It's one of my customers' most praised features. However, the format is somewhat restrictive. You have to call the recognizer intent, have it send the recording for transcription…
CodeFusionMobile
  • 14,812
  • 25
  • 102
  • 140
36
votes
5 answers

Is there a way to use the Javascript SpeechRecognition API with an audio file?

I want to use the SpeechRecognition API with an audio file (mp3, wave, etc.). Is that possible?
The Surrican
  • 29,118
  • 24
  • 122
  • 168
36
votes
5 answers

Split speech audio file on words in python

I feel like this is a fairly common problem but I haven't yet found a suitable answer. I have many audio files of human speech that I would like to break on words, which can be done heuristically by looking at pauses in the waveform, but can anyone…
user3059201
  • 775
  • 2
  • 7
  • 11
36
votes
12 answers

onServiceConnected never called after bindService method

I have a particular situation: a service started by a broadcast receiver starts an activity. I want to make it possible for this activity to communicate back to the service. I have chosen to use AIDL to make it possible. Everything seems to work well…
Matroska
  • 6,885
  • 14
  • 63
  • 99
35
votes
1 answer

example of AlwaysOnHotwordDetector in Android

Can someone provide an example of how to use the new AlwaysOnHotwordDetector class in Android? I'd like to build an app that, while running in the background, can detect a hotword like "next", "back", or "pause".
Mark Rose
  • 351
  • 1
  • 3
  • 5
32
votes
6 answers

Google Speech Recognition timeout

I am developing an Android Application that is based around Speech Recognition. Until today everything has been working fine and in a timely manner, e.g. I would start my speech recogniser, speak, and within 1 or 2 seconds max the application…
Hector
  • 4,016
  • 21
  • 112
  • 211
32
votes
4 answers

record/save audio from voice recognition intent

I want to save/record the audio that the Google recognition service uses for the speech-to-text operation (using RecognizerIntent or SpeechRecognizer). I have tried many ideas: onBufferReceived from RecognitionListener: I know, this is not working, just…
Slim
  • 528
  • 1
  • 6
  • 9
31
votes
8 answers

Speech Recognition & Programming

Has anyone had success with Dragon Naturally Speaking voice recognition software when it comes to programming? I am wondering because I think it would be a lot faster than typing by hand, and easier on my carpal tunnel. I program from day to…
bitcycle
  • 7,632
  • 16
  • 70
  • 121
29
votes
2 answers

good Speech recognition API

I am working on a college project in which I am using speech recognition. Currently I am developing it on Windows 7, using the System.Speech API package that comes with .NET, and I am writing it in C#. The problem I am facing is dictation…
swordfish
  • 4,899
  • 5
  • 33
  • 61
28
votes
5 answers

How to handle ERROR_RECOGNIZER_BUSY

In my voice-recognition-based app, I sometimes receive ERROR_RECOGNIZER_BUSY. Intuitively, this calls for... retries, right? The problem is that this error is poorly documented, so obviously I have questions that perhaps someone more experienced in…
srf
  • 2,410
  • 4
  • 28
  • 41
27
votes
6 answers

Speech recognition in C or Java or PHP?

Is there any well-known, established framework for C, Java, or PHP for building speech recognition applications? It would take microphone audio input and recognize English words. Something like this pseudocode: Speech s = new Speech(); s.input(micStream); result =…
user285594
27
votes
6 answers

Using the Android RecognizerIntent with a bluetooth headset

I use the following code to launch speech recognition in Android: PackageManager pm = getPackageManager(); List activities = pm.queryIntentActivities(new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH), 0); if (activities.size() == 0)…
Michael Levy
  • 13,097
  • 15
  • 66
  • 100