The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.
Questions tagged [speech-to-text]
2372 questions
11
votes
1 answer
Is the Offline Dictation feature thats in Jelly Bean part of AOSP and if so what package is it in?
I have been scouring Jelly Bean to see if I could find the Offline Dictation feature but I have not had a lot of luck. As you know, the source is huge and cumbersome. I found the /external/srec folder but the documentation in there seems to be from…

Jared Sheehan
- 187
- 2
- 12
11
votes
2 answers
Options for free (and preferably open source) speech to text library
Looking for a library (with Java or Python APIs) that converts speech to text. 100% accuracy is not an absolute requirement because I just need to run some experiments for a prototype. Ideally it should accept an input file (e.g., .wav) and return…

Soumya Simanta
- 11,523
- 24
- 106
- 161
10
votes
1 answer
How to use CMU Sphinx 4 for speech to text with english voxforge models
I'm trying to figure out how to use sphinx4 or pocketsphinx with the english voxforge model but I can't get it working. I have tried to read doc pages (like this one http://cmusphinx.sourceforge.net/sphinx4/doc/UsingSphinxTrainModels.html ) but it…

tirithen
- 3,219
- 11
- 41
- 65
10
votes
1 answer
Difference among Microsoft Speech products/platforms
It seems Microsoft offers quite a few speech recognition products, I'd like to know the differences among all of them pls.
There is Microsoft Speech API, or SAPI. But somehow Microsoft Cognitive Service Speech API has the same name.
Ok now,…

Blue482
- 2,926
- 5
- 29
- 40
10
votes
3 answers
RaspberryPi + Pocketsphinx + ps3eye Error: Failed to open audio device
just installed pocketsphinx on my raspberry pi. Think i'm going crazy but not sure if I'm providing the correct device.
Whenever I run:
src/programs/pocketsphinx_continuous -adcdev plughw:1,0 -nfft 2048
-samprate 48000
I get the…

Malcolm Jones
- 1,472
- 1
- 12
- 24
10
votes
2 answers
Android extras about speech recognition does not work
I am currently busy with speech recognition, and I want speech recognizer to listen me continously, forever. The purpose of the app is, it is going to wait for speech of a specific keyword: when the user says this keyword, the service becomes ready…

Ayse
- 311
- 1
- 3
- 17
9
votes
2 answers
AppendDictation on Microsoft Speech Platform 11 (Server)?
I'm relatively new to both C# and the Microsoft Speech platform, but I am working on a server application that will need to transcribe free dictation. The MS Speech Platform SDK seemed perfect, and works on the server, unless i reference the…

Jared
- 562
- 1
- 6
- 22
9
votes
3 answers
How to convert human voice into digital format?
I am working on a project where biometric system is used to secure the system. We are planning to use human voice to secure the system.
Idea is to allow the person to say some words or sentences and system will store that voice in digital format.…

Shekhar
- 11,438
- 36
- 130
- 186
9
votes
1 answer
SFSpeechRecognizer that recognizes few command words instead of a whole phrase?
I have a SFSpeechRecognizer set up from Apple's sample app
https://developer.apple.com/library/content/samplecode/SpeakToMe/Introduction/Intro.html
I was wondering if it was possible to have the recognizer recognize individual words that are not…

user3796209
- 449
- 6
- 18
9
votes
2 answers
C# - Free Offliine speech recognition library (SDK)
C# - Free speech recognition Engine library (SDK)
System.Speech.Recognition is very bad... I want another SDK that give me good results and works with c# on Visual Studio...
and i want it OFFLINE not online like google api
Thanks

Majd Akrama
- 103
- 2
- 4
9
votes
1 answer
English US language code changed? Google Speech Api v2 not returning the correct result
Just noticed the English US words no longer display the correct spelling. This previously was ok - now it displays the English UK spelling. Below is a list of some words that I found .For example if I say center and set the language code as en-US I…

Sunil Sunny
- 3,949
- 4
- 23
- 53
9
votes
1 answer
How to use Androids' speech to text with audio sample file
I was able to run an example project which uses Android speech to text using this code:
private void startVoiceRecognitionActivity()
{
Intent intent = new Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH);
…

Moonlit
- 5,171
- 14
- 57
- 95
9
votes
1 answer
Web Speech API Custom Words
I read through the W3C docs on this and I'm thinking that custom words come from custom grammar, but I tried going to this demo and in the console entered the following javascript:
recognition.grammars.addFromString('foo');
Which ran fine and…

kentcdodds
- 27,113
- 32
- 108
- 187
9
votes
2 answers
Using System.Speech to convert mp3 file to text
I'm trying to use the speech recognition in .net to recognize the speech of a podcast in an mp3 file and get the result as string. All the examples I've seen are related to using microphone but I don't want to use the microphone and provide a sample…

Soham Dasgupta
- 5,061
- 24
- 79
- 125
9
votes
4 answers
C#: transcribe WAV file to text (speech-to-text) with System.Speech namespaces
How do you use the .NET speech namespace classes to convert audio in a WAV file to textual form which I can display on the screen or save to file?
I am looking for some tutorial samples.
UPDATE
Found a code sample here. But when I tried it it gives…

deostroll
- 11,661
- 21
- 90
- 161