The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.
Questions tagged [speech-to-text]
2372 questions
7
votes
2 answers
How to hide toast message “Your audio will be sent to google to provide speech recognition service.” in Android?
I am using google speech recognizer for integrating voice services in Android but while pressing on mic button this annoying toast message is showing. Please suggest me a way to hide this toast message.
This is my java code
public class FormActivity…

Shivam Kumar
- 1,892
- 2
- 21
- 33
7
votes
5 answers
Offline Speech Recognition in browser
I am working on a product that need to take inputs from user and do certain actions based on it. We have implemented it with a chat box via typing and it is serving our purpose. For the future releases we want to add voice recognition to the chat…

Pawan Kumar
- 1,443
- 2
- 16
- 30
7
votes
1 answer
Speech recognition on iPhone 5
I am using the iOS speech recognition API from an Objective-C iOS app.
It works on iPhone 6, 7, but does not work on iPhone 5 (iOS, 10.2.1).
Also note it works on iPhone 5s, just not iPhone 5.
Is the iOS speech API suppose to work on iPhone 5? Do…

James
- 17,965
- 11
- 91
- 146
7
votes
4 answers
How to convert speech to text in java?
Can anyone share some code snippet used to convert Speech to text in Java.

TheHat
- 434
- 3
- 6
- 12
7
votes
1 answer
How to split speech data on frames and compute MFCC
I understand the basic steps of creating an automated speech recognition engine. However, I need a clear-er idea of how segmentation is done and what are frames and samples. I will write down what I know and expect the answer-er to correct me in the…

cipher
- 2,414
- 4
- 30
- 54
7
votes
3 answers
Get user input from Speech?
I have just started trying out the Windows Speech to Text capabilities in C# .Net. I currently have the basics working (IE - Say something, and it will provide output based on what you say). However, I am struggling to figure out how to actually…

JosephGarrone
- 4,081
- 3
- 38
- 61
7
votes
2 answers
Is there software that outputs speech-to-text at the Phonological level?
Is there any software out there capable of taking audio files and outputting phonological (IPA) text?
I understand much of the software out there takes it straight to a language, but is there one that is 'teachable'?

James
- 71
- 1
7
votes
1 answer
Speech recognition for recorded audio files in .3gp or wav format
Possible Duplicate:
Voice recognition on android with recorded sound clip?
I am developing an Android app which will record audio to a file, and then use speech recognition to get text from the recorded speech.
Is there a speech recognition…

user1916399
- 79
- 1
- 1
- 4
7
votes
5 answers
Windows 8 speech to text and text to speech API
I guess the question pretty much says it all?
And I would prefer not to access cloud services such as Microsoft Translator and Project Hawaii?
Is there any direct API I can access?
(For metro apps)

cjds
- 8,268
- 10
- 49
- 84
6
votes
1 answer
Python Speech Compare
I have two .wav files that I need to compare and decide if they contain the same words (same order too).
I have been searching for the best method for a while now. I can't figure out how to have pyspeech use a file as input. I've tried getting the…

Kreuzade
- 757
- 5
- 11
- 22
6
votes
1 answer
Redirecting all input from Dragon NaturallySpeaking to Python? (Using Natlink)
I am currently writing an AI program that receives input from Dragon NaturallySpeaking (using Natlink), processes it, and returns a spoken output. I was able to come up with a Receiver GrammarBase that captures all input from Dragon and sends it to…

Jian
- 61
- 2
6
votes
4 answers
How can I bulk/batch transcribe wav files using python?
im trying to use my python app to transcribe multiple files in a folder and speed up the process.
At present I am able to do it one file at a time -
####RUN THIS PART FIRST#########
import json
from os.path import join, dirname
from ibm_watson…
user12384956
6
votes
1 answer
Programmatically toggle dictation on MacOS
Due to injury I use dictation on MacOS:
As can be seen from the screenshot, I can toggle it using a keyboard shortcut.
I wish to toggle it from code (preferably ObjC).
I can manually inject the events:
// Assumes CTRL OPT CMD Space toggles…

P i
- 29,020
- 36
- 159
- 267
6
votes
2 answers
Could anyone help me turn the profanity filter off for Google speech recognizer?
I'm trying to do a speech-to-text recognition for a wav. file I have, with Google, Google_Cloud, and Houndify.
I've noticed that with the latter two, they show no problem with profanities and but the Google speech recognizer filters the word, for…

KKKM
- 61
- 2
6
votes
4 answers
How to continuously to do speech recognition while outputting the recognized word as soon as possible
I am working on a project that involves speech recognition using the SpeechRecognition module. One thing I want to do to improve my speech recognition is to be able to output the words that have been recognized as soon as possible. I want it to be…

IShaan
- 93
- 1
- 1
- 6