Questions tagged [speech-to-text]

The translation of spoken words into text. Possible synonyms include automatic speech recognition, ASR, computer speech recognition, speech to text, STT.

2372 questions
7
votes
2 answers

How to hide toast message “Your audio will be sent to google to provide speech recognition service.” in Android?

I am using google speech recognizer for integrating voice services in Android but while pressing on mic button this annoying toast message is showing. Please suggest me a way to hide this toast message. This is my java code public class FormActivity…
Shivam Kumar
  • 1,892
  • 2
  • 21
  • 33
7
votes
5 answers

Offline Speech Recognition in browser

I am working on a product that need to take inputs from user and do certain actions based on it. We have implemented it with a chat box via typing and it is serving our purpose. For the future releases we want to add voice recognition to the chat…
7
votes
1 answer

Speech recognition on iPhone 5

I am using the iOS speech recognition API from an Objective-C iOS app. It works on iPhone 6, 7, but does not work on iPhone 5 (iOS, 10.2.1). Also note it works on iPhone 5s, just not iPhone 5. Is the iOS speech API suppose to work on iPhone 5? Do…
James
  • 17,965
  • 11
  • 91
  • 146
7
votes
4 answers

How to convert speech to text in java?

Can anyone share some code snippet used to convert Speech to text in Java.
TheHat
  • 434
  • 3
  • 6
  • 12
7
votes
1 answer

How to split speech data on frames and compute MFCC

I understand the basic steps of creating an automated speech recognition engine. However, I need a clear-er idea of how segmentation is done and what are frames and samples. I will write down what I know and expect the answer-er to correct me in the…
cipher
  • 2,414
  • 4
  • 30
  • 54
7
votes
3 answers

Get user input from Speech?

I have just started trying out the Windows Speech to Text capabilities in C# .Net. I currently have the basics working (IE - Say something, and it will provide output based on what you say). However, I am struggling to figure out how to actually…
JosephGarrone
  • 4,081
  • 3
  • 38
  • 61
7
votes
2 answers

Is there software that outputs speech-to-text at the Phonological level?

Is there any software out there capable of taking audio files and outputting phonological (IPA) text? I understand much of the software out there takes it straight to a language, but is there one that is 'teachable'?
7
votes
1 answer

Speech recognition for recorded audio files in .3gp or wav format

Possible Duplicate: Voice recognition on android with recorded sound clip? I am developing an Android app which will record audio to a file, and then use speech recognition to get text from the recorded speech. Is there a speech recognition…
7
votes
5 answers

Windows 8 speech to text and text to speech API

I guess the question pretty much says it all? And I would prefer not to access cloud services such as Microsoft Translator and Project Hawaii? Is there any direct API I can access? (For metro apps)
cjds
  • 8,268
  • 10
  • 49
  • 84
6
votes
1 answer

Python Speech Compare

I have two .wav files that I need to compare and decide if they contain the same words (same order too). I have been searching for the best method for a while now. I can't figure out how to have pyspeech use a file as input. I've tried getting the…
Kreuzade
  • 757
  • 5
  • 11
  • 22
6
votes
1 answer

Redirecting all input from Dragon NaturallySpeaking to Python? (Using Natlink)

I am currently writing an AI program that receives input from Dragon NaturallySpeaking (using Natlink), processes it, and returns a spoken output. I was able to come up with a Receiver GrammarBase that captures all input from Dragon and sends it to…
6
votes
4 answers

How can I bulk/batch transcribe wav files using python?

im trying to use my python app to transcribe multiple files in a folder and speed up the process. At present I am able to do it one file at a time - ####RUN THIS PART FIRST######### import json from os.path import join, dirname from ibm_watson…
user12384956
6
votes
1 answer

Programmatically toggle dictation on MacOS

Due to injury I use dictation on MacOS: As can be seen from the screenshot, I can toggle it using a keyboard shortcut. I wish to toggle it from code (preferably ObjC). I can manually inject the events: // Assumes CTRL OPT CMD Space toggles…
P i
  • 29,020
  • 36
  • 159
  • 267
6
votes
2 answers

Could anyone help me turn the profanity filter off for Google speech recognizer?

I'm trying to do a speech-to-text recognition for a wav. file I have, with Google, Google_Cloud, and Houndify. I've noticed that with the latter two, they show no problem with profanities and but the Google speech recognizer filters the word, for…
KKKM
  • 61
  • 2
6
votes
4 answers

How to continuously to do speech recognition while outputting the recognized word as soon as possible

I am working on a project that involves speech recognition using the SpeechRecognition module. One thing I want to do to improve my speech recognition is to be able to output the words that have been recognized as soon as possible. I want it to be…
IShaan
  • 93
  • 1
  • 1
  • 6