Highest Voted 'speech-to-text' Questions

5

votes

1 answer

Speech to Text on Android with custom unusual word matching

I would like to be able to use Android's Speech-To-Text engine to recognize a variety of unusual words in sentences. To give an example, the word "electroencephalograph" comes out of STT as "electronics supply graph". When I use Soundex or…

asked Jan 28 '13 at 15:39

Buns of Aluminum

2,439
3
26
44

5

votes

2 answers

Sample example for Speech to Text in iOS

I am new to iOS programming.Can you please tell me how to convert the speech to text in iOS?Is there any API called?Please suggest me how to proceed?

ios cocoa-touch speech-to-text

asked Dec 24 '12 at 10:00

rani

593
3
10
28

5

votes

2 answers

Which minimum android version is required for speech to text application

I have done the coding for converting speech into text.I just want to know which minimum version of android required for this.

android speech-to-text

asked Oct 20 '12 at 14:25

Nitin Gupta

236
3
10

4

votes

1 answer

Trying to use Google Speech2Text in C#

The following simple code tries to post a wave file to Google Speech2Text service, but always fails with either a "Gateway Timeout (504)" or general exception "The operation timed out". Can anyone help please? public void ProcessWaveFile(string…

c# httpwebrequest speech-to-text

asked Jan 08 '12 at 15:13

dotNET

33,414
24
162
251

4

votes

0 answers

How to use grammar text editors for Speech-to-Text documents in JavaScript / NodeJS

I'm relatively new to programming (1 year working as an intern, and finishing grad), and might be biting more than I can chew, this is also my first interaction here (yey) So let me explain the problem thoroughly: I'm currently using Google Speech…

node.js google-cloud-platform nlp speech-recognition speech-to-text

asked May 11 '23 at 16:59

Gustavo Tavares

49
5

4

votes

1 answer

Efficient speaker diarization

I am running a VM instance on google cloud. My goal is to apply speaker diarization to several .wav files stored on cloud buckets. I have tried the following alternatives with the subsequent problems: Speaker diarization on Google's API. This seems…

python google-cloud-platform speech-to-text diarization

asked Feb 15 '23 at 10:17

Luis

330
1
11

4

votes

1 answer

What are the ways to implement speech recognition in Electron?

So I have an Electron app that uses the web speech API (SpeechRecognition) to take the user's voice, however, it's not working. The code: if ("webkitSpeechRecognition" in window) { let SpeechRecognition = window.SpeechRecognition ||…

javascript electron speech-recognition speech-to-text

asked Jan 18 '23 at 18:08

XYBOX

69
7

4

votes

1 answer

(Mis)-using open.ai whisper for text-to-text translation

I noticed that transcribing speech in multiple languages with openai whisper speech-to-text library sometimes accurately recognizes inserts in another language and would provide the expected output, for example: 八十多个人 is the same as 八十几个人. So 多 and…

machine-learning speech-recognition speech-to-text machine-translation openai-whisper

asked Dec 03 '22 at 15:12

ccpizza

28,968
18
162
169

4

votes

1 answer

Adding transcriptions to Google Speech-to-text to enhance recognition

In our church we have a few Ukrain refugees that visit the churc. To give them un understanding of the sermon, I made an app to send the translations real-time to Telegram. I have implemented the Google speech-to-text API following this tutorial:…

google-cloud-platform speech-to-text google-speech-api google-cloud-speech google-speech-to-text-api

asked May 16 '22 at 10:54

Martijn van der Maas

71
1
6

4

votes

2 answers

SpeechBrain: Cannot Load Pretrained Model from Local Path

I'm trying to load a pretrained SpeechBrain HuggingFace model from local files; I don't want it to call out to HuggingFace to download. However, unless I change the pretrained_path in hyperparams.yaml, it is still calling out to HuggingFace and…

speech-recognition speech-to-text torch huggingface-transformers huggingface-tokenizers

asked Feb 01 '22 at 05:32

Nat G

191
1
15

4

votes

2 answers

Voice to Text recognition

I am a beginner in Android development. Is there any possibility to write a speech to text software that could access Googles network based back-end voice to text system ?

java android speech-recognition speech-to-text

asked Aug 16 '11 at 08:59

Illep

16,375
46
171
302

4

votes

0 answers

Android Speech Recognition Custom Audio Source

Android RecognizerIntent documentation states that public static final String EXTRA_AUDIO_INJECT_SOURCE The extra key used in intent is providing an already opened audio source for the RecognitionService to use. Data should be a URI to an audio…

android android-intent uri speech-to-text

asked Nov 25 '21 at 06:49

Vishal

53
2

4

votes

1 answer

How to get a transcript of an audio or video call within a js web app? I.e. how to route a MediaStream to a speech-to-text API

I want to make a web-app which does video calls with live transcription -- using some 3rd party speech-to-text service (e.g. Google or Amazon). So the peer-to-peer MediaStream would be played to the users, and also sent to the API for…

javascript web-applications speech-to-text peerjs mediastream

asked Mar 09 '21 at 12:02

Daniel Winterstein

2,418
1
29
41

4

votes

5 answers

Speech To Text using C#

I am trying to design a text editor using C# language and implement voice recognition for the normal file features , is this possible to implement. I am very sorry if I am repeating the question which has been asked previously. I just want to know…

c# .net speech-to-text

asked Jul 04 '11 at 09:43

ArunKumar

41
1
2

4

votes

1 answer

How to feed an audio file from S3 bucket directly to Google speech-to-text

We are developing a speech application using Google's speech-to-text API. Now our data (audio files) get stored in S3 bucket on AWS. is there a way to directly pass the S3 URI to Google's speech-to-text API? From their documentation it seems this is…

amazon-s3 speech-recognition speech-to-text google-speech-api google-cloud-speech

asked Dec 23 '20 at 10:38

Anuj Gupta

6,328
7
36
55

Questions tagged [speech-to-text]