Questions tagged [google-speech-to-text-api]
190 questions
1
vote
0 answers
Android: TTS initialization failed on TV
I'm working on a Text-To-Speech & Speech-To-Text based app. App is working fine on almost all device. But when I'm trying to use this on TV, it is giving me error "TTS Initialization failed!". I tried multiple apps from Github but this device is…

Priyank Android
- 34
- 8
1
vote
1 answer
google speech to text not working correctly with very short audio (single words)
I'm testing google Speech-to-Text API with streaming audio as well as with wav files.
I'm using audio from telephony: 8000 sample rate, 8bits, mulaw encoding. The Google configuration is set appropriately.
When I test it with normal sequences, it…

ylvi-bux
- 37
- 6
1
vote
1 answer
Custom phrases/words are ignored by Google Speech-To-Text
I am using python3 to transcribe an audio file with Google speech-to-text via the provided python packages (google-speech).
There is an option to define custom phrases which should be used for transcription as stated in the docs:…

sam
- 60
- 8
1
vote
1 answer
Firebase Emulator request to Google Cloud Speech to Text Api denied
I would like to test the Google Cloud Speech-to-Text API from within Firebase Emulators. I currently have a trigger set on Firebase Storage that automatically gets fired when I upload a file via the Emulator Storage UI. This makes a request to the…

CambodianCoder
- 467
- 4
- 14
1
vote
0 answers
ImportError: cannot import name 'client_pb2' from 'google.api'
import io
import os
os.environ["GOOGLE_APPLICATION_CREDENTIALS"]="ip-boundary-996e9662df4c.json"
from google.cloud import speech
I am trying to load the library required to use Google STT, but an error occurs. Here's what the error…

yugwon
- 11
- 3
1
vote
1 answer
When should I use the enhanced video model with Google Cloud's speech to text api?
The enhanced model for phone calls means something to me because there is generally a particular quality/sound to the audio in a phone call. I don't know what to expect with the 'video' enhanced model, however, and there seems to be no documentation…

Rob Allsopp
- 3,309
- 5
- 34
- 53
1
vote
1 answer
Can you pin the model version of Google Speech to Text?
I would like to transcribe audio using the Google speech to text API (STT), but I need the transcriptions to be consistent over time. In other words, even if Google improves the STT model, is it possible to pin the version of the STT model I used…

Bertie Ancona
- 13
- 2
1
vote
0 answers
Converting ogg base64 to other audio format for using speech-to-text
I am using speech-to-text service and unfortunately the 'OPUS' encoding does not work for my content, or any other audio/ogg content that is not corrupted.
There is a way to convert audio/ogg bufferArray/base64 to other audio format in base 64 using…

Gal Shalom
- 43
- 6
1
vote
1 answer
Asynchronous Streaming with Flask
So, I have a socket.io connection with a React.js frontend that sends a blob of wav audio data every 100 ms to my Flask backend. On my backend, I am maintaining a thread safe queue
audio_buffer = queue.Queue()
@socketio.on('stream')
def…

dragon18456
- 41
- 7
1
vote
1 answer
Google Speech Diarization label changes all the time
I'm buidling an application to transcribe a real-time audio stream using Google Speech-To-Text with speaker diarization enabled (see previous questions for background: 1, 2, 3). Ideally, the output should look something like this:
00:00, speaker 1:…

Frank
- 93
- 9
1
vote
1 answer
Flutter Speech to Text not working on Android
I have an app in Flutter that works on Android and iOS.
On iOS everything works perfect. But on Android I'm facing some issues with speech to text...
I'm using this plugin: https://pub.flutter-io.cn/packages/speech_to_text/
My code for Android…

Faabass
- 1,394
- 8
- 29
- 58
1
vote
1 answer
Google cloud speech to text difficulty on certain short words
I've built a voice recognition system with Angular/websockets/node js, and google speech to text api.
I works very well on almost all words, but it has real issues with the word "no" - it seems almost as though the word "no" doesn't even get passed…

bionara
- 228
- 1
- 10
1
vote
1 answer
Gcloud error 401: REquest had invalid authentication credentials
I'm working on a project where I need to access a Gcloud service account. However, I've been encountering issues with authentication. This is the following error from my command prompt:
My command:
curl -s -H "Content-Type: application/json" \
-H…

justin Yang
- 57
- 2
- 7
1
vote
1 answer
Google speech to text API does not return anything not even an error
I tried to get the text from a random youtube video just to try out the API but it does not return anything. It does workwhen using the sample file 'audio.raw' from this…

TyLO
- 111
- 2
- 13
1
vote
0 answers
Send live recording from HTML frontend to Google Cloud Speech via Flask backend
Alright, so I'm working on a class project and I'm trying to send a recording made using javascript's navigator.mediaDevices.getUserMedia and MediaRecorder classes to the backend of my web application (written in Python, Flask) and to the Google…

Jacob Crisan
- 39
- 3