Questions tagged [google-cloud-speech]

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API.

The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

try it

576 questions

votes

2 answers

400 Specify MP3 encoding to match audio file

I am trying to use the google-speech2text api however, I keep getting "Specify MP3 encoding to match audio file" even though I have setup my code to go through all available encoders. This is the file I am trying to use I have to add, If I upload…

google-speech-api google-cloud-speech

asked Aug 14 '19 at 20:11

Areza

5,623
7
48
79

votes

1 answer

Need help Speech-to-text, always failed for too many retries

I use google speech-to-text API to get subtitles from audio, but when audio is too long, normally longer than 60 minutes, it will fail for too many retries.It says: google.api_core.exceptions.GoogleAPICallError: None Too many retries, giving up. Can…

google-cloud-speech

asked Aug 09 '19 at 15:38

DLI42

votes

1 answer

speech to text throws error Audio Timeout Error: Long duration elapsed without audio. Audio should be sent close to real time

Speech to text throws an error after some time: Grpc.Core.RpcException: Status(StatusCode=OutOfRange, Detail="Audio Timeout Error: Long duration elapsed without audio. Audio should be sent close to real time.") Another error…

google-cloud-speech

asked Jul 10 '19 at 13:19

Dombivali

votes

3 answers

Google Cloud Text-to-Speech API - permission error

I am attempting to set up Google Cloud Text-to-Speech API following these instructions - https://cloud.google.com/text-to-speech/docs/quickstart I have successfully followed steps 1-6 for setting up Google SDK and authenticating with service account…

google-cloud-platform text-to-speech google-cloud-speech

asked Apr 12 '18 at 15:50

MKB

votes

1 answer

Google cloud speech API not transcription from base64 audio

I'm trying to use Google cloud speech API, by recording audio in my Angular FrontEnd, converting it to base64, sending it to my Node backend, which does the query to google speech API. So far I have had no success with it, google only sending me…

node.js angular google-cloud-speech

asked Feb 15 '18 at 12:35

Oumpaloumpa

votes

3 answers

how to play .opus audio file in android?

I'm developing an app for Recognize speech using google cloud speech APi and recording audio simultaneously. git link I tried to play recorded .opus audio file getting IOException W/System.err: java.io.IOException: setDataSource failed. Any…

android audio android-mediarecorder opus google-cloud-speech

asked Jun 08 '17 at 12:01

Swati Vishnoi

votes

1 answer

Pycharm: set environment variable for google service account key (json credential)

I want to run the google cloud speech python sample in my pycharm. But encountered the ApplicationDefaultCredentialsError: The Application Default Credentials are not available. They are available if running in Google Compute Engine. Otherwise, the…

python pycharm google-cloud-platform google-cloud-speech

asked Oct 04 '16 at 02:26

SHI Zhong Ping

votes

0 answers

Google SpeechToText API: no text in response from OGG Opus audio streaming

I'm using the SpeechToText API to get a speech transcription from a live microphone recording. This is working well with a simple LINEAR16 encoding but I need to reduce the bandwidth so I switched to an OGG Opus encoding. The audio is recorded and…

javascript google-speech-api ogg google-cloud-speech opus

asked Dec 12 '19 at 11:46

eli0tt

votes

2 answers

Failed to detect whether we are running on Google Compute Engine

I am trying to use Google Cloud Speech recognition but I get this error I/omputeEngineCredentials: Failed to detect whether we are running on Google Compute Engine. W/System.err: java.io.IOException: The Application Default Credentials are not…

android google-cloud-platform google-cloud-speech

asked Nov 23 '19 at 17:46

stavros.3p

2,244
6
20
37

votes

3 answers

Google Cloud Storage is giving me Authorization code 401 Invalid Credentials when trying to connect

I have completed the guides here Also i have completed this guides as well (as I want to use both Storage and Speech) Now i have gcloud and it is working in the terminal i also tried executing this command from the shell: curl -s -H…

php google-api google-cloud-storage google-oauth google-cloud-speech

asked Feb 20 '18 at 08:11

Dimitar

1,830
5
32
52

votes

1 answer

Bidirectional Streaming using Tensorflow Serving

I have a model that accepts an arbitrary-length stream of data and performs classification. I use Tensorflow Serving to listen to gRPC requests and perform the classification on a trained model. Google Cloud Speech API has the "Streaming Speech…

python tensorflow grpc tensorflow-serving google-cloud-speech

asked Jan 08 '18 at 10:10

naktinis

3,957
3
36
52

votes

4 answers

Google Cloud Speech-to-Text (MP3 to text)

I am using Google Cloud Platform Speech-to-Text API trial account service. I am not able to get text from an audio file. I do not know what exact encoding and sample Rate Hertz I should use for MP3 file of bit rate 128kbps. I tried various options…

mp3 speech-to-text google-cloud-speech

asked Dec 15 '17 at 06:31

Vikash Patel

votes

1 answer

How to highlight filler words (Um, uh, ah) in transcript?

Is there a configuration in google cloud speech that allows me to see the filler words in the returned JSON transcript? Currently, it seems like the transcript returned by google cloud speech automatically filters out such words (uh, um, ah, like,…

ruby google-cloud-speech

asked Nov 27 '17 at 23:17

jerryhdev

votes

1 answer

error in Request for Google Cloud Speech API request

I'm getting this error when I make my request to the Google Cloud Speech API: "error": { "code": 400, "message": "Request contains an invalid argument.", "status": "INVALID_ARGUMENT My curl command looks like this: curl -s -k -H…

google-cloud-platform google-cloud-speech

asked Apr 06 '17 at 16:09

topplethepat

votes

3 answers

Google speech API timeout time

I'm trying to use the Speech API client APIs in order to convert an audio file to text. So far I have succeeded in converting a short audio clip, but now with a longer file (10 minutes) I get this error: Retry total timeout exceeded before…

node.js timeout google-speech-api google-cloud-speech

asked Mar 13 '17 at 16:33

don

4,113
13
45
70

Prev 1

…

38 39 Next