Questions tagged [google-cloud-speech]

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API.

  • The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
  • Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

try it

576 questions
7
votes
2 answers

400 Specify MP3 encoding to match audio file

I am trying to use the google-speech2text api however, I keep getting "Specify MP3 encoding to match audio file" even though I have setup my code to go through all available encoders. This is the file I am trying to use I have to add, If I upload…
Areza
  • 5,623
  • 7
  • 48
  • 79
7
votes
1 answer

Need help Speech-to-text, always failed for too many retries

I use google speech-to-text API to get subtitles from audio, but when audio is too long, normally longer than 60 minutes, it will fail for too many retries.It says: google.api_core.exceptions.GoogleAPICallError: None Too many retries, giving up. Can…
DLI42
  • 79
  • 1
7
votes
1 answer

speech to text throws error Audio Timeout Error: Long duration elapsed without audio. Audio should be sent close to real time

Speech to text throws an error after some time: Grpc.Core.RpcException: Status(StatusCode=OutOfRange, Detail="Audio Timeout Error: Long duration elapsed without audio. Audio should be sent close to real time.") Another error…
Dombivali
  • 83
  • 1
  • 5
7
votes
3 answers

Google Cloud Text-to-Speech API - permission error

I am attempting to set up Google Cloud Text-to-Speech API following these instructions - https://cloud.google.com/text-to-speech/docs/quickstart I have successfully followed steps 1-6 for setting up Google SDK and authenticating with service account…
7
votes
1 answer

Google cloud speech API not transcription from base64 audio

I'm trying to use Google cloud speech API, by recording audio in my Angular FrontEnd, converting it to base64, sending it to my Node backend, which does the query to google speech API. So far I have had no success with it, google only sending me…
Oumpaloumpa
  • 123
  • 1
  • 8
7
votes
3 answers

how to play .opus audio file in android?

I'm developing an app for Recognize speech using google cloud speech APi and recording audio simultaneously. git link I tried to play recorded .opus audio file getting IOException W/System.err: java.io.IOException: setDataSource failed. Any…
7
votes
1 answer

Pycharm: set environment variable for google service account key (json credential)

I want to run the google cloud speech python sample in my pycharm. But encountered the ApplicationDefaultCredentialsError: The Application Default Credentials are not available. They are available if running in Google Compute Engine. Otherwise, the…
6
votes
0 answers

Google SpeechToText API: no text in response from OGG Opus audio streaming

I'm using the SpeechToText API to get a speech transcription from a live microphone recording. This is working well with a simple LINEAR16 encoding but I need to reduce the bandwidth so I switched to an OGG Opus encoding. The audio is recorded and…
eli0tt
  • 677
  • 1
  • 7
  • 19
6
votes
2 answers

Failed to detect whether we are running on Google Compute Engine

I am trying to use Google Cloud Speech recognition but I get this error I/omputeEngineCredentials: Failed to detect whether we are running on Google Compute Engine. W/System.err: java.io.IOException: The Application Default Credentials are not…
stavros.3p
  • 2,244
  • 6
  • 20
  • 37
6
votes
3 answers

Google Cloud Storage is giving me Authorization code 401 Invalid Credentials when trying to connect

I have completed the guides here Also i have completed this guides as well (as I want to use both Storage and Speech) Now i have gcloud and it is working in the terminal i also tried executing this command from the shell: curl -s -H…
6
votes
1 answer

Bidirectional Streaming using Tensorflow Serving

I have a model that accepts an arbitrary-length stream of data and performs classification. I use Tensorflow Serving to listen to gRPC requests and perform the classification on a trained model. Google Cloud Speech API has the "Streaming Speech…
naktinis
  • 3,957
  • 3
  • 36
  • 52
6
votes
4 answers

Google Cloud Speech-to-Text (MP3 to text)

I am using Google Cloud Platform Speech-to-Text API trial account service. I am not able to get text from an audio file. I do not know what exact encoding and sample Rate Hertz I should use for MP3 file of bit rate 128kbps. I tried various options…
Vikash Patel
  • 61
  • 1
  • 3
6
votes
1 answer

How to highlight filler words (Um, uh, ah) in transcript?

Is there a configuration in google cloud speech that allows me to see the filler words in the returned JSON transcript? Currently, it seems like the transcript returned by google cloud speech automatically filters out such words (uh, um, ah, like,…
jerryhdev
  • 71
  • 4
6
votes
1 answer

error in Request for Google Cloud Speech API request

I'm getting this error when I make my request to the Google Cloud Speech API: "error": { "code": 400, "message": "Request contains an invalid argument.", "status": "INVALID_ARGUMENT My curl command looks like this: curl -s -k -H…
topplethepat
  • 531
  • 6
  • 23
6
votes
3 answers

Google speech API timeout time

I'm trying to use the Speech API client APIs in order to convert an audio file to text. So far I have succeeded in converting a short audio clip, but now with a longer file (10 minutes) I get this error: Retry total timeout exceeded before…
don
  • 4,113
  • 13
  • 45
  • 70
1
2
3
38 39