Questions tagged [google-cloud-speech]

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API.

  • The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
  • Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

try it

576 questions
6
votes
2 answers

Google Cloud speech with React Native

I am trying to use Google Cloud Speech API so I can pass audio file and receive the translated text but I am stuck to the integration. I already have api key and everything needed but can't find how to use it from react native. In the documentation…
charbinary
  • 1,875
  • 4
  • 17
  • 25
6
votes
2 answers

use Google Application Default Credentials without json file

I have created a console app using c#. I used google cloud speech api. I followed this sample application to create the app. To authenticate speech api, I wrote the following code in main…
Amitava Karan
  • 637
  • 1
  • 5
  • 20
6
votes
3 answers

google cloud speech api returning empty result

I have been using the Chromium Google Speech API and switched over to using the Google Cloud Speech API recently. Ever since the Google cloud speech API got announced, the performance seems to have degraded in terms of the accuracy of recognition. …
rajanb
  • 63
  • 1
  • 5
6
votes
2 answers

Google cloud speech syncrecognize "INVALID_ARGUMENT"

I have managed the "overview tutorial" : https://cloud.google.com/speech/docs/getting-started Then I tried to use my own audio file . I uploaded a .flac file with a sample rate of 16000Hz. I only changed the sync-request.json file below with my own…
Damien Romito
  • 9,801
  • 13
  • 66
  • 84
6
votes
2 answers

Google Cloud Speech use custom vocabulary

I would like to use Google Cloud Speech for recognizing speech in a specific domain (healthcare). I know it already allows to add additional words to the vocabulary of the recognizer (word hints), but this must be done at each API call. Do you know…
5
votes
2 answers

How can I authorize Google Speech-to-text from Google Apps script?

I'm trying to execute google-speech-to-text from apps script. Unfortunately, I cannot find any examples for apps script or pure HTTP, so I can run it using simple UrlFetchApp. I created a service account and setup a project with enabled…
roma
  • 1,512
  • 10
  • 20
5
votes
1 answer

Google Cloud Platform: Speech to Text Conversion of Large Media Files

I'm trying to extract text from mp4 media file downloaded from youtube. As I'm using google cloud platform so thought to give a try to google cloud speech. After all the installations and configurations, I copied the following code snippet to get…
5
votes
0 answers

Google Speech API word offset timestamps are inaccurate

I have some audio files (25 GB) for which I want to provide a feature for the user to see the highlighted word synced to the audio as it's being played. I was looking to Google Speech API to transcribe the files and provide the data for the word…
5
votes
0 answers

Audio stream from cordova-plugin-audioinput to Google Speech API

For a cross-platform app project using Meteor framework, I'd like to record microphone inputs and extract speech thanks to Google Speech API Following Google documentation, I'm more specifically trying to build an audio stream to feed the Google…
5
votes
3 answers

How to get the result of a long-running Google Cloud Speech API operation later?

Below is a snippet that calls the Google Cloud Speech API long running operation to convert an audio file to text from google.cloud import speech speech_client = speech.Client() audio_sample = speech_client.sample( content=None, …
Anil Shanbhag
  • 950
  • 1
  • 13
  • 31
5
votes
1 answer

Streaming microphone input to Google Speech API

I have looked into Google Cloud Speech API and got streaming my microphone working on a Node server. I was then wondering what would be best practice for streaming my microphone from a web frontend? Is it sending an audiostream from getUserMedia to…
Adam Byström
  • 51
  • 1
  • 3
5
votes
0 answers

Why Google Cloud Speech API doesn't transcript the whole audio file?

I'm trying to transcript a short interview audio file with Google Cloud Speech API (asynchronously) but it only transcribes the first half minute of the recording. I had several attempts with recordings longer than one minute and the results were…
5
votes
5 answers

Google Cloud Speech: Distinguish Voices?

I am interested in writing a voice recognition application that is aware of multiple speakers. For example if Bill, Joe, and Jane are talking then the application could not only recognize sounds as text but also classify the results by speaker (say…
user800576
5
votes
1 answer

Google Cloud Speech API returned "Quota exceeded"

My desktop Python application transcribes large sets of speech samples (each speech sample duration is between 1 sec and 8 sec long) and for that I am using Google Cloud Speech API. I am aware of all the usage limitations mentioned in this page and…
vishnu
  • 83
  • 1
  • 8
4
votes
1 answer

Adding transcriptions to Google Speech-to-text to enhance recognition

In our church we have a few Ukrain refugees that visit the churc. To give them un understanding of the sermon, I made an app to send the translations real-time to Telegram. I have implemented the Google speech-to-text API following this tutorial:…
1 2
3
38 39