Questions tagged [google-cloud-speech]

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API.

The API recognizes over 80 languages and variants, to support your global user base. You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.

try it

576 questions

votes

2 answers

Google Cloud speech with React Native

I am trying to use Google Cloud Speech API so I can pass audio file and receive the translated text but I am stuck to the integration. I already have api key and everything needed but can't find how to use it from react native. In the documentation…

reactjs react-native google-cloud-speech

asked Feb 27 '17 at 15:27

charbinary

1,875
4
17
25

votes

2 answers

use Google Application Default Credentials without json file

I have created a console app using c#. I used google cloud speech api. I followed this sample application to create the app. To authenticate speech api, I wrote the following code in main…

google-speech-api google-cloud-speech

asked Feb 22 '17 at 11:01

Amitava Karan

votes

3 answers

google cloud speech api returning empty result

I have been using the Chromium Google Speech API and switched over to using the Google Cloud Speech API recently. Ever since the Google cloud speech API got announced, the performance seems to have degraded in terms of the accuracy of recognition. …

google-cloud-platform google-cloud-speech

asked Sep 26 '16 at 21:21

rajanb

votes

2 answers

Google cloud speech syncrecognize "INVALID_ARGUMENT"

I have managed the "overview tutorial" : https://cloud.google.com/speech/docs/getting-started Then I tried to use my own audio file . I uploaded a .flac file with a sample rate of 16000Hz. I only changed the sync-request.json file below with my own…

flac google-cloud-speech

asked Sep 21 '16 at 15:04

Damien Romito

9,801
13
66
84

votes

2 answers

Google Cloud Speech use custom vocabulary

I would like to use Google Cloud Speech for recognizing speech in a specific domain (healthcare). I know it already allows to add additional words to the vocabulary of the recognizer (word hints), but this must be done at each API call. Do you know…

google-cloud-speech

asked Sep 07 '16 at 13:25

user2748515

votes

2 answers

How can I authorize Google Speech-to-text from Google Apps script?

I'm trying to execute google-speech-to-text from apps script. Unfortunately, I cannot find any examples for apps script or pure HTTP, so I can run it using simple UrlFetchApp. I created a service account and setup a project with enabled…

google-apps-script oauth jwt service-accounts google-cloud-speech

asked Apr 27 '20 at 19:30

roma

1,512
10
20

votes

1 answer

Google Cloud Platform: Speech to Text Conversion of Large Media Files

I'm trying to extract text from mp4 media file downloaded from youtube. As I'm using google cloud platform so thought to give a try to google cloud speech. After all the installations and configurations, I copied the following code snippet to get…

google-cloud-platform speech-recognition speech-to-text google-speech-api google-cloud-speech

asked Nov 14 '18 at 19:43

Bilal Ahmed Yaseen

2,506
2
23
48

votes

0 answers

Google Speech API word offset timestamps are inaccurate

I have some audio files (25 GB) for which I want to provide a feature for the user to see the highlighted word synced to the audio as it's being played. I was looking to Google Speech API to transcribe the files and provide the data for the word…

google-cloud-speech

asked Aug 03 '18 at 17:01

Zeeshan Ansari

votes

0 answers

Audio stream from cordova-plugin-audioinput to Google Speech API

For a cross-platform app project using Meteor framework, I'd like to record microphone inputs and extract speech thanks to Google Speech API Following Google documentation, I'm more specifically trying to build an audio stream to feed the Google…

javascript cordova meteor google-speech-api google-cloud-speech

asked Dec 19 '17 at 13:54

Kevin

votes

3 answers

How to get the result of a long-running Google Cloud Speech API operation later?

Below is a snippet that calls the Google Cloud Speech API long running operation to convert an audio file to text from google.cloud import speech speech_client = speech.Client() audio_sample = speech_client.sample( content=None, …

python google-cloud-speech

asked May 12 '17 at 19:26

Anil Shanbhag

votes

1 answer

Streaming microphone input to Google Speech API

I have looked into Google Cloud Speech API and got streaming my microphone working on a Node server. I was then wondering what would be best practice for streaming my microphone from a web frontend? Is it sending an audiostream from getUserMedia to…

node.js stream speech google-cloud-speech

asked Mar 28 '17 at 09:35

Adam Byström

votes

0 answers

Why Google Cloud Speech API doesn't transcript the whole audio file?

I'm trying to transcript a short interview audio file with Google Cloud Speech API (asynchronously) but it only transcribes the first half minute of the recording. I had several attempts with recordings longer than one minute and the results were…

google-api speech-recognition speech-to-text google-speech-api google-cloud-speech

asked Feb 02 '17 at 12:00

Gex

2,092
19
26

votes

5 answers

Google Cloud Speech: Distinguish Voices?

I am interested in writing a voice recognition application that is aware of multiple speakers. For example if Bill, Joe, and Jane are talking then the application could not only recognize sounds as text but also classify the results by speaker (say…

speech-recognition voice-recognition google-cloud-speech

asked Feb 01 '17 at 18:23

user800576

votes

1 answer

Google Cloud Speech API returned "Quota exceeded"

My desktop Python application transcribes large sets of speech samples (each speech sample duration is between 1 sec and 8 sec long) and for that I am using Google Cloud Speech API. I am aware of all the usage limitations mentioned in this page and…

google-cloud-speech

asked Oct 27 '16 at 19:11

vishnu

votes

1 answer

Adding transcriptions to Google Speech-to-text to enhance recognition

In our church we have a few Ukrain refugees that visit the churc. To give them un understanding of the sermon, I made an app to send the translations real-time to Telegram. I have implemented the Google speech-to-text API following this tutorial:…

google-cloud-platform speech-to-text google-speech-api google-cloud-speech google-speech-to-text-api

asked May 16 '22 at 10:54

Martijn van der Maas

Prev 1 2

…

38 39 Next