Questions tagged [google-speech-api]

With the Google Speech API you can convert speech to text, either from an audio file or from a live stream.

The Google Cloud Speech API (https://cloud.google.com/speech/) is part of the Google Cloud Platform products (https://cloud.google.com/products/) and provides speech-to-text conversion.

When using a live stream, words are returned in near real time. A stream is limited to 1 minute of audio and also ends when the speaker pauses for about 1 second. With asynchronous file recognition, the audio can be up to 80 minutes long. See https://cloud.google.com/speech/limits
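As a rough illustration of the limits quoted above, the following sketch picks a recognition mode from the audio length. It makes no API calls. The function name, the constants, and the returned labels are hypothetical and only for illustration, and the numbers are taken from this tag description, so they may not match the current quotas on the limits page.

```python
# Limits as quoted in the tag description above (assumption: they may be outdated).
STREAMING_LIMIT_SECONDS = 60       # live stream: about 1 minute
ASYNC_LIMIT_SECONDS = 80 * 60      # asynchronous file: up to 80 minutes


def choose_recognition_mode(duration_seconds: float, live: bool) -> str:
    """Return a hypothetical label describing how the audio could be handled."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    if live:
        # A live stream longer than the limit has to be restarted in pieces.
        if duration_seconds > STREAMING_LIMIT_SECONDS:
            return "streaming-split"
        return "streaming"
    # Recorded files go through asynchronous recognition if they fit.
    if duration_seconds > ASYNC_LIMIT_SECONDS:
        return "too-long"
    return "asynchronous"


if __name__ == "__main__":
    print(choose_recognition_mode(30, live=True))
    print(choose_recognition_mode(45 * 60, live=False))
```

A check like this is only a pre-flight guard; the authoritative values are the ones listed at https://cloud.google.com/speech/limits.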

For more information, see https://cloud.google.com/speech/docs/

837 questions
11
votes
3 answers

Google Streaming Speech Recognition on an Audio Stream Python

I have searched through all the available docs of Google but I could not find an example of streaming speech recognition on an audio stream in Python. Currently, I am using Speech Recognition for Python in Django to get the audio from the user and…
indexOutOfBounds
  • 541
  • 7
  • 27
10
votes
2 answers

Cannot import com.google.cloud.speech.v1.SpeechGrpc in Android

I'm trying to use Google's Speech API in an Android project. The example project works, but I'm having trouble using it in my own Android app. build.gradle(Module:app) : apply plugin: 'com.android.application' apply plugin: 'com.google.protobuf' ext { …
10
votes
2 answers

Call to Google Cloud Speech API doesn't return anything, fails after 10 minutes

I am trying to use Google.Cloud.Speech.V1 (client libraries for Google Cloud Speech API), and I am using this slightly modified version of Google's sample code: public async Task TranscribeSpeech(string filenameAndPath, int WAVSampleRate =…
10
votes
1 answer

HotwordDetector using VoiceInteractionService in android

I am creating a voice recognition service like "Ok Google" with a custom command. For this I have used the VoiceInteractionService and I referred to the code from here. Also, I am getting the STATE_HARDWARE_UNAVAILABLE error in onAvailabilityChanged.…
Nisar
  • 828
  • 1
  • 10
  • 28
9
votes
4 answers

google cloud speech ImportError: cannot import name 'enums'

I'm using the google-cloud-speech API for my project. I'm using pipenv for the virtual environment. I installed the google-cloud-speech API with pipenv install google-cloud-speech and pipenv update google-cloud-speech. I followed this docs…
Linh Nguyen
  • 3,452
  • 4
  • 23
  • 67
9
votes
2 answers

How to Google Speech-to-Text using Blob sent from Browser to Nodejs Server

I am trying to set up a server to receive audio from a client browser using SocketIO, then process it through Google Speech-to-Text, and finally reply back to the client with the text. Originally and ideally, I wanted to set up to function somewhat…
9
votes
2 answers

Load JSON file's content to Heroku's environment variable

I am using Google Speech API in my Django web-app. I have set up a service account for it and am able to make API calls locally. I have pointed the local GOOGLE_APPLICATION_CREDENTIALS environment variable to the service account's json file which…
9
votes
1 answer

gcloud auth application-default print-access-token error: "invalid_grant: Bad Request"

I'm trying to do the tutorial for Google Cloud Speech. I did the first two steps successfully. The response from Step 2 is: Activated service account credentials for: [my-service-account@cloud-speech-quickstart.iam.gserviceaccount.com] The third…
Thomas David Kehoe
  • 10,040
  • 14
  • 61
  • 100
9
votes
1 answer

Creating suitable WAV files for Google Speech API

I'm using pyaudio to record my voice as wav file. I'm using following code: def voice_recorder(): FORMAT = pyaudio.paInt16 CHANNELS = 2 RATE = 22050 CHUNK = 1024 RECORD_SECONDS = 4 WAVE_OUTPUT_FILENAME = "first.wav" …
JayGatsby
  • 1,541
  • 6
  • 21
  • 41
9
votes
2 answers

What audio file types does Google Cloud Speech API recognize?

I'm trying to use Google's Cloud Speech API. There's documentation and code examples here: https://cloud.google.com/speech/docs/basics https://cloud.google.com/speech/docs/rest-tutorial I can get the sample code to run just fine if I point it to an…
9
votes
1 answer

Possible ways for the blind to enter email and password

I'm creating an emailing application for the visually impaired using java and I'm having a problem when entering the email and password. The entire application is voice based and I'm using the google speech API for converting speech to text. I've…
9
votes
5 answers

(Android Studio Speech Recognizer) I'm getting error 9 (insufficient Privileges) even though I have given it RECORD_AUDIO and INTERNET

package blessupboys.speechtest; import android.app.Activity; import android.content.Context; import android.content.Intent; import android.net.ConnectivityManager; import android.os.Bundle; import android.view.View; import…
9
votes
1 answer

English US language code changed? Google Speech Api v2 not returning the correct result

Just noticed the English US words no longer display the correct spelling. This previously was ok - now it displays the English UK spelling. Below is a list of some words that I found. For example if I say center and set the language code as en-US I…
8
votes
1 answer

ImportError: cannot import name 'speech' from 'google.cloud' (unknown location)

I installed google.cloud system-wide using pip install google-cloud, but then I get the error ImportError: cannot import name 'speech' from 'google.cloud' (unknown location). How can I correct this error?
Nimantha
  • 183
  • 1
  • 3
  • 10
8
votes
1 answer

Detect fluency from google speech api results

I'm trying to determine the fluency of a speaker using the Google Speech (to text) API. So far I have found that the API (betav1) can show the time taken to speak a word (its starting time and ending time). And from Wikipedia, Oral fluency or…