Questions tagged [google-speech-api]

With Google Speech API you can convert speech to text file based or live stream

The https://cloud.google.com/speech/ and is part of https://cloud.google.com/products/ to allow for speech to text conversion.

When using a live stream words are returned almost realtime. This is limited by 1 minute or when paused for ~1 second. When using an asynchronous file the speech to text can be as long as 80 minutes. See https://cloud.google.com/speech/limits

For more see https://cloud.google.com/speech/docs/

837 questions
0
votes
1 answer

Google Speech API Call

I am trying to use Google Speech API v1 in .net and facing a challange in making a call. Below is my code. Dim userSpeech = Google.Cloud.Speech.V1.SpeechClient.Create Dim response = userSpeech.Recognize(New RecognitionConfig,…
Sukhi
  • 13,261
  • 7
  • 36
  • 53
0
votes
0 answers

Translating a large file with small API rate limits on Google Speech API

I have an audio file that is 24 hours and 100MB that I want to transcribe/translate using nodejs and the Google Speech API. The Google Speech API will not support processing more than 1 min of audio nor 10MB of data at a time. How can I get this…
0
votes
1 answer

recognize mp4 file using Google Speech

I'm using postman in order to execute POST method to google cloud speech api : https://speech.googleapis.com/v1/speech:recognize?key= { "config": { "encoding": "FLAC", "sampleRateHertz": 16000, "languageCode": "en-US" }, "audio":…
0
votes
1 answer

How to get both interim Transcript and final transcript in webkitSpeechRecognition

I am developing a JavaScript app that uses webkitSpeechRecognition. In my code and what's a must i have set interimResult as true so as a user can see when the app is finding a match for the spoken words but i myself want to use the final results…
0
votes
1 answer

Does TokBox support a codec compatible with Google Speech API?

Google Speech API claims to support a number of codecs (https://cloud.google.com/speech/docs/basics). I'm interested in processing an archive of a session produced by TokBox WebRTC. Is there a sample code that does something like this? Does the…
0
votes
0 answers

Record audio from both sides of the skype call in nodejs

I am writing a node.js program for transcribing skype calls using Google speech api, for that I am easily able to read microphone input but not able to read what the other person is speaking. Is there a way in which this can be done using node.js.
0
votes
1 answer

Problems with new Google Speech API

I am trying to migrate to Google's new speech API, as the beta one is expiring. However the new one does not seem to be working. To start with, this install seems to be failing: pip install --upgrade google-cloud-speech It produces the following…
Lee Melbourne
  • 407
  • 5
  • 20
0
votes
1 answer

How do I hide the speechRecogniser activity behind the main activity?

I am trying to hide the speech recogniser dialog that shows up saying "speak now" and written google as the title ,whenever i call it for passing some voice command. I don't want it to show up. Instead I want it to work in teh background so that the…
0
votes
1 answer

How to change data stream to the google speech recognition (Python)

I have code which stream audio data from laptop microphone to the google Speech recog., but i want to stream audio from other source. From that source i can get buffer of raw data, and this buffer is what i want to stream to the google.Can somebody…
Yety
  • 1
  • 1
  • 2
0
votes
1 answer

Does Google Cloud Speech API support regional endpoints

Is it possible to co-locate the cloud speech service in a region where the calling application resides or is there just a single endpoint? I am wanting to reduce latency by getting my client application as close to the google service as possible, my…
user1371314
  • 762
  • 7
  • 24
0
votes
2 answers

How can I use speech recognition in Python on a proxy network?

It works quite well on a proxy free network but whenever I try to run it on a proxy network it gives this error. Could not request results from Google STT; recognition connection failed: [Errno 11001] getaddrinfo failed Github link for the…
0
votes
3 answers

Google Cloud Speech API real time recognition

I am developing a Python application for real-time translation. I need to recognize speech in real time: as user says something it automatically sends this piece of audio to Google Speech API and returns a text. So I want the recognized text…
Taras
  • 81
  • 1
  • 5
0
votes
1 answer

While accessing google speech api, I am receiving 403 forbidden from server

I understand that this questions is discussed around 4-5 times but none of the solutions posted had helped me. I created google developer account and enabled Google speech api, also registered project and have generated API_KEY. I am using JARVIS…
0
votes
0 answers

php exec doesn't work for python with API

I get litteraly mad about my issue. what i'm trying to do : upload a ".flac" file to my server (it works) use à php script to launch à python script (doesn't work) Python script call the google speech API to convert voice to text (work when…
floprm
  • 72
  • 2
  • 11
0
votes
1 answer

Google Speech Api Credential Error with Raspbian (Raspberry pi)

i want to make a speech to text program so i used Google Speech Api. When i launched the code on my pc it works, but on raspberry pi it doesnt. Probably it is about Credential, but i tried so many things. For example: i add…
Tony Stark
  • 135
  • 3
  • 13