Questions tagged [google-speech-to-text-api]

190 questions
0
votes
1 answer

How to properly use for loop in order to get full text out of transcribed .wav file in Flask

I'm trying to get the content out of the audio file using for loop to iterate through content of the file but what I get is the last line of the text. I'm sure the problem is in the main code which is main.py and I can't figure out what could go…
Adam
  • 13
  • 6
0
votes
1 answer

Unity Android - Error deserializing JSON credential data - Google STT API

I boiled down the problem to the SpeechClientBuilder class's function .Build() after setting up credentials which I know are 100% right cause I checked and they worked when running the code on the editor.
Trake Vital
  • 1,019
  • 10
  • 18
0
votes
1 answer

How to identify sentences in a paragraph which is convered from audio to text in python (speech-to-text)

This is my code import speech_recognition as sr import os def speech_to_text(speech_to_text_name): #calling the Recognizer() r = sr.Recognizer() BASE_DIR = os.path.dirname(os.path.abspath(__file__)) # FILE_PATH = os.path.join(BASE_DIR,…
0
votes
1 answer

Google Speech Algorithm. Testing beta features

On the Google Speech algorithm's page (https://cloud.google.com/speech-to-text) there is a "Demo" section where it's possible to upload a file and check the results. Using beta features I was able to get better results, but I'm not able to get…
0
votes
1 answer

How to pass dynamic Google Cloud API Key to client side from server?

Currently, I am testing Google Cloud's Speech API and wondering how to pass dynamic Google Cloud API key to client app from server. The speech function will be on client's app (React Native). Before every request to Google Cloud API or session, I am…
0
votes
1 answer

What role does bit rate play in the accuracy of Google Speech To Text transcription?

I am helping a client convert a video file using ffmpeg and they originally used -b:a 64k while transcoding their video to audio at a sampling rate (-ar 44100 argument in ffmpeg) of 44100. Their objective is that they want to generate the most…
0
votes
2 answers

How to use Google Cloud Speech (V1 API) for speech to text - need to be able to process over 3 hours audio files properly and efficiently

I am looking for documentation and stuff but could not find a solution yet Installed NuGet package Also generated API key However can't find proper documentation how to use API key Moreover, I want to be able to upload very long audio files So what…
0
votes
1 answer

Creating an Ogg packet from Opus buffers in nodejs

I've been pretty stuck on this problem for a few days now, praying that someone will be able to point me in the right direction. I have a stream of Opus buffers as encoded by https://github.com/discordjs/opus I want to send these to the google…
0
votes
1 answer

Google cloud SDK shell not displaying correct transcription text

I used Google cloud SDK Shell to transcribe an audio in Vietnamese. It returns a transcript but the text appears as below. Is there some other command or setting I have to enable to have the correct transcription…
0
votes
0 answers

Google Speech API speech to text only printing one word

I'm working on a project and in my following code when I press run it worke for just one word it doesn't write all my speech My code:
0
votes
1 answer

What does the google's speech-to-text configuration looks like for an .opus audio file

I am passing a .opus audio file to the google's speech-to-text api for transcription. I am using the following configurations: encoding = enums.RecognitionConfig.AudioEncoding.OGG_OPUS language_code = "en-US" sample_rate_hertz = 16000 I am getting…
0
votes
1 answer

How to work with result from google speech to text API

I am working with the google speech to text API. It returns an object of type google.cloud.speech_v1.types.RecognizeResponse. I have found this almost unusable in Python as I cannot iterate over it to get the multiple text strings returned. After…
Lee Melbourne
  • 407
  • 5
  • 20
0
votes
1 answer

Recording audio from microphone in Python

I'm trying to create a virtual assistant in python, which works using speech recognition. While running the program though, it isn't taking any voice input, though there are no errors. My microphone is properly connected and I'm pretty sure there is…
0
votes
0 answers

Google speech to text api (reading from GCS broken)

As it says in the title, I can't get the s2t API to work with gcs. When working with local <1m files, it works well, but when I provide it a gcs link (be it of the same short file or a longer one) I get bad results. I either get nothing, or just a…
0
votes
1 answer

How make Google Speech to Text API Async

I am using Google Speech to Text API to convert audio files. The issue is that in Google Speech to text, we have to wait for the operation to complete even thou we made it async as its documentation says by using long_running_recognize(). Is there a…