Questions tagged [google-speech-to-text-api]
190 questions
0
votes
1 answer
What algorithms could be used for the matching/pairing of word lists?
I'm working on some C# code to automate mapping of audio to text using the Google Speech-to-Text API for a language-study tool. I have a list of words parsed from the human-written source (transcript), also with sentence break information. I have a…

jtsoftware
- 521
- 3
- 14
0
votes
1 answer
Google speech to text timestamp is continuous but there are gaps between the spoken words
I have used google's speech to text to transcribe an audio in which I have enabled the timestamp not an issue till then.
My problem is the timestamps are continuous there is no gap between words as per the timestamp ( with seconds and nano seconds )…

chitharthan
- 5
- 4
0
votes
2 answers
Trying to make a speech recognition program with python but the program doesn't seem to hear me
the first except block runs every time i speak into the microphone, please help!
'''
import speech_recognition as sr
# get audio from the microphone
r = sr.Recognizer()
with…
0
votes
1 answer
How gcloud manages to work with svcacc only API?
I'm trying to understand how gcloud manages to work with APIs that require service account to access them, e.g. accessing Speech API using your user (not svcacc) credentials will result in "403 Your application has authenticated using end user…

Zaar Hai
- 9,152
- 8
- 37
- 45
0
votes
1 answer
How do I extract transcript with multiple speakers from Google Video Intelligence API Speech Transcription JSON output using jq?
I'm testing out Google Video Intelligence speech-to-text for transcribing podcast episodes with multiple speakers.
I've extracted an example and published that to a gist: output.json.
cat file.json | jq…

InfoMiner
- 69
- 6
0
votes
0 answers
Callin google Speech to Text API from Flask Celery is crashing the celery
Trying to run the google speech to text from celery , but worker thread is crashing. Below is the code that is being executed from the celery task.
# Google speech to text function
from google.cloud import speech_v1p1beta1
from…

SkyTreasure
- 854
- 2
- 13
- 23
0
votes
1 answer
how to pass metadata google speech to text api - swift ios
anyone please help me finding official document of pod used in this example: https://github.com/GoogleCloudPlatform/ios-docs-samples/tree/master/speech/Swift/Speech-gRPC-Streaming
moreover I am working on an iOS app in which we have google speech to…

Inzamam Malik
- 3,238
- 3
- 29
- 61
0
votes
1 answer
Google Speech to Text Locale creation for language other than English
I want to integrate Urdu Language in my android application using Google Speech to Text. Urdu language is supported according to google for speech to text. Can you guide me as to how I can create this:
class AimyboxApplication : Application(),…

Zaid Alvi
- 45
- 6
0
votes
1 answer
Recording audio with NAudio cuts me off
I am recording an audio to send it to send it to Google speech to text but when I make the audio with naudio it records me only 5 seconds and from there it stops recording. I copy the code in C #, this is my first time using this API, but I don't…

Mauricio
- 1
- 1
0
votes
1 answer
Running an asynchronous function within a Cloud Function
I am trying to implement the Speech-to-Text API in this tutorial using Cloud Functions (python)
Speech-to-Text long running
The one implemented here is an asynchronous, long running function. My aim is to have the Cloud Function download an audio…

oikonomiyaki
- 7,691
- 15
- 62
- 101
0
votes
0 answers
When streaming, is there a limit to how long a person can consistently speak before an IsFinal response is forced
This is something of a two part question.
I'm writing a unity application using google speech to text streaming and it was noticed that if a user continues talking for about a minute without pausing, google sends IsFinal even if they haven't taken…

SSal
- 1
0
votes
1 answer
Is it possible to perform two speech-to-text streams at same time from one account?
So, question is on title, I can create speech to text stream to one user and it works nice, but when I try to connect second user, it works bad for all, sometimes it works good for one user, so question is, can I create sppech-to-text stream for two…

Bill Johan
- 83
- 1
- 7
0
votes
1 answer
javax.sound.sampled.LineUnavailableException: line with format PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame
I'm creating a run-able jar from a project in Eclipse, which consist of a basic sample code for speech recognition using cloud speech. Anyhow the problem is when I run the code from Eclipse it works fine, but when I create a run-able jar and execute…

Ankush Singh
- 11
- 6
0
votes
1 answer
Not able to set google speech constructor using credential options
Hello I am trying not to use json approach and to use credential option to initiate the gcloud speech client.
const speech = require('@google-cloud/speech');
const client = new speech.SpeechClient({credentials:{private_key : "",…

stackjohnny
- 645
- 3
- 7
- 22
0
votes
0 answers
Google Speech to text API in web application
I want to try Google Speech to text API in a JSF application (which for the client generates html). I tried to search, but there is no suitable resource available in the web. It is available for Java SE, but can't be converted to Java Web.
Can…

Buddhika Ariyaratne
- 2,339
- 6
- 51
- 88