Questions tagged [google-speech-to-text-api]

190 questions
0
votes
1 answer

What algorithms could be used for the matching/pairing of word lists?

I'm working on some C# code to automate mapping of audio to text using the Google Speech-to-Text API for a language-study tool. I have a list of words parsed from the human-written source (transcript), along with sentence-break information. I have a…
0
votes
1 answer

Google speech to text timestamp is continuous but there are gaps between the spoken words

I used Google's Speech-to-Text to transcribe an audio file with word timestamps enabled, and that part works. My problem is that the timestamps are continuous: there is no gap between words according to the timestamps (with seconds and nanoseconds)…
0
votes
2 answers

Trying to make a speech recognition program with Python but the program doesn't seem to hear me

The first except block runs every time I speak into the microphone, please help! ''' import speech_recognition as sr # get audio from the microphone r = sr.Recognizer() with…
0
votes
1 answer

How does gcloud manage to work with service-account-only APIs?

I'm trying to understand how gcloud manages to work with APIs that require a service account to access them, e.g. accessing the Speech API using your user (not service account) credentials will result in "403 Your application has authenticated using end user…
Zaar Hai
0
votes
1 answer

How do I extract transcript with multiple speakers from Google Video Intelligence API Speech Transcription JSON output using jq?

I'm testing out Google Video Intelligence speech-to-text for transcribing podcast episodes with multiple speakers. I've extracted an example and published that to a gist: output.json. cat file.json | jq…
0
votes
0 answers

Calling the Google Speech-to-Text API from Flask Celery is crashing Celery

Trying to run Google Speech-to-Text from Celery, but the worker thread is crashing. Below is the code that is being executed from the Celery task. # Google speech to text function from google.cloud import speech_v1p1beta1 from…
0
votes
1 answer

How to pass metadata to the Google Speech-to-Text API - Swift iOS

Can anyone please help me find the official documentation for the pod used in this example: https://github.com/GoogleCloudPlatform/ios-docs-samples/tree/master/speech/Swift/Speech-gRPC-Streaming Moreover, I am working on an iOS app in which we have google speech to…
0
votes
1 answer

Google Speech to Text Locale creation for language other than English

I want to integrate the Urdu language into my Android application using Google Speech to Text. According to Google, Urdu is supported for speech to text. Can you guide me as to how I can create this: class AimyboxApplication : Application(),…
0
votes
1 answer

Recording audio with NAudio cuts me off

I am recording audio to send to Google Speech-to-Text, but when I record with NAudio it captures only 5 seconds and then stops recording. I am pasting the code in C#; this is my first time using this API, but I don't…
Mauricio
0
votes
1 answer

Running an asynchronous function within a Cloud Function

I am trying to implement the Speech-to-Text API from this tutorial using Cloud Functions (Python): Speech-to-Text long running. The one implemented here is an asynchronous, long-running function. My aim is to have the Cloud Function download an audio…
0
votes
0 answers

When streaming, is there a limit to how long a person can continuously speak before an IsFinal response is forced?

This is something of a two-part question. I'm writing a Unity application using Google Speech-to-Text streaming, and I noticed that if a user continues talking for about a minute without pausing, Google sends IsFinal even if they haven't taken…
0
votes
1 answer

Is it possible to perform two speech-to-text streams at same time from one account?

The question is in the title. I can create a speech-to-text stream for one user and it works well, but when I try to connect a second user, it works badly for everyone, and only sometimes works well for one user. So the question is, can I create a speech-to-text stream for two…
0
votes
1 answer

javax.sound.sampled.LineUnavailableException: line with format PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame

I'm creating a runnable JAR from a project in Eclipse, which consists of basic sample code for speech recognition using Cloud Speech. The problem is that when I run the code from Eclipse it works fine, but when I create a runnable JAR and execute…
0
votes
1 answer

Not able to set google speech constructor using credential options

Hello, I am trying to avoid the JSON approach and instead use the credentials option to initialize the gcloud speech client. const speech = require('@google-cloud/speech'); const client = new speech.SpeechClient({credentials:{private_key : "",…
stackjohnny
0
votes
0 answers

Google Speech to text API in web application

I want to try the Google Speech-to-Text API in a JSF application (which generates HTML for the client). I searched, but could not find a suitable resource on the web. It is available for Java SE, but can't be converted to Java Web. Can…