Questions tagged [azure-speech]

Azure speech is a cloud based speech to text service in the Microsoft Azure family of machine learning and artificial intelligence APIs

140 questions
0
votes
1 answer

Azure pronunciation assessment SDK not processing audio pieces larger than 1 minute

Im currently using Azure pronunciation assessment example from cognitive services speech SDK with the next code: ` const speechConfig = sdk.SpeechConfig.fromSubscription( API_KEY, REGION, ); speechConfig.speechRecognitionLanguage = 'es-ES'; const…
0
votes
1 answer

Undesired Pause in Azure Speech Service Neural Voice Synthesis for Long Sentences

I attempted to use Azure Speach Service to synthesize long sentences using a Neural Voice having more than 500 characters without seeing a pause in the speech at 500 characters into a given sentence. I have any number of places in text I need to…
0
votes
1 answer

How to fix Azure Cognitive Services Speech SDK quickstart tutorial gives error on Lubuntu 22 and blank mp3 file?

After following the directions on azure text-to-speech quick start and speech sdk tutorial and running it in Lubuntu 22 (Ubuntu 22.04.2 LTS) as guest on VirtualBox gets the error message below. But running the same code in Lubuntu 20 (also as VB…
0
votes
1 answer

Azure speech to text doesn't work in Spanish

I have this azure function written in python import logging import azure.functions as func import azure.cognitiveservices.speech as speechsdk import azure_config def main(req: func.HttpRequest) -> func.HttpResponse: language = 'es-ES' …
0
votes
1 answer

The Display Text is null while converting video to text using Postman API

I am using the cognitive services using Postman API but the display text is not appearing. params- language: en-US headers: Ocp-Apim-Subscription-Key: key Content-Type: audio/wav body- binary: selected file Authorization- bearer Token: Token…
0
votes
0 answers

Azure Cognitive Speech SDK Issue in Vuejs Project

I am importing & using microsoft-cognitiveservices-speech-sdkin VueJS Project but whenever I click on mic, I am getting below issue. This is what I tried: import * as SDK from "microsoft-cognitiveservices-speech-sdk"; import { RecognitionResult }…
0
votes
2 answers

How do I stop Azure TTS from playing back the audio while generating samples?

I am synthesising text using Azure Speech Service's TTS. When setting the audio config, I want to disable the playback of the audio. Per the documentation, AudioOutputConfig's use_default_speaker keyword is False by default. Hence, the following…
0
votes
1 answer

Azure Text to speech is not using the lexicon I created

I use the text to speech from Azure and created a lexicon via the the buttons. Unfortunatally the content of the lexicon does not get uesed by azure. Neither the alias nor phonemes, even the examples (BTW - By the Way) are not used. I tried to…
0
votes
1 answer

Speech SDK trowing error: Exception with an error code: 0xe (SPXERR_MIC_NOT_AVAILABLE)

I have a small script in python: def listen(language): speech_config.speech_recognition_language=language audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True) speech_recognizer =…
Fabian
  • 63
  • 1
  • 13
0
votes
1 answer

Sending Twilio Stream to Azure speech translation

Im working on a solution to do real-time speech translation using azure-cognitiveservices-speech. The azure solution works fine link to boilerplate code from azure only with audio file or input from microphone. When I attempt to pass a stream from a…
Rajesh Rajamani
  • 189
  • 2
  • 14
0
votes
1 answer

How to use azure "speech to text" service to first detect language type and then transcipt it using go sdk?

I have code like below, but it outputs Session Started (ID= 2e73d3d49f934b66af05ca3438339e81) Timed out And it seems can not detect which language is in the audio file. How can I resolve the timeout issue and detect language type ? func…
djy
  • 737
  • 6
  • 14
0
votes
0 answers

How to Use the audio binary response from Azure Text to Speech as Audio source in React application

I am building a react application that uses text to speech. Here I am using Azure text to speech. The API seems to be running fine, and I get a binary file in response. While using thunder client in VS code, it gives me an option to save the file in…
0
votes
1 answer

How to train azure speech to text model on localhost

For demand, I can't use azure speech to text service on the cloud. Therefore, I use azure speech to text container with docker. I use two…
蔣鎧駿
  • 9
  • 1
0
votes
1 answer

Get Speech Studio files from Azure Cognitive Text to Speech API (or SDK)

There is any way to get the files generated in the Speech Studio using the REST API or SDK? I'm working on a project where I want to create several audios from text, I like the Speech Studio tool, so we are thinking on integrate it to the workflow,…
0
votes
0 answers

Why Azure speech to text don't trigger the function recognized/recognizing after written audio buffer?

I'm building a sample speech-to-text using microsoft-cognitiveservices-speech-sdk package. I received output text from recognizing or recognized only if I execute my sample on Cloud shell, Codesandbox or my home PC but I can't get output text if…
Duy Hoang Nguyen
  • 85
  • 1
  • 1
  • 6