Questions tagged [azure-speech]

Azure Speech is a cloud-based speech-to-text service in the Microsoft Azure family of machine learning and artificial intelligence APIs.

140 questions
0
votes
1 answer

Azure Cognitive Speech TTS Custom Japanese Non-Neural Voice Unavailable

I'm on the S0 Tier for Azure Cognitive Speech services and am trying to train a custom voice for Japanese TTS. My data was successfully processed. But I wasn't able to select "Statistical Parametric" or "Concatenative" as my training method.…
Andrew
  • 73
  • 1
  • 5
0
votes
1 answer

Is there a way to do batch translation using Azure's Cognitive Speech Services?

I am looking to use Azure's Speech to Text translation service while also getting speech diarization, and it seems that diarization is only supported with the batch transcription API calls. Here are some relevant links: Diarization:…
0
votes
1 answer

Azure speech-to-text in react-native: issue passing a wav file

I want to use Azure's STT REST API for my react-native app with recorded audio. I've tried to find out how to pass the wav file, but the response is always "No audio data received" or another error. I'm very sure the subscription key is working because when…
0
votes
1 answer

Using data from a server instead of a file to transcribe with the Microsoft Azure Speech SDK

I am trying to send data to the Azure Speech SDK to transcribe. I want it to receive data from a Python file, put it in a buffer and then transcribe continuously. I am using this sample from the Azure Speech SDK. def…
0
votes
3 answers

How to get NBest alternatives with Azure speech-to-text

I want to get more than one alternative transcription for a single speech utterance using Azure speech-to-text. I have set the format=detailed argument, and the response does include a field called NBest. But that field only ever contains one…
Alain Désilets
  • 509
  • 5
  • 11
0
votes
1 answer

How to concatenate Buffer data and writeFile in JS?

I use the Azure Speech SDK to convert a set of text files to voice. All of the texts are converted successfully and an instance of ArrayBuffer is returned for each text. Then I convert each of them to Buffer and concatenate all of the buffers into one…
ZyadOmer999
  • 117
  • 1
  • 10
0
votes
0 answers

Azure Speech SDK stops speech recognition after 4 minutes of inactivity with status: Service Timeout

I am using the Azure Speech SDK for speech-to-text translation functionality. The service is doing its job of translation; however, after 4 minutes of inactivity the service calls the cancel event, which stops the speech recognition with the status service…
0
votes
0 answers

Refused to create worker from blob in Chrome Extension App while using Microsoft Cognitive Speech service

I'm trying to develop a Chrome extension that recognises speech in Google Meet using the Microsoft Cognitive Speech service. While trying to implement this, I'm getting the following errors: ERROR: Refused to create a worker…
0
votes
1 answer

Create transcription from 'UnmixedMeetingAudio' buffers as soon as they are received - bot with application hosted media

I have created a bot using Bot Framework (.NET) with application hosted media using the sample provided here. The bot attends any incoming peer-to-peer or group calls in Microsoft Teams. While in the meeting I am able to receive audio buffers as…
0
votes
2 answers

Azure Speech-to-text REST API: Create dataset with POST request - Error posting to specific project

I am trying to create a dataset in Azure Speech Studio with the REST API. The schema documentation for the Create Dataset POST request is here. I would like to direct the dataset to a specific project, and it should definitely be possible. But when I…
asguldbrandsen
  • 163
  • 1
  • 2
  • 13
0
votes
1 answer

How can I get begin time and end time for the conversion result of stream audio?

I am using azure-speech to recognize an audio stream. Following speech_recognition_samples.cpp, from class RecognitionResult I can only get the Text and m_duration, but how can I get the begin time and end time of the result in the speech? I know…
Yongqi Z
  • 605
  • 8
  • 20
0
votes
1 answer

Getting error 400 when trying to use Azure Speech Recognition and Flutter

I've been given the task of using the Azure Speech Recognition API in a Flutter application. The app is supposed to record the user's voice and send it to the Azure API. I've tried to use the only pub.dev plugin that I could find, but it did not work…
Victor
  • 91
  • 1
  • 6
0
votes
1 answer

Translate in Python using Azure Speech, directly from stream

I am looking for a function in the Speech SDK, and some examples, that would let me translate with Azure Speech a live stream coming from a source other than the mic. So far, I've found two solutions that don't exactly meet my needs. This one translates…
0
votes
1 answer

Azure Speech JavaScript SDK: Output audio in mp3

I use the sdk.connection methods to capture audio from the speech-to-text recognizer. It creates PCM audio that I want to convert into MP3. This is how the connection is initialised: const con = SpeechSDK.Connection.fromRecognizer(this.recognizer); …
MadeInLagny
  • 185
  • 1
  • 12
0
votes
2 answers

Speech to Text Azure from AudioSource data Unity C#

I would like to use the Azure Speech to Text service to translate the AudioSource object's audio (clip). How can I do it?
Aziz Siyaev
  • 579
  • 1
  • 4
  • 7