Questions tagged [azure-speech]

Azure Speech is a cloud-based speech-to-text service in the Microsoft Azure family of machine learning and artificial intelligence APIs.

140 questions
0
votes
1 answer

Twilio to Azure Continuous Speech To Text

I would like to use the Twilio verb to send audio to Azure's Continuous Speech to Text API to do real-time transcription on a call. I've successfully used the code below to send an audio file to Azure Speech to Text, but when I push the Twilio…
0
votes
1 answer

Custom grammars supported by Microsoft Azure's Speech to text recognition service?

I have the two queries below: Can we use VXML-based grammars with Microsoft's cloud speech recognition service? Can we use compiled (binary-format) grammars with Microsoft's cloud speech recognition service? We have integrated the C++ SDK provided by…
0
votes
1 answer

"Echo Cancellation" for Azure Speech service if not using headsets

I am using the Azure Speech SDK for speech recognition in my application (a Chrome extension). The problem I am facing is that, if I am not using a headset, the speech from another person in the conference (echo) gets recognised. This is not…
0
votes
1 answer

Audio to Text from Blob trigger

I have a use case where I want to upload audio files (.WAV) to blob storage, which triggers a Function that gets the text from the audio. At the moment, the only way possible is having the audio file locally. The audio config can't take the URI…
0
votes
1 answer

Azure Cognitive Services - Text-to-Speech - SpeakTextAsync does not return

I am having an issue with my .NET WPF application freezing when using Azure Text-to-Speech services. From debugging, it seems SpeakTextAsync synthesizes the text I send but never completes or returns a value, causing the application to be…
Stephen Stewart
0
votes
2 answers

Automating Training for Custom Speech Model

I am working on a requirement where I have created a custom speech model for an Azure chatbot. I have uploaded some audio files to https://speech.microsoft.com, created a model, and then deployed it manually to get the model ID and speech recognition…
0
votes
1 answer

Multi Lingual Bot with Audio input and output using React

I am working on a requirement where I have to embed an Azure chatbot into React. I need to enable audio input and output as well as detection of the user's language and translation. I have seen Microsoft documentation where this is done from…
0
votes
1 answer

How can I get only text part out of recognised object in Microsoft Speech Service

The following is my output of speech recognition from a file using the Microsoft Azure Speech SDK. I want to know how I can extract just the 'text' part from this output rather than…
0
votes
1 answer

Azure Speech to Text word capitalization with custom model

I'm adding some domain-specific words to the prebuilt models and the words are being recognized correctly, but they are not being capitalized as I specified in the transcription (I trained the model using audio + human labeling). There's no…
Sampaio
0
votes
0 answers

No description for this method of Azure Cognitive Services SpeechRecognizer class

In my Text to Speech related code, I use SpeechSynthesizer.SynthesisCompleted Event of the SpeechSynthesizer class of the Microsoft.CognitiveServices.Speech namespace to handle the scenarios when a speech is completed. I was looking for a similar…
nam
0
votes
2 answers

Azure Cognitive Services - Speech to text: Microphone not detected

Question: Why is the following code not detecting the microphone on my Windows 10 machine? Outside of the code, the microphone works fine and I can use it by pressing Windows key + H. I can speak into it and it writes my spoken speech into any focused text…
nam
0
votes
1 answer

Long MP3 uploaded to Azure Speech-to-Text API returns no results after a significant period

I have uploaded a long MP3 file (around 8 hours) to Azure's Speech-to-Text API, using this. However, 16 hours later, there are still no transcript files available, using this. I have previously done the same process with a 7-hour-long video and have…
OD1995
0
votes
1 answer

InvalidPayload when posting MP3 for transcription when using Azure Speech to Text API v3.0

Using https://uksouth.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/CreateTranscription/console, I posted this: { "contentUrls": [ …
OD1995
0
votes
1 answer

How to simulate the transcription result with pytest without actually sending data to Azure Speech API

I'm having trouble coming up with a pytest function to test the function below: Is there a way to mock any function/object such that I can fake a transcription result that's consistent every time I run the test and not send any data to the actual…
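
One common way to approach this is to pass the recognizer into the function under test and replace it with a `unittest.mock.Mock` in the test, so nothing is sent to the service. The sketch below is a minimal illustration only: the `transcribe` helper is hypothetical (the asker's original function is not shown), and it assumes the recognizer exposes a `recognize_once()` call whose result has a `text` attribute.

```python
from unittest.mock import Mock


def transcribe(recognizer):
    """Hypothetical function under test: return the text of one recognition."""
    result = recognizer.recognize_once()
    return result.text


def test_transcribe_returns_fake_text():
    # Stand-in for the real recognizer, so no audio leaves the machine.
    fake_recognizer = Mock()
    # Every call returns the same canned result, making the test deterministic.
    fake_recognizer.recognize_once.return_value = Mock(text="hello world")

    assert transcribe(fake_recognizer) == "hello world"
    # Confirm the function actually asked the recognizer for a result.
    fake_recognizer.recognize_once.assert_called_once_with()
```

If the function under test builds its own recognizer internally, `unittest.mock.patch` on the name it imports would be the usual alternative to injecting the dependency.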
0
votes
2 answers

Is there a way to add encryption (CMK, customer-managed keys) and VNet integration to the Azure Speech service (Cognitive) through an ARM template?

"encryption": { "keyVaultProperties": { "keyName": "string", "keyVersion": "string", "keyVaultUri": "string" }, Can we achieve this within the ARM template?