Azure Speech is a cloud-based speech-to-text service in the Microsoft Azure family of machine learning and artificial intelligence APIs.
Questions tagged [azure-speech]
140 questions
0
votes
1 answer
Twilio to Azure Continuous Speech To Text
I would like to use the Twilio verb to send audio to Azure's Continuous Speech to Text API to do real-time transcription on a call. I've successfully used the code below to send an audio file to Azure Speech to Text, but when I push the Twilio…

Ken Arakelian
0
votes
1 answer
Custom grammars supported by Microsoft Azure's Speech to text recognition service?
I have the following two questions:
Can we use VXML-based grammars with Microsoft's cloud speech recognition service?
Can we use compiled (binary-format) grammars with Microsoft's cloud speech recognition service?
We have integrated the C++ SDK provided by…

aman bhardwaj
0
votes
1 answer
"Echo Cancellation" for Azure Speech service if not using headsets
I am using the Azure Speech SDK for speech recognition in my application (a Chrome extension).
The problem I am facing is that, if I am not using a headset, speech from another person in the conference (echo) gets recognised. This is not…

Sulabha
0
votes
1 answer
Audio to Text from Blob trigger
I have a use case where I want to upload audio files (.WAV) to Blob Storage, which triggers a Function that gets the text from the audio. At the moment, the only way that works is having the audio file locally. The audio config can't take the URI…

Chace Lorans
0
votes
1 answer
Azure Cognitive Services - Text-to-Speech - SpeakTextAsync does not return
I am having an issue with my .NET WPF application freezing when using Azure Text-to-Speech services.
From debugging, it seems that SpeakTextAsync synthesizes the text I send but never completes or returns a value, causing the application to be…

Stephen Stewart
0
votes
2 answers
Automating Training for Custom Speech Model
I am working on a requirement where I have created a custom speech model for an Azure chatbot. I uploaded some audio files to https://speech.microsoft.com, created a model, and then deployed it manually to get the model ID and speech recognition…

Sharvani
0
votes
1 answer
Multi Lingual Bot with Audio input and output using React
I am working on a requirement where I have to embed an Azure chatbot into React. I need to enable audio input and output, as well as detection of the user's language and translation. I have seen Microsoft documentation where this is done from…

Sharvani
0
votes
1 answer
How can I get only the text part out of the recognised object in Microsoft Speech Service
The following is my speech recognition output for a file, using the Microsoft Azure Speech SDK. I want to know how I can extract just the 'text' part from this output rather than…

Arihant Jain
0
votes
1 answer
Azure Speech to Text word capitalization with custom model
I'm adding some domain-specific words to the prebuilt models. The words are being recognized correctly, but they are not being capitalized as I specified in the transcription (I trained the model using audio + human labeling).
There's no…

Sampaio
0
votes
0 answers
No description for this method of Azure Cognitive Services SpeechRecognizer class
In my text-to-speech code, I use the SpeechSynthesizer.SynthesisCompleted event of the SpeechSynthesizer class in the Microsoft.CognitiveServices.Speech namespace to handle scenarios where speech synthesis has completed. I was looking for a similar…

nam
0
votes
2 answers
Azure Cognitive Services - Speech to text: Microphone not detected
Question: Why is the following code not detecting the microphone on my Windows 10 machine? Outside my code, the microphone works fine and I can use it by pressing Windows key + H. I can speak into it and it writes my speech into any focused text…

nam
0
votes
1 answer
Long MP3 uploaded to Azure Speech-to-Text API returns no results after a significant period
I have uploaded a long MP3 file (around 8 hours) to Azure's Speech-to-Text API, using this. However, 16 hours later, there are still no transcript files available, using this.
I have previously followed the same process with a 7-hour video and have…

OD1995
0
votes
1 answer
InvalidPayload when posting MP3 for transcription when using Azure Speech to Text API v3.0
Using https://uksouth.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/CreateTranscription/console, I posted this:
{
"contentUrls": [
…

OD1995
0
votes
1 answer
How to simulate the transcription result with pytest without actually sending data to Azure Speech API
I'm having trouble coming up with a pytest function to test the function below:
Is there a way to mock any function/object such that I can fake a transcription result that's consistent every time I run the test and not send any data to the actual…

Andrew
0
votes
2 answers
Is there a way to add encryption (CMK, customer-managed keys) and VNet integration to the Azure Speech service (Cognitive Services) through an ARM template?
"encryption": {
"keyVaultProperties": {
"keyName": "string",
"keyVersion": "string",
"keyVaultUri": "string"
},
Can we achieve this within the ARM template?

twinkle hema