Azure speech is a cloud based speech to text service in the Microsoft Azure family of machine learning and artificial intelligence APIs
Questions tagged [azure-speech]
140 questions
1
vote
1 answer
How can we give the input file from storage container to azure speech api using python
Below is the code,
call_name1="test.wav"
blob_client1=blob_service_client.get_blob_client("bucket/audio",call_name1)
print(blob_client1)
streamdownloader=blob_client1.download_blob()
stream =…

Raphael Titus
- 173
- 12
1
vote
1 answer
Azure Speech to text result undefined nodejs
I am trying to implement a speech to text with userMedia. I write stream in the file on Nodejs server file created successfully but when try to convert speech to text with Azure fromStreamInput getting an undefined result.
var subscriptionKey =…

Arsalan Subhan
- 33
- 6
1
vote
1 answer
Azure Speech Recognition not detecting microphone SPXERR_MIC_NOT_FOUND
I have a small sample application to test speech recog. It works in some machines but not in other machines. In my dev environment where I first installed the necessary packages, it all worked 100% with no issues. But, my team mates are unable to…

R123
- 61
- 3
1
vote
2 answers
azure speech to text rest api python Connection aborted. OSError 10054, WSAECONNRESET
I am trying to test azure speech to text the rest API. First of all I used POSTMAN client and it worked fine.
Now I am trying the same thing using python but getting an error.
I am following instructions given as per the…

Prateek Saurabh
- 53
- 7
1
vote
1 answer
Speaker Diarizations vs speaker recognition google cloud vs microsoft azure vs ibm watson vs aws transcribe
I want to do a project of speech-to-text analysis where I would like to 1) Speaker recognition 2) Speaker diarization 3)Speech-to-text. Right now I am testing various APIs provided for various companies like Microsoft, Google, AWS, IBM etc
I could…

Prateek Saurabh
- 53
- 7
1
vote
1 answer
Why is the Speech To Text from Azure so slow?
I'm using Azure Speech To Text API to recognize small spoken recordings, from 10 seconds to 1 minute.
Each speech recognition takes around 5 seconds to complete, which seems a bit too much!
Here is how I do it:
speech_config =…

Be Chiller Too
- 2,502
- 2
- 16
- 42
1
vote
2 answers
Azure speech to text with numbers
A use case for my app is to convert speech (single word utterances) to text. I need to use Azure speech to text for this. Sometimes the speech needs to be converted into an integer - I need to submit the response as a quantity for example.
My…

David Christopher Reynolds
- 1,361
- 1
- 18
- 37
1
vote
1 answer
Azure Cognitive Services - Batch Transcription API responds with error message "The recording URI is invalid."
Steps Followed:
I created a speech services instance in West US.
I have the right headers(Content-Type and Ocp-Apim-Subscription-Key)
I make a POST request to https://westus.cris.ai/api/speechtotext/v2.0/transcriptions/ with below request…

GuruCharan94
- 852
- 9
- 13
1
vote
1 answer
How to use Batch Transcription API through Python
I have create an Azure Batch Transcription Service which will take audio file from Azure blob storage as input and share the Speech to Text with me.
I am able to achieve the above using the https://eastus.cris.ai/swagger/ui/index#/ website, the curl…

Anuja Parikh
- 53
- 1
- 14
1
vote
0 answers
Speech-to-text recognition of multiple voices with Microsoft Speech API?
I would like to know if Microsoft Speech API on Python supports multiple voices recognition. I saw the beta of SDK Speaker Recognition, but I was wondering if this feature was already in the Speech SDK or not.
I have checked all the python SDK…

Julie
- 63
- 9
0
votes
0 answers
How to get Azure Speech Synthesis SDK / API to work in a Python container?
I am trying to get Azure Speech Synthesis SDK / API to work in a Python container.
Just trying the quickstart in a Python container CLI will get the error message:
Speech synthesis canceled: CancellationReason.Error
Error details: USP error: timeout…

S7bvwqX
- 147
- 3
- 13
0
votes
0 answers
Absolute positioning start of the scentence in Azure speach synthesis using SSML
I am using Azure - Speech synthesis, and I need scentences to start at exect times. Simplest option would be media as mentioned here SSML Speak tag Absolute value for begin attribute
but Azure does not support media at the moment.
so I tried…

zebra
- 1,330
- 1
- 13
- 26
0
votes
2 answers
No matter what I do, I cannot get the pitch to increase through SSML
I cant seem to get the pitch increase through ssml and am lost at this point. Apologies if this question is not properly asked, it is my first time on stack overflow. I am developing a chatbot that utilizes the Microsoft Azure Text-to-Speech (TTS)…

cwp0627
- 1
0
votes
1 answer
Azure cognitive services speech SDK Python: Beating sound using synthesis callback
Using synthesizing callback, how do we correctly stream the audio data to a file? I want to write to a file as soon as audio data happens, this is not my final intent, but if this works, I can proceed with more abilities later on.
I have to use the…

Newbie
- 1
0
votes
1 answer
Azure Cognitive Speech Service returning "No Match" always in MAUI application
I am trying to build simple speech to text android application using .Net MAUI but always getting result as - Microsoft.CognitiveServices.Speech.ResultReason.NoMatch.
Same code if I tried using console application and it is working as expected and…

Mithilesh
- 1
- 1