Azure Speech is a cloud-based speech-to-text service in the Microsoft Azure family of machine learning and artificial intelligence APIs.
Questions tagged [azure-speech]
140 questions
0
votes
1 answer
Batch create transcription always results in: The recordings URI contains invalid data
I would like to use Azure Speech Services Batch Transcription APIs to create a transcription of my audio file. I've already had success using the Speech Service SDK (for Node.js), but was interested in trying out one of the newer features available…

shanewwarren
- 2,234
- 22
- 17
0
votes
1 answer
Reading WindowsMicrophoneStream for Azure Speech SDK on Hololens
I want to perform real-time speech recognition for the Hololens 2 with Unity 2021 and I am using the Microsoft Azure Cognitive Services Speech SDK to do so. Instead of the default Hololens 2 microphone stream, I want to switch to the Stream Category…

Leado
- 25
- 6
0
votes
1 answer
Azure Text-to-Speech SDK vs API: when to use which?
I’m new to using Azure Cognitive Services (and programming in general).
The Azure Text to Speech service is provided through two interfaces: the Speech SDK and the REST API. I was wondering when one should choose to use the SDK or REST API? What are…
0
votes
1 answer
Commitment tier pricing plan with custom model
Technical question - it's unclear from the Speech to Text pricing page ( https://azure.microsoft.com/en-us/pricing/details/cognitive-services/speech-services/#pricing ) if custom models are included in the commitment tier pricing plan.
I would be…

Ar Y
- 1
- 1
0
votes
1 answer
Python error: TypeError: argument of type 'NoneType' is not iterable
I am using Azure Speech Recognition services and I'm having trouble creating commands.
I'm using the Python 3.10.4 interpreter.
Code:
def takeCommand():
    speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region)
    …

MENTOR
- 1
- 2
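A note on the error in the title: Python raises this TypeError when the `in` operator is applied to `None`. That commonly happens when a function like `takeCommand()` returns `None` after a failed recognition and the caller then searches the result for a keyword. The excerpt is truncated, so whether this is the cause here is an assumption. The sketch below uses a hypothetical stand-in, `take_command_stub`, and a hypothetical helper, `contains_keyword`, instead of the real Speech SDK call:

```python
def take_command_stub(recognized: bool):
    """Hypothetical stand-in for takeCommand(): returns text, or None on failure."""
    return "open the browser" if recognized else None


def contains_keyword(query, keyword: str) -> bool:
    """Check for None before using `in`, so a failed recognition is handled."""
    if query is None:
        return False
    return keyword in query.lower()


# An unguarded membership test on None raises the TypeError from the title.
try:
    "browser" in take_command_stub(recognized=False)
except TypeError as exc:
    print(f"TypeError: {exc}")

# The guarded version returns a plain boolean in both cases.
print(contains_keyword(take_command_stub(recognized=True), "browser"))
print(contains_keyword(take_command_stub(recognized=False), "browser"))
```

Returning an empty string instead of `None` from the recognition function would be another way to avoid the error, at the cost of hiding the failure from the caller.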
0
votes
0 answers
How to store as a blob the audio recording from the Azure SpeechSynthesizer API and play it in Angular
We are currently implementing a Spring Boot API that calls the text-to-speech SpeechSynthesizer API.
public PronunciationBlob callAzureToTransformTextToSpeech(String text){
//create a azure speech resource/speech services, and get the key from…

Fernando
- 381
- 1
- 5
- 20
0
votes
1 answer
Azure speech to text REST API V3 binary data
I'm trying to use the Azure Speech to text service. In the documentation I'm confronted with examples that use the V1 API version:
https://$region.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1
And basically every link to…

Kipras Bielinskas
- 127
- 1
- 10
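For reference, the V1 endpoint quoted in this question takes the region as a URL placeholder. A minimal sketch of filling it in follows. The helper name `short_audio_endpoint`, the `westeurope` region, and the `language` query parameter value are illustrative assumptions, not taken from the question:

```python
from urllib.parse import urlencode

# Path copied from the URL quoted in the question; only the region varies.
V1_PATH = "/speech/recognition/conversation/cognitiveservices/v1"


def short_audio_endpoint(region: str, language: str = "en-US") -> str:
    """Build the V1 speech-to-text URL for a region (hypothetical helper)."""
    base = f"https://{region}.stt.speech.microsoft.com{V1_PATH}"
    return f"{base}?{urlencode({'language': language})}"


print(short_audio_endpoint("westeurope"))
```

This only constructs the URL. Sending audio to it would additionally need a subscription key header and a request body, which are outside this sketch.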
0
votes
1 answer
Azure Text to Speech Error: 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when deploying to Azure App Services Linux environment
I adapted the quick start code Python version here for Azure text to speech.
The app runs correctly on my local server. However, when I deploy the app through Visual Studio Code to Azure App Services, I got the error: 0x38…

Rui
- 95
- 1
- 14
0
votes
1 answer
Microsoft Speech service to listen to multiple audio sources
I am developing a speech recognition project in Python. In it, I need to recognize audio from multiple microphones. The application is based on Azure Cognitive Services and the assistant needs to handle multiple microphones. I have seen…

user235
- 1
- 5
0
votes
1 answer
How to Start Continuous Recognition on a Web Server using the 'use_default_microphone' parameter (Azure Cognitive Services Speech)
I am developing a web app (Python) in which I use Azure Cognitive Services speech translation. I used PyWebIO to create an interface and start a server to run my code. I am using the 'use_default_microphone' parameter in the audio config to speak to…

Cihan Ünlü
- 11
- 3
0
votes
0 answers
java.lang.UnsatisfiedLinkError: 'void com.microsoft.cognitiveservices.speech.SpeechConfig.setTempDirectory(java.lang.String)' - Java CentOS 7
My application is a Spring Boot Java project connecting to Azure services for converting speech to text, analyzing sentiment and translating. I am trying to deploy my application to a CentOS 7 VM, but I get the error in the title when calling my…

meresipere
- 3
- 3
0
votes
1 answer
Microsoft cognitive-services audiooutputconfig device_name
https://learn.microsoft.com/en-us/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.audio.audiooutputconfig?view=azure-python
How do I specify a device_name? I have tried everything from deviceUID to deviceName; nothing seems to…
0
votes
1 answer
How to set up a microphone with Azure Speech Services C# SDK on Raspberry Pi 4
I am playing around creating a home assistant using Raspberry Pi 4 and Azure Speech services. I have a keyword model set up and listen for the keyword using the following code.
using AudioConfig audioConfig =…

ImJhardy
- 1
- 1
0
votes
1 answer
Python SpeechRecognition doesn't listen to full audio?
I'm simply trying to get a transcript from an audio file using Python SpeechRecognition. It seems like no matter what pause_threshold I set, or duration or whatever, it always gives me the same exact output, approximately 30 seconds out of 80…

eeveepotato
- 13
- 3
0
votes
1 answer
SpeechSynthesizer and SSML
I have been trying to work with the prosody pitch attribute, but it doesn't seem straightforward or seem to work. I want to create a simple "do re mi" following the G major scale. The results do not turn out as expected using the various Hz values. …

Ken Netherland
- 77
- 1
- 7
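To illustrate the SSML question above, here is a minimal sketch that builds `prosody` elements with absolute pitch values in Hz for the first three notes of the G major scale (G4, A4, B4 in equal temperament). The helper name `build_ssml` and the voice name `en-US-JennyNeural` are assumptions for illustration. The sketch only generates markup and makes no claim about how closely a synthesized voice follows the requested frequencies:

```python
from xml.sax.saxutils import escape, quoteattr

# Equal-temperament frequencies for G4, A4, B4 (first three notes of G major).
G_MAJOR_HZ = [("do", 392.00), ("re", 440.00), ("mi", 493.88)]


def build_ssml(notes, voice: str = "en-US-JennyNeural") -> str:
    """Wrap each syllable in a <prosody> element with an absolute pitch in Hz."""
    parts = [
        f'<prosody pitch="{hz:g}Hz">{escape(syllable)}</prosody>'
        for syllable, hz in notes
    ]
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">'
        f"<voice name={quoteattr(voice)}>{' '.join(parts)}</voice>"
        "</speak>"
    )


print(build_ssml(G_MAJOR_HZ))
```

The resulting string could be passed to a synthesizer call that accepts SSML. The prosody pitch attribute adjusts the baseline pitch of speech, so the output may not land exactly on the requested notes.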