Questions tagged [azure-speech]

Azure Speech is a cloud-based speech-to-text service in the Microsoft Azure family of machine learning and artificial intelligence APIs.

140 questions
0
votes
1 answer

Batch create transcription always results in: The recordings URI contains invalid data

I would like to use Azure Speech Services Batch Transcription APIs to create a transcription of my audio file. I've already had success using the Speech Service SDK (for Node.js), but was interested in trying out one of the newer features available…
shanewwarren
  • 2,234
  • 22
  • 17
0
votes
1 answer

Reading WindowsMicrophoneStream for Azure Speech SDK on HoloLens

I want to perform real-time speech recognition on the HoloLens 2 with Unity 2021, and I am using the Microsoft Azure Cognitive Services Speech SDK to do so. Instead of the default HoloLens 2 microphone stream, I want to switch to the Stream Category…
Leado
  • 25
  • 6
0
votes
1 answer

Azure Text-to-Speech SDK vs API: when to use which?

I’m new to using Azure Cognitive Services (and programming in general). The Azure Text to Speech service is provided through two interfaces: the Speech SDK and the REST API. I was wondering when one should choose to use the SDK or REST API? What are…
0
votes
1 answer

Commitment tier pricing plan with custom model

Technical question - it's unclear from the Speech to Text pricing page ( https://azure.microsoft.com/en-us/pricing/details/cognitive-services/speech-services/#pricing ) if custom models are included in the commitment tier pricing plan. I would be…
Ar Y
  • 1
  • 1
0
votes
1 answer

Python error: TypeError: argument of type 'NoneType' is not iterable

I am using Azure Speech Recognition services and I'm having trouble creating commands. I'm using the Python 3.10.4 interpreter. Code: def takeCommand(): speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region) …
MENTOR
  • 1
  • 2
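A minimal sketch of one common way the TypeError above arises: a helper returns None (for example when nothing was recognized) and the caller then applies the `in` operator to the result. The `take_command` and `mentions` helpers and the `recognized_text` parameter are hypothetical stand-ins, not the asker's code, and no Speech SDK call is made here.

```python
from typing import Optional


def take_command(recognized_text: Optional[str]) -> Optional[str]:
    """Hypothetical stand-in for a recognizer wrapper; returns None when nothing was heard."""
    if not recognized_text:
        return None
    return recognized_text.lower()


def mentions(command: Optional[str], keyword: str) -> bool:
    """Check for None before using `in`, which cannot be applied to None."""
    return command is not None and keyword in command


# An unguarded membership test on a None result raises TypeError.
try:
    "open" in take_command("")
    raised = False
except TypeError:
    raised = True

# The guarded version returns False instead of raising.
guarded = mentions(take_command(""), "open")
matched = mentions(take_command("Open Notepad"), "open")
```

Whether this is the cause in the question cannot be told from the truncated excerpt; the sketch only shows the general pattern and a guard for it.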
0
votes
0 answers

How to store the audio recording from the Azure SpeechSynthesizer API as a blob and play it in Angular

We are currently implementing a Spring Boot API that calls the text-to-speech SpeechSynthesizer API. public PronunciationBlob callAzureToTransformTextToSpeech(String text){ //create a azure speech resource/speech services, and get the key from…
0
votes
1 answer

Azure speech to text REST API V3 binary data

I'm trying to use the Azure Speech to Text service. The documentation gives examples that use the V1 API version: https://$region.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1 And basically every link to…
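The V1 endpoint quoted in the question takes the audio bytes directly as the request body. A rough sketch of building such a request with the Python standard library follows. The region, key, `language` query parameter and header values are assumptions drawn from the short-audio REST documentation and should be checked against the current docs; `build_short_audio_request` is a hypothetical helper name. The request is only constructed, not sent, and nothing here says how the V3 API behaves.

```python
import urllib.request
from urllib.parse import urlencode


def build_short_audio_request(region: str, key: str, wav_bytes: bytes,
                              language: str = "en-US") -> urllib.request.Request:
    """Build (but do not send) a POST whose body is the raw WAV bytes."""
    url = (f"https://{region}.stt.speech.microsoft.com"
           "/speech/recognition/conversation/cognitiveservices/v1?"
           + urlencode({"language": language}))
    headers = {
        # Assumed header names and values; verify against the service docs.
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return urllib.request.Request(url, data=wav_bytes, headers=headers, method="POST")


# Placeholder region, key and audio bytes, for illustration only.
req = build_short_audio_request("westeurope", "YOUR_KEY", b"RIFF....")
```

Sending it would be a matter of passing `req` to `urllib.request.urlopen` with a real key and real audio.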
0
votes
1 answer

Azure Text to Speech Error: 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when deploying to Azure App Services Linux environment

I adapted the Python version of the quickstart code for Azure text to speech. The app runs correctly on my local server. However, when I deploy the app through Visual Studio Code to Azure App Services, I get the error: 0x38…
Rui
  • 95
  • 1
  • 14
0
votes
1 answer

Microsoft Speech service: listening to multiple audio inputs

I am developing a speech recognition project in Python, in which I need to recognize audio from multiple microphones. The application is based on Azure Cognitive Services, and the assistant needs to handle multiple microphones. I have seen…
0
votes
1 answer

How to Start Continuous Recognition on a Web Server using the 'use_default_microphone' parameter (Azure Cognitive Services Speech)

I am developing a web app (Python) in which I use Azure Cognitive Services speech translation. I used PyWebIO to create an interface and start a server to run my code. I am using the 'use_default_microphone' parameter in the audio config to speak to…
0
votes
0 answers

java.lang.UnsatisfiedLinkError: 'void com.microsoft.cognitiveservices.speech.SpeechConfig.setTempDirectory(java.lang.String)' - Java CentOS 7

My application is a Spring Boot Java project that connects to Azure services for converting speech to text, analyzing sentiment, and translating. I am trying to deploy my application to a CentOS 7 VM, but I get the error in the title when calling my…
0
votes
1 answer

Microsoft Cognitive Services AudioOutputConfig device_name

https://learn.microsoft.com/en-us/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.audio.audiooutputconfig?view=azure-python How do I specify a device_name? I have tried everything from deviceUID to deviceName; nothing seems to…
0
votes
1 answer

How to setup microphone with Azure Speech Services C# SDK on Raspberry Pi 4

I am playing around with creating a home assistant using a Raspberry Pi 4 and Azure speech services. I have a keyword model set up and listen for the keyword using the following code. using AudioConfig audioConfig =…
0
votes
1 answer

Python SpeechRecognition doesn't listen to full audio?

I'm just trying to get a transcript from an audio file using Python SpeechRecognition. It seems like no matter what pause_threshold I set, or duration or whatever, it always gives me the same exact output, approximately 30 seconds out of 80…
0
votes
1 answer

SpeechSynthesizer and SSML

I have been trying to work with the prosody pitch attribute, but it doesn't seem straightforward and doesn't seem to work. I want to create a simple "do re mi" following the G major scale. The results do not turn out as expected using the various Hz values. …
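A small sketch of how the Hz values for such a "do re mi" could be derived and placed into SSML. It assumes equal temperament with A4 at 440 Hz, takes G4, A4 and B4 as the first three degrees of G major, and uses `en-US-JennyNeural` purely as an example voice name; `midi_to_hz` and `build_ssml` are hypothetical helpers. Whether the synthesized voice actually lands on these absolute pitches is up to the service, which may be exactly the difficulty the question describes.

```python
def midi_to_hz(note: int) -> float:
    """Equal-temperament frequency with A4 (MIDI note 69) tuned to 440 Hz."""
    return 440.0 * 2 ** ((note - 69) / 12)


# First three degrees of G major starting at G4: G4, A4, B4 (MIDI 67, 69, 71).
DO_RE_MI = [("do", 67), ("re", 69), ("mi", 71)]


def build_ssml(voice: str = "en-US-JennyNeural") -> str:
    """Wrap each syllable in a prosody element with an absolute pitch in Hz."""
    notes = "".join(
        f'<prosody pitch="{round(midi_to_hz(midi))}Hz">{syllable}</prosody>'
        for syllable, midi in DO_RE_MI
    )
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xml:lang="en-US">'
        f'<voice name="{voice}">{notes}</voice>'
        "</speak>"
    )


ssml = build_ssml()
```

The resulting string could then be passed to a synthesizer's SSML entry point; that call is left out here because it needs a real subscription key.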