Highest Voted 'azure-speech' Questions

0

votes

1 answer

Batch create transcription always results in: The recordings URI contains invalid data

I would like to use Azure Speech Services Batch Transcription APIs to create a transcription of my audio file. I've already had success using the Speech Service SDK (for Node.js), but was interested in trying out one of the newer features available…

transcription azure-speech

asked Sep 13 '22 at 02:08

shanewwarren

2,234
22
17

0

votes

1 answer

Reading WindowsMicrophoneStream for Azure Speech SDK on Hololens

I want to perform real-time speech recognition for the Hololens 2 with Unity 2021 and I am using the Microsoft Azure Cognitive Services Speech SDK to do so. Instead of the default Hololens 2 microphone stream, I want to switch to the Stream Category…

c# audio-streaming streamreader hololens azure-speech

asked Jul 11 '22 at 17:39

Leado

25
6

0

votes

1 answer

Azure Text-to-Speech SDK vs API: when to use which?

I’m new to using Azure Cognitive Services (and programming in general). The Azure Text to Speech service is provided through two interfaces: the Speech SDK and the REST API. I was wondering when one should choose to use the SDK or REST API? What are…

azure text-to-speech azure-speech

asked Jul 04 '22 at 22:09

user19482318

1

0

votes

1 answer

commitment tier pricing plan with custom model

Technical question - it's unclear from the Speech to Text pricing page ( https://azure.microsoft.com/en-us/pricing/details/cognitive-services/speech-services/#pricing ) if custom models are included in the commitment tier pricing plan. I would be…

azure azure-speech

asked Jun 07 '22 at 15:40

Ar Y

1
1

0

votes

1 answer

The python error TypeError: argument of type 'NoneType' is not iterable

I am using Azure Speech Recognition services and I'm having trouble creating commands. I'm using interpreter Python 3.10.4 Code: def takeCommand(): speech_config = speechsdk.SpeechConfig(subscription=speech_key, region=service_region) …

python azure azure-speech

asked May 23 '22 at 15:37

MENTOR

1
2

0

votes

0 answers

how to store as a blob the audio recoding from azure API SpeechSynthesizer and play it in angular

We are currently implementing a spring boot API that calls text to speech API SpeechSynthesizer. public PronunciationBlob callAzureToTransformTextToSpeech(String text){ //create a azure speech resource/speech services, and get the key from…

angular spring-boot azure azure-speech azure-text-translation

asked May 12 '22 at 23:34

Fernando

381
1
5
20

0

votes

1 answer

Azure speech to text REST API V3 binary data

I'm trying to use Azure Speech to text service. In the documentation I'm confronted with examples, that use V1 API version: https://$region.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1 And basically every link to…

azure rest text-to-speech azure-speech

asked May 10 '22 at 11:06

Kipras Bielinskas

127
1
10

0

votes

1 answer

Azure Text to Speech Error: 0x38 (SPXERR_AUDIO_SYS_LIBRARY_NOT_FOUND) when deploying to Azure App Services Linux environment

I adapted the quick start code Python version here for Azure text to speech. The app runs correctly on my local server. However, when I deploy the app through Visual Studio Code to Azure App Services, I got the error: 0x38…

python azure-web-app-service azure-speech

asked May 04 '22 at 17:33

Rui

95
1
14

0

votes

1 answer

Microsoft speech service to listen multiple audio

I am developing a project using python in speech recognition. In that, I need to recognize audio from multiple microphones. The application is based on Azure Cognitive Services and the assistant need to handle multiple microphones. I have seen…

python audio-streaming azure-cognitive-services azure-speech

asked Apr 28 '22 at 02:43

user235

1
5

0

votes

1 answer

How to Start Continuous Recognition on a Web Server using 'use_default_microphone' parametre (Azure Cognitive Services Speech )

I am developing a web app (python) in which I use azure cognitive services speech translation. I used PyWebIo to create an interface and start a server to run my code. I am using 'use_default_microphone' parametre in the audio config to speak to…

azure-web-app-service speech-recognition speech-to-text azure-cognitive-services azure-speech

asked Apr 15 '22 at 14:09

Cihan Ünlü

11
3

0

votes

0 answers

java.lang.UnsatisfiedLinkError: 'void com.microsoft.cognitiveservices.speech.SpeechConfig.setTempDirectory(java.lang.String)' - Java CentOS 7

My application is a spring boot java project connecting to the azure services for converting speech to text, analyzing sentiment and translating. I am trying to deploy my application to a centos 7 VM, but I get the error in the title when calling my…

java linux centos7 speech-to-text azure-speech

asked Apr 11 '22 at 11:42

meresipere

3
3

0

votes

1 answer

Microsoft cognitive-services audiooutputconfig device_name

https://learn.microsoft.com/en-us/python/api/azure-cognitiveservices-speech/azure.cognitiveservices.speech.audio.audiooutputconfig?view=azure-python how to specify a device_name I have tried everything from deviceUID,deviceName nothing seems to…

azure azure-speech

asked Mar 18 '22 at 12:04

Steve Belb

1

0

votes

1 answer

How to setup microphone with Azure Speech Services C# SDK on Raspberry Pi 4

I am playing around creating a home assistant using Raspberry Pi 4 and Azure speech services. I have a keyword model setup and listen for the keyword using the following code. using AudioConfig audioConfig =…

c# .net-core raspberry-pi speech-recognition azure-speech

asked Dec 04 '21 at 01:51

ImJhardy

1
1

0

votes

1 answer

Python SpeechRecognition doesn't listen to full audio?

I'm just trying to simply get a transcript from an audio file using python SpeechRecognition. It seems like no matter what pause_threshold I set, or duration or whatever, it always gives me the same exact output, approximately 30 seconds out of 80…

python speech-recognition speech-to-text azure-speech

asked Nov 09 '21 at 18:20

eeveepotato

13
3

0

votes

1 answer

SpeechSynthesizer and SSML

I have been trying to work with the prosody pitch attribute but doesn't seem straightforward or seem to work. I want to create a simple "do re mi" following the g-major scale. The results do not turn out as expected using the various Hz values. …

c# azure-cognitive-services ssml azure-speech speechsynthesizer

asked Oct 30 '21 at 00:33

Ken Netherland

77
1
7

Questions tagged [azure-speech]