Questions tagged [azure-speech]

Azure speech is a cloud based speech to text service in the Microsoft Azure family of machine learning and artificial intelligence APIs

140 questions
1
vote
1 answer

Cannot Recognize Speech Error: Error occurred while processing 'audio.wav'. Invalid WAV header in file, RIFF was not found

I captured one voice message from WhatsApp and it saved as wav file using node js. But I pass this wav file for speech translation using azure speech to text JavaScript sdk it not showing anything and also try this file into demo app also, in azure…
1
vote
2 answers

save microphone audio input when using azure speech to text

I'm currently using Azure speech to text in my project. It is recognizing speech input directly from microphone (which is what I want) and saving the text output, but I'm also interested in saving that audio input so that I can listen to it later…
1
vote
1 answer

List of Azure speech dictation words per language

When using speech to text in Azure with dictation mode ON it recognizes words like "question mark" and returns "?". We found other words like this and were looking for complete list but were not able to find it in the documentation…
1
vote
0 answers

where do I get the secret property for creating webhook for azure speech to text service

If you look at the properties key,secret value is SecretUsedToCreateHMACHexDigestThatYouWillFindIn where do i get this value from ? { "displayName": "TranscriptionCompletionWebHook", "properties": { "secret":…
1
vote
2 answers

Using Azure speech Text-To-Speech in Brwoser

I am looking out for examples on using Azure speech from within the browser. I can see examples for Node.js but not for using Text-To-speech in the browser.
NSS
  • 1,835
  • 2
  • 29
  • 66
1
vote
2 answers

Azure diarization (speaker separation) not working

I'm able to run this piece of code and get transcriptions of audio files. But it does not distinguish between speakers (results always show "speaker 1", "speaker 2" is never…
1
vote
1 answer

How to save converted text data from azure cognitive services?

I'm currently having a hard time saving the result of the speech to text from Microsoft azure cognitive service API. This is my code. import azure.cognitiveservices.speech as speechsdk import time # Creates an instance of a speech config with…
hojin
  • 13
  • 2
1
vote
1 answer

How to pass audio buffer to speech to text service using python

I am using azure speech to text service using python to process bunch of audios. In order to process the audios, These are the steps performed- Download audio from web server to local 'C:/audio' drive. Pass the path of downloaded audio to Speech…
user1990
  • 37
  • 4
1
vote
1 answer

How to create WAV stream in JavaScript with help of RecordRTC?

I'm trying to create an application to transcribe a streaming audio recording. The idea is to capture the user's microphone stream using RecordRTC and send it in chunks to a gunicorn server using Socket.IO. The server will then create an input…
Frank
  • 93
  • 9
1
vote
1 answer

Azure Speech Translation: How to activate the recognition only when speech is detected?

I’m currently working on a live translation web app allowing multiple participants to use the Azure Speech Translation and share their transcriptions in multiple languages. I don’t want to be billed for the number of participants X the duration of a…
1
vote
1 answer

Cannot upload speech dataset because "Failed"

So I am trying to upload a dataset to the microsoft cognitive services speech portal for custom models. I have been doing this for about a year without issue, however now I am getting "Failed" with the detail "Failed to upload data. Please check…
1
vote
1 answer

What is the azure cognitive services speech to text webhook callback IPs to whitelist?

I need to use azure cognitive services (speech to text) behind a corporate firewall. The speech to text batch processing has a callback from azure notifying once the process is…
hre
  • 11
  • 2
1
vote
1 answer

Azure Speech SDK: createPushStream from GetUserMedia in javascript

I am trying to find examples on how to use getUserMedia stream object to createPushStream with the Azure Speech SDK. Note that I intend to run the code in Safari, so the use of MediaRecorder is not possible. The intent is to use getUserMedia stream…
1
vote
2 answers

Custom Keyword continuous recognition option using Azure Cognitive Speech Services sdk for Javascript

Using the Custom Keyword Recognizer provided by Microsoft Cognitive Speech Services, I would like to setup continuous recognition on the browser using the microsoft-cognitiveservices-speech-sdk npm package. Is there a way to setup continuous keyword…
1
vote
0 answers

Unwanted insertion in recognized text

Is there a way to remove an insertion in result in my custom speech project in custom speech studio? My audio is in Portuguese and my transcript too. The test result after training has inserted an unwanted word. This unwanted word will harm the…
1 2
3
9 10