Questions tagged [microsoft-speech-api]

The Microsoft Speech API (SAPI) provides a high-level interface between an application and speech engines. SAPI implements all the low-level details needed to control and manage the real-time operations of various speech engines.

The Microsoft Speech API (SAPI) provides a high-level interface between an application and speech engines. SAPI implements all the low-level details needed to control and manage the real-time operations of various speech engines.

The two basic types of SAPI engines are text-to-speech (TTS) systems and speech recognizers. TTS systems synthesize text strings and files into spoken audio using synthetic voices. Speech recognizers convert human spoken audio into readable text strings and files.

API for Text-to-Speech

Applications can control text-to-speech (TTS) using the ISpVoice Component Object Model (COM) interface. Once an application has created an ISpVoice object (see Text-to-Speech Tutorial), the application only needs to call ISpVoice::Speak to generate speech output from some text data.

In addition, the IspVoice interface also provides several methods for changing voice and synthesis properties such as speaking rate ISpVoice::SetRate, output volume ISpVoice::SetVolume and changing the current speaking voice ISpVoice::SetVoice.

API for Speech Recognition

Just as ISpVoice is the main interface for speech synthesis, ISpRecoContext is the main interface for speech recognition. Like the ISpVoice, it is an ISpEventSource, which means that it is the speech application's vehicle for receiving notifications for the requested speech recognition events.

Source:http://msdn.microsoft.com/en-us/library/ee125077(v=vs.85).aspx

82 questions
1
vote
0 answers

Using single letters in Microsoft Speech SDK

When I just used the letters, D got confused with B, and a lot of the letters got confused with other letters, so I replaced any letter that was working incorrectly with a word or name, but as you can see, it is half of them. I was wondering if…
1
vote
0 answers

Bing Speech API and bots Frameworks

I am trying to use Bing's speech API within the Bot Framework (I am familiarizing myself with both of these technologies). Specifically, I am trying to use the DataClientWithIntent that it supports. I was able to look at this example in GitHub, but…
havan
  • 164
  • 2
  • 11
1
vote
0 answers

System.Speech to recognize letters and numbers instead of words or sentences

I am attempting to get basic recognition of letters and numbers with System.Speech rather than words or sentences. I am using the code sample from the top answer on this post. Here are some of the odd results I am seeing: (spoken,…
Roger
  • 2,063
  • 4
  • 32
  • 65
1
vote
1 answer

Free speech recognition for C# WinForm App

I'm trying to find free speech recognition for C# Windows Form Application executable, which can work as Google Speech Recognition, recognize and convert absolutely new words to text. I tried to use System.Speech.Recognition; different ways, but it…
1
vote
0 answers

Not able to run Microsoft Bing Speech Recognition API on iOS device using iOS Client Sample provided by Microsoft

I was trying to explore Microsoft's Bing Speech Recognition API for iOS https://github.com/Microsoft/Cognitive-Speech-STT-iOS. I followed all the steps written in the read me. The app runs and it seems to be detecting the speech from microphone and…
1
vote
1 answer

Microsoft Speech Recognition defaults vs API

So I've been using Microsoft Speech Recognition in Windows 10, doing the training exercises, dictating text into Wordpad and correcting it, adding words to the dictionary and so on. I would like to use the software to transcribe .wav files. It…
1
vote
2 answers

Microsoft SAPI Sub-language issue

My problem is this: My SAPI inproc recognizer refuses to load my grammar file when the windows 10 Display language is set to English(UK). System Display language set to UK. Speech Recognition language is UK. System Locale is UK. SAPI xml-format…
1
vote
1 answer

System requirements for System.speech application

I am making a dumb little project for my college degree using the microsoft System.speech recognition engine. I wanted to know what were the minimum requirements for such a software to run. Thanks.
1
vote
0 answers

How to set the input device for voice recognition

Is there a possibility to set another microphone input than the systems default for recognizing speech with either the System.Speech or Microsoft.Speech on Windows 8 and Windows Server 2012 machines? I surfed the docs and I have not found an obvious…
HCL
  • 36,053
  • 27
  • 163
  • 213
1
vote
2 answers

Matching wildcard/dictation in Microsoft Speech Grammar

I'm using Microsoft Speech API to load a grxml grammar: Grammar grammar = new Grammar(file); grammar.Enabled = true; SpeechRecognitionEngine sre = GetEngine(); sre.LoadGrammarAsync(grammar); Based on MSDN I can not find tag to match a wildcard /…
Jean-Philippe Encausse
  • 1,491
  • 2
  • 22
  • 40
1
vote
1 answer

Can't load external ruleref Microsoft Speech Grammar

I'm using Microsoft Speech API to perform Voice Recognition. Everything was fine until I use an external ruleref (according to http://msdn.microsoft.com/en-us/library/hh361619)
Jean-Philippe Encausse
  • 1,491
  • 2
  • 22
  • 40
0
votes
0 answers

How to implement word-by-word display with Microsoft Text to Speech?

I am building an app with React that uses Microsoft Speech to handle Text to Speech (TTS) tasks. In the app there is a process that fetches the response from ChatGPT as a stream then feed each complete sentence into the TTS queue. There is a text…
Tsuu
  • 96
  • 1
  • 6
0
votes
0 answers

Is there an API to invoke Windows Dictation Commands programmatically?

Windows 10 provides support for Dictation and modifying the dictated text using voice commands like "Delete word", "New line"…
Kiran Kumar
  • 51
  • 1
  • 5
0
votes
0 answers

Why Microsoft Cognitive Speech Api is sending "Bad Request" even with all the requirements fulfilled

I have been trying to implement Microsoft Cognitive Speech-to-text API to convert an audio file to text but always I am getting a Bad Request message. Here is sample code I am implementing: public static object MC() { HttpClient client = new…
0
votes
0 answers

How to configure the webhook properly so that i receive notifications when the transcription is completed

i tried to configure a webhook url using microsoft azure speech to text api v 3.1 so that i receive a notifications when a transcription is completed . I faced an issue when triying to create the webhook url, when searching on some other links and…