Questions tagged [voice]

A voice in programming may refer to voice synthesis (e.g. transforming text to speech) and voice recognition (e.g. transforming speech to text, voice commands).

A voice in programming may refer to voice synthesis (e.g. transforming text to speech) and voice recognition (e.g. transforming speech to text, voice commands).

Despite being an rather old engineering task (with some commercial products available in late 1990-s) it's still considered advanced and challenging technology and it's very far from perfection.

1122 questions
7
votes
1 answer

Voice commands for Huawei GSM/UMTS usb dongles?

I am trying to find out how to use voice functions with a HUAWEI E173 dongle in linux. My dongle is voice enabled (voice works with the windows application). I can check the voice activation in linux using the AT^CVOICE? command. This tells me how…
callegar
  • 223
  • 1
  • 2
  • 7
7
votes
4 answers

C++ API for "Text To speech" and "Voice to Text"

I would like to know whether there is a good API for "voice recognition" and "text to speech" in C++. I have gone through Festival, which you can't even say whether the computer is talking because it is so real and voce as well. Unfortunately…
Soldier
  • 539
  • 6
  • 11
  • 18
6
votes
3 answers

PlatformNotSupportedException Using .NET Speech Recognition

So I'm trying voice recognition for C#, I'm using System.Speech.Recognition, and, I was searching around on the internet, trying out several pieces of code for some basic speech recognition, the best one I could find was this: using System; using…
Stan
  • 3,659
  • 14
  • 35
  • 42
6
votes
1 answer

Android TTS Male Female Voice Change

I am working with TTS. But i need to change the voice of the tts from female to male and vice versa. I know i can do it by eSpeak. But problem is that. In the setting page when i select the eSpeak TTS the voice changes to male and when i choose the…
Debarati
  • 3,296
  • 2
  • 18
  • 30
6
votes
2 answers

How can I control my application with built-in voice control?

Let's say I develop audio player for iPod Touch 4/iPhone. Currently it can be controlled with standard panel (double click Home and scroll left). But how can I control that with built-in voice control (long click on Home)?
LA_
  • 19,823
  • 58
  • 172
  • 308
6
votes
1 answer

How to detect language automatically with Speech Recognition in Python

I am developing an app and I want to detect the language automatically and then print it. My code: with sr.Microphone() as source: audio = r.listen(source) try: # Auto detect the language print("You said: " +…
user13966397
6
votes
0 answers

Controlling local user voice volume in Discord with setLocalVolume

I'm trying to programmatically control the relative local volume of users' voices in a Discord channel from a web app. Discord's Game SDK has a documented API call to control a user's local volume with VoiceManager.setLocalVolume(userId, level) from…
Petrus Theron
  • 27,855
  • 36
  • 153
  • 287
6
votes
1 answer

Availability of installed voices for use by AVSpeechSynthesis in iOS

I would like to be able to test which text-to-speech voices are available for my iOS app to use with AVSpeechSynthesis. It is easy to generate a list of the installed voices, but Apple makes some of them are off-limits for use by apps, and I would…
Anton
  • 2,512
  • 2
  • 20
  • 36
6
votes
3 answers

How to check if bot is connected to a channel? | discord.py

I've decided to try making my discord bot play music, but I've gotten stuck already. Mainly due to the fact I can't find any sources to help with the current version, I've been winging everything from the docs. However, I can't figure out how to…
xupaii
  • 465
  • 4
  • 15
  • 31
6
votes
2 answers

Why do MFCC extraction libs return different values?

I am extracting the MFCC features using two different libraries: The python_speech_features lib The BOB lib However the output of the two is different and even the shapes are not the same. Is that normal? or is there a parameter that I am…
SuperKogito
  • 2,998
  • 3
  • 16
  • 37
6
votes
2 answers

How to record a google assistant demo?

I just built a voice conversational app that runs on google assistant. However, I am not able to figure out how to record a demo of this app for sharing with other people. I am using iPhone and Mac. Video recording part works just fine, but I am not…
rahulmohan
  • 1,285
  • 11
  • 19
6
votes
0 answers

Voice to Text API with Language Detection or Confidence Rate?

I am trying to develop an application that needs to detect one of two possible languages from an audio stream and transform the audio to text. Most Voice to Text APIs require specifying the language before detecting the text. Google Translate…
Juanvulcano
  • 1,354
  • 3
  • 26
  • 44
6
votes
1 answer

play raw audio file in python in realtime

I have a udp server in python that continuously receives voice packets from a client in raw format, array of bytes. How can I play the voice on the server side in real time? Any recommended libraries or ways to do it? Here is my very simple server…
A_Matar
  • 2,210
  • 3
  • 31
  • 53
6
votes
2 answers

Offline Speech Recognition in Android

I searched a lot on StackOverFlow for this Problem but the Threads are older than 3 years old. I implemented the Google Voice Recognition which requires a Internet Connection. Searching how i can use the Offline Voice Recognition brought no…
korunos
  • 768
  • 3
  • 11
  • 31
6
votes
3 answers

Simple Voice Recognition Swift?

In my app when the user says a certain word or words the app will answer by reading out a sentence which is set in app. What is the simplest way of doing this? I'm using the latest version of Xcode (Its for my WWDC 2015 scholarship app)
Junaid
  • 321
  • 1
  • 4
  • 15