Questions tagged [speech-synthesis]

Speech synthesis is the artificial production of human speech.

Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.

Synthesized speech can be created by concatenating pieces of recorded speech that are stored in a database. Systems differ in the size of the stored speech units; a system that stores phones or diphones provides the largest output range, but may lack clarity. For specific usage domains, the storage of entire words or sentences allows for high-quality output. Alternatively, a synthesizer can incorporate a model of the vocal tract and other human voice characteristics to create a completely "synthetic" voice output.

The quality of a speech synthesizer is judged by its similarity to the human voice and by its ability to be understood. An intelligible text-to-speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer. Many computer operating systems have included speech synthesizers since the early 1990s.

For more info, take a look at: http://en.wikipedia.org/wiki/Speech_synthesis

442 questions
0
votes
1 answer

Is WorldBet a superset of Arpabet?

Does WorldBet contain all of the phonemes that Arpabet has? And do they mean the same thing in WorldBet? E.g. "AO" code in WorldBet is pronounced the same as "AO" in Arpabet, and so on for every phoneme in Arpabet.
Erik Hermansen
  • 2,200
  • 3
  • 21
  • 41
0
votes
1 answer

Cortana integration faliure

I am making a Windows 8.1 speech based application. My problem is that when I give Cortana the input it launches my app and the app closes at the splashscreen, but when I run my app in background (minimize the app) or when the app is running, the…
0
votes
0 answers

Two-dimensional AR modelling and linear predication with MATLAB

I'm working on two-dimensional AR modelling and using the algorithms from Matlab. In matlab there is a commend LPC which can compute the LP coefficients. How can I integrated to two dimension for image ? The matlab code for one dimension signal is…
amir nemat
  • 105
  • 1
  • 13
0
votes
2 answers

C# System.speech.synthesis post processing

Is there any way to apply post processing effects(change speed, pitch, volume) of the output from system.speech.synthesis in C#. To be more clear I am calling system.speech.synthesis.SpeechSynthisizer.Speak(String); and I want to edit the output.…
ProWolf
  • 196
  • 1
  • 11
0
votes
1 answer

Is there a way to display an audio wave as Windows Speech Synthesizer speaks?

I'm making a program that uses Windows Speech Recognition to listen out for commands and I am using the Speech Synthesizer to provide real-time feedback. I was wondering whether it would be possible to use the result from the synthesizer to create…
Luke Wood
  • 19
  • 1
0
votes
0 answers

How Do I Get Microsoft Speech Engine To Stop Instead Of Pause Without Crashing?

I'm working on a text to speech app using the Microsoft Speech Engine and SAPI voices and for the most part, it's working the way it should. First, here's my using statements: using System; using System.Globalization; using…
manicdrummer
  • 161
  • 3
  • 14
0
votes
1 answer

Resizing Text while SpeechSynthesizer is speaking C#

I am wondering if anyone can tell me how to basically resize a label or image when the .NET Microsoft Speech Synthesizer is speaking. The goal is to make the text in the center bigger and smaller as the synthesizer speaks. so when there are high…
Andy
  • 1
0
votes
1 answer

Mary TTS with HTML/JavaScript and PHP

I installed Mary TTS (version 5.1.2) on my Windows (and Linux computers). I started the Mary TTS server and the Mary TTS client, and I did some trials with text to audio conversion in the GUI window (its great). I would like to use Mary TTS on my…
blsn
  • 1,077
  • 3
  • 18
  • 38
0
votes
1 answer

Recognition reads twice after stopping it and resuming it again?

I made a speech handler object and when I switch on the speech recognition with a button in the first time and test it, it responds just fine. but when I stop the speech recognition with a command and switch it again with the button it repeats the…
Hakar
  • 29
  • 6
0
votes
1 answer

HTML5 Speech Synthesis API - Only able to use the "native" voice

I have a web application that makes use of the HTML5 speech synthesis API and it works - but only with the native voice. Here's my code: var msg = new SpeechSynthesisUtterance(); var voices; window.speechSynthesis.onvoiceschanged = function() { …
Sean Morris
  • 135
  • 2
  • 10
0
votes
2 answers

Forced to browser-sniff Safari version

Can someone tell me the "best" way to detect Safari and then its version number? Look, I don't want to do it but I can't find a way to pin down a WORKING version of speechSynthesis on Safari See below for an example that works on iOS 7.1.2 IF you…
Richard Maher
  • 41
  • 1
  • 9
0
votes
1 answer

Problems with events of speechSynthesis

I am working on a small app using SpeechSynthesisUtterance API. I try to launch a small function betweens words (when they are enunciated) : I use boundary event : myWords.onboundary = function() { ... }; But it doesn't…
0
votes
1 answer

System.Speech.Synthesis.SpeechSynthesizer Handling Of Numbers

Sometimes I have a number like #12543 and I want the synthesizer to say "Number one-two-five-four-three". Other times I would like the synthesizer to say "Number twelve-thousand-five-hundred-fourty-three". Does anybody here know what mechanism in…
Fred Chateau
  • 869
  • 1
  • 6
  • 16
0
votes
1 answer

What is the purpose of speaker adaptive training and speaker dependent training?

I'm trying to create a TTS engine for Indian Accented English (not any Indian language). I already have a voice recordings database for Indian accented English. So what are the next steps ? I think we need to label them with the ".lab" extension…
0
votes
1 answer

How to do server side text to speech?

I am building an Android and iOS app which reads jokes to users. For this, I want to do server side TTS, so that the app works in poor connectivity areas (by caching the voice content) and so that I can save on TTS calls (as the jokes will be the…
Adi
  • 23
  • 3