Questions tagged [speech-synthesis]

Speech synthesis is the artificial production of human speech.

Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.

Synthesized speech can be created by concatenating pieces of recorded speech that are stored in a database. Systems differ in the size of the stored speech units; a system that stores phones or diphones provides the largest output range, but may lack clarity. For specific usage domains, the storage of entire words or sentences allows for high-quality output. Alternatively, a synthesizer can incorporate a model of the vocal tract and other human voice characteristics to create a completely "synthetic" voice output.

The quality of a speech synthesizer is judged by its similarity to the human voice and by its ability to be understood. An intelligible text-to-speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer. Many computer operating systems have included speech synthesizers since the early 1990s.

For more info, take a look at: http://en.wikipedia.org/wiki/Speech_synthesis

442 questions
8
votes
1 answer

TTS to Stream with SpeechAudioFormatInfo using SpeechSynthesizer

I am using System.Speech.Synthesis.SpeechSynthesizer to convert text to speech. And due to Microsoft's anemic documentation (see my link, there's no remarks or code examples) I'm having trouble making heads or tails of the difference between two…
8
votes
4 answers

How to insert pause in speech synthesis with grammatical hints

I am writing a simple spelling test app using the HTML5 SpeechSynthesis API. The text I would like my app to say is something like the following: "The spelling word is Cat. The cat chased the dog.". The API tends to race without much of a pause…
Bob Woodley
  • 1,246
  • 15
  • 30
8
votes
0 answers

Synchronize video subtitle with text-to-speech voice

I try to create a video of a text in which the text is narrated by text-to-speech. To create the video file, I use the VideoFileWriter of Aforge.Net as the following: VideoWriter = new VideoFileWriter(); VideoWriter.Open(CurVideoFile,…
Ahmad
  • 8,811
  • 11
  • 76
  • 141
8
votes
2 answers

JS Speech Synthesis Issue on iOS

I recently implemented a basic web app which relied on Google's TTS URL to generate clear MP3 files for playback on the front end. This has since been subject to an additional security check, meaning I have had to update the code base to use…
Bob-ob
  • 1,560
  • 4
  • 18
  • 34
8
votes
1 answer

Adding a new Language to SpeechSynthesizer

So I'am trying to add a new language, spesifically norwegian, to SpeechSynthesizer, but it doesn't seem to get installed. Found this: Add another voice into .NET Speech (But here the problem is that Czech isn't supported) I have installed the…
Moddaman
  • 2,538
  • 3
  • 23
  • 41
8
votes
3 answers

Memory leak in .Net Speech.Synthesizer?

I found a continuous leakage in my application. After examining using a memory profiler, I found the course is some object from Microsoft Speech.Synthesizer So I build up a toy project to verify the hypothesis: //Toy example to show memory leak in …
JXITC
  • 1,110
  • 1
  • 13
  • 27
7
votes
1 answer

Why can't I control the Apple macOS Speech Synthesis audio unit with slider values?

I'm working to incorporate Apple speech synthesis audio unit stuff (works only on macOS, not iOS) into AudioKit and I've built a AKSpeechSynthesizer Class (initially created by wangchou in this pull request) and a demo project both available on the…
Aurelius Prochazka
  • 4,510
  • 2
  • 11
  • 34
7
votes
2 answers

C# Speech Recognition

I am making a Smart House Control System right now, and I have a little problem. I was thinking on using Cosmos for a base system, and adding the needed namespace libraries to it, but as the usual System.Speech.Recognition namespace depends too much…
fonix232
  • 2,132
  • 6
  • 39
  • 69
7
votes
1 answer

How to access audio result from Speech Synthesis API?

The Speech Synthesis API allows text-to-speech functionality in Chrome Beta. However, results from TTS requests are automatically played by the browser. How do I access the audio results for post-processing and disable the default behavior of the…
7
votes
3 answers

what text to speech and speech recognition libraries are available for Clojure?

what text to speech and speech recognition libraries are available for Clojure? So far I have found https://github.com/klutometis/speech-recognition https://github.com/klutometis/speech-synthesis both of these use Google and thus depend of the…
6
votes
2 answers

Resuming a paused speech using speechSynthesis.resume() on chrome android doesn't work

using the browser api speechSynthesis.resume() I'm trying to resume a paused speech on android chrome I've tested the code below on chrome desktop version 78.0.3904.97 on mac os mojave and it resumes a speech without any problem after the speech is…
6
votes
1 answer

Change the culture of voice to other languages

I have an some text other than English culture. Ex: Tamil culture. If I don't mention culture, default English will be taken. How to convert the text into voice (other than English)? Code snippet: For English Language: public static void…
Pandi
  • 471
  • 3
  • 17
6
votes
4 answers

JS SpeechSynthesis Problems with the cancel() Method

i want to use the cancel Method of window.SpeechSynthesis in Chrome, to cut off an Utterance and start a new one (so you dont have to hear all utterances which are still in queue) var test = new…
Erik
  • 69
  • 1
  • 4
6
votes
1 answer

Speech synthesis in chrome for android not loading voices

I have an script correctly running on chrome for windows, but when I try it on my android chrome, it doesn't works. It seems to be a problem of the voices, since the : if ('speechSynthesis' in window) returns true. I've tried some online examples…
user3192301
  • 61
  • 2
  • 4
6
votes
1 answer

Preloading Web Speech API before calling speak

So I've noticed that after you do the first speak using speechSynthesis.speak, it speeds up dramatically in providing results. So my aim below was to speed it up by pre-initializing the synthesis so when we call speakIt() we don't have to wait for…
1 2
3
29 30