Questions tagged [text-to-speech]

Text-to-speech or speech synthesis is the artificial production of human speech.

A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware. A text-to-speech (TTS) system converts normal language text into speech.

3705 questions
8
votes
2 answers

What is an Earcon?

I just stumbled upon the method TextToSpeech.addEarcon(String, String) in Android's TTS API. I actually couldn't find more information on the net, so does anyone know what exactly an earcon is? Thanks!
Force
  • 6,312
  • 7
  • 54
  • 85
8
votes
3 answers

TTS output always going to A2DP

My Android tutorial states that I can explicitly tell the TTS engine which stream to use: For music playback: params.put(TextToSpeech.Engine.KEY_PARAM_STREAM, String.valueOf(AudioManager.STREAM_MUSIC)); And for phone…
an00b
  • 11,338
  • 13
  • 64
  • 101
8
votes
1 answer

AWS TTS Polly -> How to check current usage (number of chars already used)?

I want to know how I can check my current usage of Amazon Polly. I'm currently on the free tier, so I could not find any information in billing, since I still haven't surpassed the 3 million characters per month. But I wanted to have control of how…
8
votes
2 answers

Playing remote audio (from Google Translate) in HTML5 on a server

I'm trying to use text-to-speech on a website using HTML5 and Google Translate. Getting speech from Google is as easy as a GET request to: http://translate.google.com/translate_tts?tl=en&q=hello In order to play that file I'm using the…
TomTasche
  • 5,448
  • 7
  • 41
  • 67
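
As an illustration of the GET request described in the question above, here is a minimal Python sketch that only builds the request URL. The endpoint and the `tl` and `q` parameters come from the question; the helper name `build_tts_url` is hypothetical, and the sketch does not assume the endpoint still answers such requests.

```python
from urllib.parse import urlencode

# Endpoint as quoted in the question; its current availability is not assumed.
TTS_ENDPOINT = "http://translate.google.com/translate_tts"


def build_tts_url(text: str, lang: str = "en") -> str:
    """Return the GET URL for `text` in language `lang` (hypothetical helper)."""
    # urlencode escapes the values for use in a query string (spaces become '+').
    query = urlencode({"tl": lang, "q": text})
    return f"{TTS_ENDPOINT}?{query}"


if __name__ == "__main__":
    print(build_tts_url("hello"))
```

A URL built this way could then be set as the `src` of an HTML5 `<audio>` element, which is the playback approach the question title refers to.
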
8
votes
2 answers

How to use Phonetic or Phoneme pronunciation in Google text to speech?

I have been trying for a while to get Phonetic or Phoneme pronunciation working with Google's text to speech but have not managed to get it performing consistently. I have managed to get limited results from using https://tophonetics.com/ It…
Hellonearthis
  • 1,664
  • 1
  • 18
  • 26
8
votes
5 answers

gTTS direct output

I want to deliver a chatbot's response as both audio and text. All the example code using gTTS seems to require that one 'save the text into a file then play the file'. Is there another way to simplify the process, such as to play the 'response from chatbot'…
June Wang
  • 499
  • 2
  • 6
  • 20
8
votes
3 answers

Is it possible to select the word that is being read while using the SpeechSynthesisUtterance API?

Is it possible to select the word that is being read while using the SpeechSynthesisUtterance API? Is there an event I can use to get the current spoken word and cursor position? Here is what I have so far: var msg = new…
1.21 gigawatts
  • 16,517
  • 32
  • 123
  • 231
8
votes
1 answer

Swift - How can I convert Saved Audio file conversations to Text?

I work on speech recognition. I have handled text-to-speech and speech-to-text with the iOS frameworks, but now I want to convert saved audio files of conversations to text. How can I do this? Thank you for all replies.
Ali Ihsan URAL
  • 1,894
  • 1
  • 20
  • 43
8
votes
1 answer

Android WebView (WebChromeClient) Speech Synthesis Undefined

I'm trying to use Web Speech Synthesis API in an Android WebView that I've set as a WebChromeClient. However when I inspect the page and run the following command console.log('speechSynthesis' in window, 'webkitSpeechRecognition' in window,…
Strikeforce
  • 361
  • 4
  • 15
8
votes
6 answers

How to change the voice in pyttsx3?

This code is working, but I'm only able to switch between the voices that came preinstalled in Microsoft Windows. These voices are "Microsoft David Mobile" and "Microsoft Zira Mobile". Later I installed "Microsoft Kalpana Mobile" and set it as the…
8
votes
1 answer

Does the TTS API work without a network connection on Android?

Hello, I'm a student at an engineering school, and I was wondering whether TTS (Text to Speech) works in an offline environment on Android.
John_potato
  • 105
  • 1
  • 4
8
votes
4 answers

How to change voice in Speech Synthesis?

I am trying out a simple example with SpeechSynthesis.