Questions tagged [ssml]

Speech Synthesis Markup Language(SSML) is a web standard for generating synthetic speech for voice-impaired users or device audio output.

SSML can be used to create software which integrates with voice services such as Skype to provide text-to-speech capabilities.

164 questions
2
votes
0 answers

Which SSML Tags are supported by Android TTS default engine

I have searched a lot unable to find any solid solution for, which are valid ssml tags supported by Android TTS default engine. could any one please update me. Will these tags support on all the android devices.
Manmohan Soni
  • 6,472
  • 2
  • 23
  • 29
1
vote
1 answer

My Azure Text-to-speech app no longer outputs once i added the ssml string

I created a text-to-speech script using Microsoft Azure. Today I decided i wanted to add a pitch changer, speaking rate changer and possibly some silence additions. For that, i need to replace my speak_text_async(text) with…
1
vote
1 answer

Problem with google cloud text to speech using python

Here are four versions of input texts to google cloud text to speech: Version 1 (This one works fine)
user3567761
  • 145
  • 1
  • 2
  • 9
1
vote
1 answer

Save files to a new folder with python

This code saves text files from a data frame of sentences, then saves each one as a ssml file. How can I get the sentences to be saved in a new folder? max = len(sentences) for i in range(0,max): txt = sentences[i] new_txt = starter + txt +…
aliencode
  • 49
  • 4
1
vote
2 answers

How can I use goquery parse single element

aaaaaaaa bbbb.
ccccccdddddddd
I use this code to parse and get: doc, err :=…
1
vote
1 answer

Unreliable timepoints with google text to speech v1beta1 (russian)

I send sixteen sentences written in Russian to the beta version of Google Cloud for synthesizing (one request per sentence). This is what the response object looks like : response = client.synthesize_speech( …
1
vote
1 answer

Using SSML and Python with Azure Voice

I'm creating a project that uses Azure's voice to text to speak a string back to the user. I want to change the voice gender and style with SSML, but Python does not seem to really support all the symbols needed in the string. I can't find any…
1
vote
0 answers

SSML: Play audio

I am using Android Studio and Google Cloud Text-To-Speech. In SSML, I would like to play audio (see the Google documentation). I know that I could play that audio file with Java only but I would like to know why it is not working with SSML. This is…
Reza Saadati
  • 5,018
  • 4
  • 27
  • 64
1
vote
0 answers

SetOutputToDefaultAudioDevice() gets NullException

i've wanted to try out the SSML but everytime i try to run my code i get a null exception at SetOutputToDefaultAudioDevice() . I am using the Example Code from Microsoft: using System; using System.Speech.Synthesis; namespace SSML_Test { class…
benni1913
  • 11
  • 2
1
vote
1 answer

Phoneme from Jsonstream unrecognized in c#

I have a little problem with my JsonStream in C# I am reading a Json Blob in my Storage with this inside: {"id":"275177", "fremdwort":"1.FFC-Frankfurt", "ipa":"ʹeːɐ̯stɐ ɛf ɛf ʦeː ʹfraŋkfʊrt"} in C#: while (Jsonreader.Read()) …
smotorious
  • 45
  • 4
1
vote
2 answers

IBM Cloud - How to adjust speaking rate in Watson TTS using curl POST?

I'm having issues trying to adjust the prosody speaking rate in IBM Watson's TTS Service using curl. Here is the code I've tried, it does synthesize audio but just completely ignores the --header "prosody rate: +50%" ^ line I inserted which was to…
Bloggy
  • 109
  • 12
1
vote
2 answers

Is there a way to make Google Text to Speech, speak text for a desired duration?

I went through the documentation of Google Text to Speech SSML. https://developers.google.com/assistant/actions/reference/ssml#prosody So there is a tag called which as per the documentation of W3 Specification can accept an attribute…
1
vote
1 answer

Audio file not longer than 240 seconds in Alexa

I am an Alexa skill newbie and would appreciate help from the community. I am trying to figure why I keep getting "Error retrieving text to speech: The audio file cannot be longer than 240 seconds" but the audio file in question is less than 240…
matrix15
  • 13
  • 2
1
vote
1 answer

Detect if browser supports Speech Synthesis Markup Language

On Chrome linux code such as the following speak('Intro the rest.'); has the TTS engine reading out the xml stuff. On Android browsers it understands it and introduces a break. I don't want…
havlock
  • 662
  • 9
  • 16
1
vote
1 answer

Does Google's WaveNet support phonetic input (SSML phoneme elements)?

I am working with a product that uses phonetic input to make TTS generate proper pronunciations for names. I don't see phoneme tags in Google's WaveNet TTS documentation https://cloud.google.com/text-to-speech/docs/ssml, but perhaps I'm missing…
murspieg
  • 144
  • 2
  • 14