0

I am trying to get individual phoneme sounds from Google Text to Speech. I'm getting the individual phonetic spellings and corresponding IPA from dictionary.com.

For Ex) pizza (peet-suh) IPA: ('pit sə)

I'm using Python to get the response from the API and the ssml is,

<phoneme alphabet="ipa" ph="sə">suh</phoneme>

However, the audio comes out sounding 'sooh'.

The code for the request is as follows,

text='<phoneme alphabet="ipa" ph="sə">suh</phoneme>' synthesis_input = texttospeech_v1beta1.SynthesisInput(ssml=text) response = client.synthesize_speech(input = synthesis_input, voice = speaker, audio_config = config)

The client is also instantiated as

client = texttospeech_v1beta1.TextToSpeechClient()

It would be great if I could get some help with this.

0 Answers0