This question was asked several years ago, but hopefully things have changed...
Could someone point me to any details about which versions of Android support what subset of SSML in its Text-To-Speech engine?
I did some experiments using Flutter-TTS, which is just a layer that passes the text to be spoken to the underlying platform TTS service.
Some unknown subset of SSML does work on a newer Android - for example, this SSML
<speak>before<break time="5s"/>after</speak>
does indeed produce a five-second pause between the words on API 27 and API 29. It does not work on API 21, but at least it handles it gracefully by just ignoring all tags. I have not tested other API levels yet. I also tried prosody, phoneme, and lang tags - they seem to be not working.