I'd recommend the python module pyttsx3
for text to speech. You can just pass a string through to the engine and will use a voice (which you can change) to speak it out in one of those robotic voices.
For example:
import pyttsx3
engine = pyttsx3.init()
engine.say("I will speak this text")
engine.runAndWait()
If you are looking merely for being able to run .wav, .mpf, or other sound files such as songs I'd recommend checking out the playsound
module
Here's an example of that below:
from playsound import playsound
playsound('audio.mp3')