I am synthesising speech using the Google cloud APIs. I have the following information about the speech synthesise response.
- Sample rate: 8000 Hz
- Audio format: MP3
- Length of the byte array
The response from the API is a byte array. Given this information, how could I approximate or accurately compute the length of the synthesised audio?