I'm testing google Speech-to-Text API with streaming audio as well as with wav files. I'm using audio from telephony: 8000 sample rate, 8bits, mulaw encoding. The Google configuration is set appropriately.
When I test it with normal sequences, it returns a correct transcription. However when I say a single word (especially a number), I'm very often obtaining no response from the api -> as if it would be no input. This occurrence happens for both streaming as well as batch transcription.
does anybody know why is this happening? how to fix it?