I am currently developing a speech recognition service using the Google Speech API (Python).
There is no sound other than the voice of the voice actor in the Korean listening evaluation mp3 file which is being used as the sample now.
I am currently using long_running_recognize after converting my mp3 file to FLAC and uploading it to Google Storage, but the accuracy of the file is only 60% for 2 minutes.
I think i used the most intuitive data as a sample and I want to know if the length of the file affects the rate of recognition and if you can improve performance.