0

I am using Ubuntu 12.04, Python 2.7 & PocketSphinx.

I made a custom dictionary, language model using online LM tool. Using pocketsphinx_continous to decode the spoken voice gives me 100% accuracy. But using PyAudioto record sound in Python recognises the text but adds 'A' and 'AND' with the main context as shown in the images below PocketSphinx_Continuous PocketSphinx in Python How to cure it?

VeilEclipse
  • 2,766
  • 9
  • 35
  • 53

1 Answers1

2

The reason for the reduced accuracy is that you've added an artifical zero-silenced region around the utterance and that corrupts spectrum and reduces accuracy. Stop doing that. Instead, just recognize the sound you have recorded.

If you still need to decode zero-silenced regions, you need to add

 dither="yes"

option in decoder arguments in order to let decoder work around them. Once you add this option result will be accurate.

Nikolay Shmyrev
  • 24,897
  • 5
  • 43
  • 87