0

Im working on cmusphinx speech to text, I need to train/add words to my dictionary, so i used lmtool and uploaded a corpus file and used the .dict and .lm file and used these as parameters for pocketsphinx and it worked. Im wondering how to add these files to default files. i.e i want to add the new words .dict and .lm files to /edu/cmu/sphinx/models/en-us/cmudict-en-us.dict and /edu/cmu/sphinx/models/en-us/en-us.lm.bin

Im not sure, if this is feasible and im wondering how to combine dictionaries into single one. I found this link but not sure how to achieve the same.

When i use the TranscriberDemo.java my wav file has different words and the output prints different. how to improve the accuracy ?

Community
  • 1
  • 1
optimus
  • 729
  • 2
  • 12
  • 36

1 Answers1

0

Dictionary and language model extension is covered in the following part of tutorial

http://cmusphinx.sourceforge.net/wiki/tutoriallmadvanced

Nikolay Shmyrev
  • 24,897
  • 5
  • 43
  • 87
  • thanks for the response, I went through the link, as a starter, im not able to understand things like mix, pruning, is there is any guidance/steps provided in documentation combine multiple dictionary files to Sphinx, sry for my ignorance. – optimus Jul 14 '16 at 13:16