I want to create a model to recognize Arabic Letters. I know how to create the language model and the dictionary files, but I am stuck at the acoustic model. I record wav files for each letter, but during the training, it says that the training hours are too small although the training continues. When I try to use it, the model doesn't recognize anything(giving null).
I want to know how should I record the wav files, should I keep repeating, for example, the letter Alif like 100 times in one wav file or should I just record multiple wav files of the same letter.
Your help is highly appreciated.