I am working on converting a speech recognition project from MATLAB to Java code. I have been able to read the .wav files (as vectors of values in the range -1 to 1) using the java example provided here. This works exactly as the wavread function in MATLAB.
My next task is to extract the MFCC feature vectors from the original sample vectors. In MATLAB, I used Voicebox to achieve this easily but have been unable to find a Java equivalent. Using Voicebox, I have a code like:
a = melcepst(samples(1,:), 44100)
where each row of 'samples' contains the vector representing each .wav sample. The method returns a 2-dimensional matrix of the MFCC features for each sample.
I have seen Sphinx, but have been unable to understand how to use it for this task. Any help with using Sphinx or any other java solution will be appreciated.