SHORT AND SIMPLE: What are the steps that are involved to get an MFCC from an FFT.
DETAILED:
I'm working on a drum application to classify sounds. Its a matching application for the iPhone with the openframeworks library for sound processing, the idea is to return the name of the note that you play on the loud Indian drum (known as the Dhol) - only a few notes are playable.
I've implemented the FFT algorithm and successfully obtain a spectrum. I now want to take it one step further and return the mfcc from the fft.
This is what I understand so far. Its based on linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency.
It uses triangulation to filter out the frequencies and get a desired coefficient. http://instruct1.cit.cornell.edu/courses/ece576/FinalProjects/f2008/pae26_jsc59/pae26_jsc59/images/melfilt.png
So if you have around 1000 values returned from the fft algorithm - the spectrum of the sound, then desirably you'll get around 12 elements (i.e., coefficients). This 12-element vector is used to classify the instrument, including the drum played...
This is all I'm trying to achieve.
Could someone please help me on how to do something like this? Any help would be greatly appreciated. Cheers