I am trying to extract pitch features from an audio file to use in a classification problem. I am using Python (SciPy/NumPy) for the classification.
I think I can get frequency features using scipy.fft, but I don't know how to approximate musical notes from those frequencies. I researched a bit and found that I need chroma features, which map frequencies to 12 bins, one for each note of the chromatic scale.
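To make the mapping concrete, here is a minimal sketch of what I understand chroma features to be, using only NumPy. It assumes equal temperament with A4 = 440 Hz. The function names (`freq_to_pitch_class`, `chroma_from_frame`) and the `fmin`/`fmax` limits are my own illustrative choices, not taken from any toolbox, so treat this as a rough starting point rather than a reference implementation.

```python
import numpy as np

# Pitch class names, index 0 = C (illustrative convention).
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def freq_to_pitch_class(freq_hz, a4_hz=440.0):
    """Map frequencies in Hz to pitch classes 0..11 (0 = C).

    Assumes equal temperament: MIDI note = 69 + 12 * log2(f / A4).
    """
    midi = 69.0 + 12.0 * np.log2(np.asarray(freq_hz, dtype=float) / a4_hz)
    return np.round(midi).astype(int) % 12


def chroma_from_frame(frame, sample_rate, fmin=27.5, fmax=4186.0):
    """Sum the spectral power of one audio frame into 12 chroma bins.

    fmin/fmax (roughly the piano range) are assumed limits; they also
    exclude the 0 Hz bin, for which log2 is undefined.
    """
    windowed = frame * np.hanning(len(frame))        # reduce spectral leakage
    power = np.abs(np.fft.rfft(windowed)) ** 2       # power per FFT bin
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)

    keep = (freqs >= fmin) & (freqs <= fmax)
    chroma = np.zeros(12)
    # Accumulate each FFT bin's power into the bin of its pitch class.
    np.add.at(chroma, freq_to_pitch_class(freqs[keep]), power[keep])

    total = chroma.sum()
    return chroma / total if total > 0 else chroma


# Example: a pure 440 Hz tone should land mostly in the "A" bin.
sr = 22050
t = np.arange(4096) / sr
chroma = chroma_from_frame(np.sin(2 * np.pi * 440.0 * t), sr)
print(NOTE_NAMES[int(np.argmax(chroma))])
```

For a whole file, the same function would be applied to successive overlapping frames and the resulting 12-element vectors stacked into a matrix.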
I think there's a chroma toolbox for MATLAB, but I don't think there's anything similar for Python.
How should I go forward with this? Could anyone also suggest reading material I should look into?