Researching the subject, one can find papers where the author makes image classification / retrieval using the "Bag of Words" model, while others do similar tasks using a "Bag of features" model.
Even though I have a basic understanding of the method involved (detect and extract visual words, build a visual dictionary, use machine learning to train a classifier), I still can't see the difference between both models. Are they synonyms? Maybe I have missed concrete examples / documentation that shows the difference...