I am thinking about implementing a content based recommendation project idea in hadoop.
Just wanted suggestions if weka can be used with hadoop or if it is advisable to use Mahout for a cbf based implementation
Please advise
Thanks
I am thinking about implementing a content based recommendation project idea in hadoop.
Just wanted suggestions if weka can be used with hadoop or if it is advisable to use Mahout for a cbf based implementation
Please advise
Thanks
What is your content? What data surrounds or makes up your items?
There are several ways to use this for recs and some can blend in collaborative filtering data if appropriate. Take a look at rowsimilarity in Mahout. It calculates the similarity of an item to all others. If you input content in a way it can digest you'll have a CBR.
For instance input for recommending similar blogs posts would be: postID, tokens-from-the-text-of-the-post
Then for each postID you'll get a list other postIDs that use similar words. You can do the same for tags, categories, or other metadata too.