-1

I want to recommend items that are tagged and are categorized into three price categories (cheap, regular and expensive). I know that with Mahout recommendation could be achieved but here's why I don't know how to use it.

Mahout is based on the other users opinion but all of the new items that I want to recommend are just the new ones that don't have any preferences set yet.

Is Mahout the right tool for this? Is this content-based? (which mahout don't support yet????) or should I use classification?

Thanks!

Javier Manzano
  • 4,761
  • 16
  • 56
  • 86
  • 1
    are you sure you want recommendation model and not some clustering method? (from your question It seems all your data are new - so basically you don't have any training set) – xhudik Feb 27 '13 at 09:03
  • I really don't know yet if I want this or classification. I'm still reading and I don't even know what to do – Javier Manzano Feb 27 '13 at 16:55

2 Answers2

1

Since I've never built any recommender system - do not take this answer very seriously (no-one has answered it, so I try)

recommendation system has to be built on some already known (or partially known data). If you have only new (unseen) data there is only possibility to use some clustering algorithm in order to build some clusters.

And if those clusters would be ok, they can be used for training some recommendation system.

Mahout is just a tool which implement various ML methods. You can use other tools like Weka, R, ...

xhudik
  • 2,414
  • 1
  • 21
  • 39
  • Well, this algorithm I'm trying to create will be based on data collected before, but has to be applied to new ones. – Javier Manzano Feb 28 '13 at 10:51
  • I think, I'll go with clustering because it's what sounds better for me :) – Javier Manzano Feb 28 '13 at 10:51
  • I recommend lessons about recommender systems in coursera: https://class.coursera.org/ml-2012-002/lecture/index (week IX: Recommender systems) - you need to register – xhudik Feb 28 '13 at 13:56
  • maybe you need register at www.coursera.org and then enter the lessons (for me it is working) – xhudik Mar 01 '13 at 10:21
0

If you have no data at all about a new user, there's really nothing you can do to make recommendations, no matter what you do. There is zero input that would differentiate the person from anyone else.

Good systems should however be able to do something reasonable after the first input is available.

This is not a classifier problem by nature, no. It is also not a clustering tool, other answers notwithstanding.

The price categories are not core to any rec process you would use. You have other data presumably, what is it? That's important.

Finally whether or not to use Mahout depends on taste. You would use it if you want to use Java and Hadoop. And in turn you would only consider Hadoop if you had very large input, and few people have that much data (like >10M data points at least).

(Well, not quite -- my recommender pieces in Mahout pre-date Hadoop and are for on-line, smaller-scale applications. You might indeed be interested in this, if you are working in Java.)

Sean Owen
  • 66,182
  • 23
  • 141
  • 173