Questions tagged [mahout]

Apache Mahout is an open source, scalable machine learning project

This topic covers questions related to Apache Mahout, a scalable machine learning project written in Java and largely based on Apache Hadoop, with implementations of algorithms for:

1171 questions
-1
votes
1 answer

Item Based Similarity Metric

I am using Apache Mahout to write an item-based recommender (based on similar item ratings by users) and I was wondering which of the following similarity metrics would be the best to use: Pearson, Spearman, Euclidean, Tanimoto, or log-likelihood.
tlauer
-1
votes
1 answer

How to combine Mahout recommendations

I'd like to build a simple recommendation system. Let's say it is for an online shop, where I have events like purchases, likes, and views. Currently, I understand how to build a recommendation for each of those event types separately. But I can't figure…
kikulikov
-1
votes
1 answer

Where are Mahout logs stored?

I'm new to Mahout. I ran a Mahout clustering task, but it fails and I can't tell why. Where are the Mahout logs stored?
Vijay_Shinde
-1
votes
1 answer

Error when running mahout cat ponut.csv

When I run mahout cat ponut.csv I get this error: Running on hadoop, using /usr/local/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /Users/shawn/Code/mahout/examples/target/mahout-examples-0.9-job.jar Exception in thread "main"…
Shawn
-1
votes
1 answer

Mahout clustering with one-dimensional k-means

Can I cluster data with one variable instead of many (which I have already tested) using the Mahout k-means algorithm? If yes (I hope so :)), could you give me an example of clustering? Thanks.
user3260388
-1
votes
4 answers

Programming in Mahout

What is the step-by-step procedure for executing a program in Mahout?
Anakha
-1
votes
1 answer

Apache Mahout Training on Sample Data vs Implementing on Actual Data

The scenario is like this: I am trying to make a recommender using Apache Mahout and I have some sample preference (user, item, preference value) data for generating the similarity matrix and determining item-item similarities. But the actual…
-1
votes
1 answer

Running the Mahout k-means clustering command without converting the input file to vectors

I have a dataset (300 MB) on which I wish to run k-means clustering using Mahout. The data is in the form of a CSV file that contains only numerical values. Is it still necessary to input the file in vectorized format for the Mahout k-means command? If not,…
-1
votes
1 answer

Mahout: Recommending Items for a user in particular product category

What do we have as of now? - We are using Mahout's GenericItemBasedRecommender to get a list of recommended products for a user using TanimotoCoefficientSimilarity as ItemSimilarity. Where do we want to go from here? - The above works fine when we…
-1
votes
1 answer

Which is better for running recommendations on a Hadoop cluster, Apache Mahout or using R with Hadoop (via hadoop streaming/RHIPE/RHadoop etc)?

I am new to big data and am looking for a good platform to perform recommendations, clustering, and classification. I understand Mahout has many algorithms to do this. Also, R itself, being a very good analytical tool, is more than helpful for achieving…
Kiran Karanth
-1
votes
2 answers

Vector.class doesn't exist in Mahout-core 0.7 -cdh4.2.1 jar, not able to run SimpleKMeansClustering Example

Hi, I'm new to Mahout, so I was trying to run the SimpleKMeansClustering example from GitHub. I downloaded the Mahout-core jar from the Cloudera repository. Now when I try to compile my code in Eclipse, I cannot find Vector.class in…
user2454360
-1
votes
1 answer

List of recommendations without outdated (old) items

I couldn't find any info on this anywhere. I have a setup of users and items, where the items can become outdated pretty fast (in days). This means I cannot show those items anymore, since the customer (who sells items) has no credit left and we…
ulkas
-1
votes
2 answers

What does it mean that Mahout is built on Hadoop?

I have read that Mahout is developed on Hadoop, and that we can use Mahout with or without Hadoop. What does it mean to say that Mahout algorithms can be run on Hadoop? Mahout algorithms are not written in MapReduce form, right? Could you please…
Neethu Prem
-1
votes
2 answers

Has anybody tried to check out Mahout (SVN) from Eclipse (Subclipse)?

I was trying to check out a copy of Mahout using Subclipse. I typed in the following link to try to create a new repository location, but it failed. Did I do something wrong? Thanks https://svn.apache.org/repos/asf/lucene/mahout/trunk The error…
flyingfromchina
-1
votes
2 answers

Should I use Mahout for this?

I want to recommend items that are tagged and are categorized into three price categories (cheap, regular, and expensive). I know that recommendations could be achieved with Mahout, but here's why I don't know how to use it. Mahout is based on the…
Javier Manzano