7

I want to run Mahout's K-Means example in a hadoop cluster of 5 machines. Which Mahout jar files should I need to keep in all the nodes, in order for the K-Means to be executed in a distributed manner. Thanks.

-Venkiram

Sean Owen
  • 66,182
  • 23
  • 141
  • 173
Venkiram
  • 201
  • 1
  • 3
  • 6

1 Answers1

2

If you really just want to run the built-in K-Means, or other jobs with static drivers, the answer is 'none'. The mahout 'job' jars are self-contained hadoop job jars. If you submit a job to the cluster with 'hadoop job' it will work without any other jars.

bmargulies
  • 97,814
  • 39
  • 186
  • 310