1

I want to run the KMeans algorithm of MLLIB (Apache Spark), but with reproducible results.

Is it possible to run KMeans of MLLIB (Apache Spark) with fixed seed? How?

Thanks and regards,

zero323
  • 322,348
  • 103
  • 959
  • 935
learning_spark
  • 669
  • 1
  • 8
  • 19

2 Answers2

0

As of Spark 1.2, it appears that this is not possible. It generates a seed internally.

Sean Owen
  • 66,182
  • 23
  • 141
  • 173
0

Set seed for KMeans is availble in Spark MLLib since version 1.4.0 See doc library here

m.piunti
  • 340
  • 2
  • 8