-2

What's the difference between the ml.classification.DecisionTreeClassifier in ml and the mllib.tree.DecisionTree trainClassifier in mllib?

Drup
  • 3,679
  • 13
  • 14
peterlandis
  • 645
  • 1
  • 7
  • 17

1 Answers1

0

The spark.mllib DecisionTree classifier is built on top of RDDs while the spark.ml provides a further API built on top of DataFrames for constructing ML pipelines. It's recommended to use spark.ml due to the advantages of DataFrames optmizations.

Here's a good post I found after searching more of the forums: Difference between org.apache.spark.ml.classification and org.apache.spark.mllib.classification

peterlandis
  • 645
  • 1
  • 7
  • 17