I have randomforest regressor pyspark ml model .response variable is of 9 classses.
When I predict the test data I am getting probability I need to get the classes instead.
Code used:
rf = RandomForestRegressor(featuresCol="scaled_features")
pipeline = Pipeline(stages=[featureIndexer, rf])
# Train model. This also runs the indexer.
model = pipeline.fit(train)
# Make predictions.
predictions = model.transform(test)
evaluator = RegressionEvaluator(labelCol="label", predictionCol="prediction", metricName="rmse")
rmse = evaluator.evaluate(predictions)