-2

What is the proper way of doing this to apply log2 to my RDD of numbers? Is there a function to help with this?

hli
  • 61
  • 1
  • 5

1 Answers1

1

RDD:

import org.apache.commons.math.util.MathUtils

val rdd: RDD[Double] = ???
rdd.map(x => MathUtils.log(2.0, x))

DataFrame:

import org.apache.spark.sql.functions.log2

rdd.toDF("value").select(log2("value"))