I have a DataFrame
of numerical features that I need to standarize. To do so I am using python MinMaxScaler to perform the following operations on all columns of the DataFrame
:
X = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))
Now I am thinking to do this using Scala. One way is to use MinMaxScaler
in Scala but it generates an array of features and store it as a new column. How can I use MinMaxScaler
and still have multiple columns of scaled features?