I have a JavaRDD which contains arrays of doubles. Now I want to calculate the pearson coefficient between each array. But if I convert the rdd to vectors and apply statistics.corr(), the function calculates for the columns but i want it to calculate for rows. Can anyone suggest a way to convert my data to columns so that i can apply corr() function on that?
Edit: The statistics.corr() function takes javardd < vector > as input.