I ran a SVM algorithm using MLIB library in Spark on a data of size 8G, and 7 million rows. I am running Spark in standalone mode on a single node.
I used /usr/bin/time -v to capture data about the job. I got the peak memory utilization, and % CPU time among other things. The % CPU utilization I got was a mere 6%. I was monitoring TOP while the program was running as well for sometime and I could see more than 100% being used almost consistently. I am now confused why /usr/bin/time showed only 6%?
Some more details - my machine is 16G, and the program I was running was consuming 13.88G. The program executed in 2.1 hour.
Any insights, anyone?