How to optimize treeAggregate of LBFGS on spark

Asked Sep 14 '16 at 08:06

Active Sep 20 '17 at 03:01

Viewed 392 times

I'm run LBFGS on spark, with 5 features and 10w records, and found treeAggregate this:

We can see the treeAggregte is time-consuming

I have 100 cores, and every job 'treeAggregate at LBFGS.scala:218' has 1w+ tasks

edited Sep 22 '17 at 17:48

Community

asked Sep 14 '16 at 08:06

Dylan Wang

i have ran 'coalesce' on my trainData, reduce partitions equal to cores, and got a big improvement. But why? – Dylan Wang Sep 18 '16 at 06:56

0 Answers0