I just wanted to know, why my ParallelGC (--conf "spark.executor.extraJavaOptions=-XX:+UseParallelGC"
) in a very long Spark ML Pipeline works faster than when I set G1GC (--conf "spark.executor.extraJavaOptions=-XX:+UseG1GC"
), even though the Spark community suggests G1GC to be much better than the ParallelGC.
Any pointers on this would help.