Is it still necessary to repartition spark dataframe after enabling AQE?

Asked Sep 14 '22 at 07:55

Active Sep 14 '22 at 08:04

Viewed 102 times

As I learned the spark AQE (Adaptive Query Execution) is taking care of the spark data frame partition dynamically at the runtime (if shuffling).

Therefore do we still need to concern about "manually" repartition?

And, does the processed data frame partition number relates to the number of current parallelism (spark.sparkContext.defaultParallelism) or the input dataframe's partitions?

edited Sep 14 '22 at 08:04

asked Sep 14 '22 at 07:55

QPeiran

1,108
1
8
18

Spark will take care of it if AQE enable , for data suffeling. – Soumen C Sep 14 '22 at 11:58

Is it still necessary to repartition spark dataframe after enabling AQE?

0 Answers0