In my experience, sometimes when I apply transformation()
to large data, it seems that the tasks are not evenly partitioned and are skewed to one side so that only a few tasks are working. As a result, it was confirmed that the efficiency of the work was poor.
When the tasks are not evenly partitioned:
I want to know in more detail the cause of when the tasks are biased towards one another.
Any Ideas?