I have read several pieces of documentation suggesting that the driver runs on its own node, the master, and that the executors run on slave nodes, also called workers. I am still somewhat confused by this, so I would like to confirm the following if possible:
Where does the driver run on a cluster of type:
- Standalone
- YARN
- Mesos
I think I have the answer for YARN, which is that the driver runs on the master. However, I am unsure about Standalone and Mesos. Can someone help clarify?
Finally, if a driver shares a node with an executor, does that mean that when we size the cluster nodes we need to take into account that more threads might run on that node and that its memory usage might be higher? In other words, should we systematically oversize our nodes to account for a potential driver?
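
To make the sizing concern concrete, this is roughly the arithmetic I have in mind. It is only a sketch: the function name `node_requirements` and all the numbers are made up for illustration, and it ignores memory overhead and whatever the OS needs. The driver figures stand in for settings such as `spark.driver.cores` and `spark.driver.memory`.

```python
def node_requirements(executors_per_node, executor_cores, executor_memory_gb,
                      driver_cores=0, driver_memory_gb=0.0):
    """Return the (cores, memory_gb) a single node must provide.

    Hypothetical helper: sums what the executors on the node need and
    adds the driver's share if a driver may be co-located on that node.
    """
    cores = executors_per_node * executor_cores + driver_cores
    memory_gb = executors_per_node * executor_memory_gb + driver_memory_gb
    return cores, memory_gb


# Assumed example values: 2 executors per node, 4 cores and 8 GB each.
executors_only = node_requirements(2, 4, 8.0)

# Same node, but also hosting a driver with 1 core and 2 GB (assumed values).
with_driver = node_requirements(2, 4, 8.0, driver_cores=1, driver_memory_gb=2.0)

print(executors_only)  # (8, 16.0)
print(with_driver)     # (9, 18.0)
```

If the driver can land on any worker, the second figure is what every node would need, which is the oversizing I am asking about.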