I have a task that would benefit from more cores but the standalone scheduler launches it when only a subset are available. I’d rather use all cluster cores on this task.
Is there a way to tell the scheduler to finish everything before allocating resources to a task? Put another way the DAG would be better for this job if it ended all paths before executing a task or waited until more cores were available. Perhaps a way to hint that a task is fat? I am not and do not wish to run Yarn.
Succinctly: I need to run this map task on an otherwise idle cluster so it has all resources/cores. Is there any way to do this? Even a hacky answer would be appreciated.
Any ideas?