I am using a tool which spawns parallel jobs onto an SGE grid. When the grid is close to capacity some of the grid jobs start but others do not. This leads to wildly different results from the tool depending on the actual number of jobs that start.
So how can I get SGE to queue until all the grid slots are available and then start all the jobs at once?
Note that the grid slots will typically become available across a number of machines.