You need to run, say, 30 srun jobs, but each job must run on a node from a particular list of nodes (nodes that have the same performance, so that timings can be compared fairly). How would you do it?
What I tried:
srun --nodelist=machineN[0-3] <some_cmd>
: runs <some_cmd> on all the nodes simultaneously (what I need: to run <some_cmd> on one of the available nodes from the list).
srun -p partition
: seems to work, but needs a partition that contains exactly machineN[0-3], which is not always the case.
Ideas?
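
One direction that might work, sketched under assumptions rather than tested on a real cluster: instead of requiring nodes with --nodelist, exclude every node that is not in the list, so the scheduler is free to pick any single remaining node. The helper name complement_nodes is hypothetical; the sinfo and scontrol calls shown in the comments are only an assumed way to obtain the node names, and whether --exclude behaves as wanted depends on the partition the job is submitted to.

```shell
# Hypothetical helper: read node names (one per line) on stdin and print,
# comma-separated, the ones that are NOT given as arguments.
complement_nodes() {
    allowed=" $* "
    out=""
    while IFS= read -r node; do
        [ -z "$node" ] && continue
        case "$allowed" in
            *" $node "*) ;;                 # allowed node: leave it eligible
            *) out="${out:+$out,}$node" ;;  # any other node: add to the exclude list
        esac
    done
    printf '%s\n' "$out"
}

# Assumed usage on a cluster (commented out so the block runs without Slurm):
#   allowed=$(scontrol show hostnames 'machineN[0-3]')
#   exclude=$(sinfo -h -N -o '%N' | sort -u | complement_nodes $allowed)
#   srun -N1 --exclude="$exclude" <some_cmd>
```

If the exclude list comes out empty (the partition already contains only the wanted nodes), the --exclude option should simply be dropped.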