I'm trying to understand how Nomad spread
works with client failues.
In Nomad jobs, you can define a spread
stanza, such that a job's instances get spread actoss all client.
Here are the docs: https://www.nomadproject.io/docs/job-specification/spread
As the spread
is a soft preference, if one of clients goes down, for any reason, Nomad will migrate all the jobs running on the lost client to another available clients. (This takes effect even with bin-packing).
In case of a 2 client grid and a job with two allocations, if one client fails, both allocations will run on the same client.
What happens when the grid recovers and brings up a new client? Will the jobs be re-spread, following the spread
stanza, to both nodes, or will the two allocations continue to run on the same client until the job is re-run?