I am testing a two-node cluster and testing various split-brain scenarios.
It is all working fine, with resources doing what I would expect, except I am wondering about timing in a particular case.
I am running with
auto_tie_breaker: 1
auto_tie_breaker_node: lowest
lowest would be node 1 in my case.
In the case where the resources are on node 1 and I cause a split-brain scenario everything seems fine.
However, when the resources are on node 2, and I cause a split-brain, the resources are shutdown on node 2 and started on node 1, which makes perfect sense.
But I am wondering about the timing. What is the mechanism, now that the two nodes cannot communicate with each other, that makes sure the resource on node 1 starts after the resource on node 2 shutdowns?
I can imagine putting a sleep in the start up script for the resource, but I was wondering if there is an officially endorsed way to handle this timing?
Or perhaps I am going about this the wrong way. Perhaps I should use some sort of stickiness. Telling the nodes to continue what they were doing when a split-brain condition is detected. In other words, on split-brain detected do not change anything.