I found that PM2 uses the Node cluster module to run an application in cluster mode.
The cluster module supports two approaches to distributing connections across workers, and it uses the round-robin approach by default.
Documentation on how the Node cluster module works
That documentation says: The cluster module supports two methods of distributing incoming connections.
The first one (and the default one on all platforms except Windows) is the round-robin approach, where the primary process listens on a port, accepts new connections and distributes them across the workers in a round-robin fashion, with some built-in smarts to avoid overloading a worker process.
Since the default approach is round-robin, my assumption is that in PM2, if the first request goes to the first instance, then the second request should go to the next instance, which would be instance 2.
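To make that expectation concrete, here is a toy sketch of strict per-request rotation. It is only a model of my assumption, not PM2's or Node's actual scheduler; `makeRoundRobin` and the instance names are hypothetical.

```javascript
// Toy model of strict round-robin (hypothetical helper, not PM2 or Node internals).
function makeRoundRobin(instances) {
  let next = 0; // index of the instance that receives the next request
  return function pick() {
    const chosen = instances[next];
    next = (next + 1) % instances.length; // wrap around after the last instance
    return chosen;
  };
}

// Assumed setup: three instances, four incoming requests.
const pick = makeRoundRobin(["instance 1", "instance 2", "instance 3"]);
const order = [pick(), pick(), pick(), pick()];
console.log(order);
```

Under this model each request lands on a different instance in turn and the fourth request wraps back to instance 1.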
But what I found is that all requests go to one instance for some time, and after that all requests go to the next instance. So my question is: how long do all requests keep going to the same instance, and how does PM2 manage that duration?