I have 1 instance. If latency more then 1 second for 3 minutes, it will add up another instance.
And here the problems arises: after 50 min(because of scaling cooldown) this 2nd instance terminates. And, if load is still high, latency jumps back to more than 1 second.
But because of scaling cooldown, it can't add up a new one again!
Is it possible to set up and down cool downs separately? Or another suggestion to solve this?
Making cooldown less not helps, instances just rises and dies more often, so application down time is still big.