I'm using boto3 to apply custom application autoscaling policies to two HTTP endpoints, which increases instance counts for each. These policies are triggered based on the size of a single queue. For example, both endpoints have alerts that trigger scale-out when the queue gets larger than 1000. Currently, both endpoints begin scaling as soon as this alarm goes High.
Endpoint #1 takes 3 mins to complete scale-out, while endpoint #2 takes 7 mins to complete scale-out. The issue is that endpoint #1 sends data to endpoint #2, so I need endpoint #2 to finish scaling out before endpoint #1.
So, I'm trying to find a way to add a time delay to endpoint #1's scale-out. It looks like the scale_out_cooldown
parameter only adds a time delay between scaling events, but not at initial triggering as I'd like. Is there a way to delay the initial scale-out event triggered by the alarm for a set period of seconds?