Questions tagged [autoscaling]
282 questions
0
votes
1 answer
Autoscaling Groups: what happens during scale-in if all ec2 are termination protected?
Suppose I create an ec2 autoscaling group where each instance is protect terminated.
Suppose a min of 2 and a maximum of 4, I was wondering what happens if, in presence of a step policy:
There is a scale-out of 4 and then a scale-in: will always 4…

Phate
- 155
- 2
- 8
0
votes
1 answer
How to minimize ecs autoscaling reaction time from terraform?
When you create an ECS autoscaling policy, two alarms tag along with it: one for scaling up ("out"), one for scaling down ("in").
The scale-out ones I see created appear to sample CPU utilization (or the metric of interest) every minute, and only…

Jonas Kölker
- 103
- 3
0
votes
1 answer
Limit max number of simulataneous connections from NLB to a specific instance
I am trying to restrict maximal number of TCP connections from amazon NLB to a target group.
Example of a desired scenario:
We have 2 instances and NLB. The first 100 users with 100 TCP connections are directed to the first instance and are…

rlib
- 195
- 1
- 1
- 8
0
votes
0 answers
EBS target response time increasing before CPU utilization
I'm in a company where we have an elastic beanstalk configuration and it works fine with our CICD. The only issue is that earlier today i ran a stress test (basically just disabled AWS shield and went hammer on the DDOS). Regardless of what we did…
0
votes
1 answer
Connect instances in auto-scaling group to EC2 instance (coordinator node) without using elastic IP?
I have a setup where I do batch computation using spot instances in an auto-scaling group.
The batch computation is coordinated by a "coordinator node", which is just an EC2 instance. I use an elastic ip to ensure the spot instances can connect to…

Foobar
- 115
- 7
0
votes
1 answer
AWS auto scaling, "scale in" controls
I'm trying to move off a monolithic big server, that ran many jobs, into a design that uses auto scaling to build out more servers as more jobs are run. The issue I found in testing is that when the default scale in task took place it terminated a…

RunThor
- 197
- 2
- 12
0
votes
0 answers
GCE autoscaling based on custom metrics provided by the running instances?
I have a managed instance group with autoscaling.
Each instance runs a program that polls a remote site asking "do you have any work for me?" If the answer is "no", it goes to sleep for a few seconds and repeats. We consider this an idle instance.…

JayEye
- 121
- 4
0
votes
1 answer
Configure a new kubernetes node automatically
I configured a Kubernetes cluster with GPU in AWS, and one of the steps is configuring the nodes to enable Nvidia runtime by default.
So when I configured cluster auto scaler, the cluster scales up successfully but the new node still needs to be…

Sadmi
- 11
- 3
0
votes
1 answer
How can I autoscale my Azure Kubernetes nodepool before resources run out
I did some testing today with my autoscaling setup on Azure Kubernetes. I noticed that when an autoscale was triggered, it took a while for the next node to spin up, so the last pod had to wait a long time to be scheduled. I would like to make it so…

Ben Davis
- 280
- 1
- 6
- 18
0
votes
1 answer
Autoscaling with Google Compute Engine with Static IP pointing to domain name
I am looking to add autoscaling to my Google Compute Engine via the instructions I have found here: https://cloud.google.com/compute/docs/autoscaler
My original Compute Engine instance has a static ip and I have the domain (example: domain.com)…

Chrono
- 3
- 1
0
votes
0 answers
Targeting specific google cloud run instance through HTTP
I'm using cloud run to run a node.js app with lru-cache and auto scaling. I have created a route to allow external services to reset the application's cache. I am now looking for a way to target every running container of the app and to curl the…

J Dubuis
- 101
- 1
0
votes
1 answer
Google Run Managed - how keep a minimum number of instances and not have 0
We have a set of services deployed on google run platform managed for a web application and whenever there is no traffic, it seems the services go to sleep and then it takes a long time to wake them up whenever new traffic comes in.
Is there a way…

Gregory
- 1
0
votes
1 answer
google cloud Stackdriver Metrics to scale an Manage instance group - Regional
What are the Stackdriver Metrics we can use to autoscale Regional Manage instance groups ?
When i check the docs it says
Regional managed instance groups do not support filtering for per-instance metrics.
Regional managed instance groups do not…

Thulasya
- 3
- 2
0
votes
0 answers
autoscaling php-fpm in AWS ECS
I have a t3.medium instance managed by ASG and used by ECS. My application requires approximately 25% of the capacity of this instance to operate at a minimum. will work in the standard nginx + php_fpm dockerized configuration. With that, I thought…

Julio
- 1
- 1
0
votes
2 answers
Production level Auto-scaling in AWS
I have completely understood the concept of Auto-Scaling in AWS. My only question is, what AMI will the launch configuration use while in production environment?
According to my understanding Image of existing instance should be used. Lets say I…

Subramanian Lakshmanan
- 7
- 1
- 3