Questions tagged [autoscaling]

Auto scaling is a feature of cloud computing services which allows adding more computational resources automatically if load increases or based on triggers/custom metrics.

Auto scaling is a feature of cloud computing services which allows almost infinite growth of computational power automatically as load increases on CPU, memory or disk storage.

In particular, such feature is available, for example, in Amazon EC2 service using ASG.

1836 questions
19
votes
1 answer

Mongodb cluster with aws cloud formation and auto scaling

I've been investigating creating my own mongodb cluster in AWS. Aws mongodb template provides some good starting points. However, it doesn't cover auto scaling or when a node goes down. For example, if I have 1 primary and 2 secondary nodes. And the…
18
votes
3 answers

Best way to manage code changes for application in Amazon EC2 with Auto Scaling

I have multiple instances running behind Load balancer with Auto Scaling in AWS. Now, if I have to push some code changes to these instances and any new instances that might launch because of auto scaling policy, what's the best way to do this? The…
18
votes
3 answers

Kubernetes scaling based on network utilization or requests per second

Is there any way to scale Kubernetes nodes based on network utilization and not based on memory or CPU? Let's say for example you are sending thousands of requests to a couple of nodes behind a load balancer. The CPU is not struggling or the…
Ulukai
  • 1,360
  • 2
  • 12
  • 17
18
votes
3 answers

Continuous deployment & AWS autoscaling using Ansible (+Docker ?)

My organization's website is a Django app running on front end webservers + a few background processing servers in AWS. We're currently using Ansible for both : system configuration (from a bare OS image) frequent manually-triggered code…
17
votes
1 answer

How does pod replica scaling down work in Kubernetes Horizontal Pod Autoscaler?

My understanding is that in Kubernetes, when using the Horizontal Pod Autoscaler, if the targetCPUUtilizationPercentage field is set to 50%, and the average CPU utilization across all the pod's replicas is above that value, the HPA will create more…
pkout
  • 6,430
  • 2
  • 45
  • 55
17
votes
4 answers

How do I force a CloudFormation stack to update when the parameter is updated?

I am running a AWS CloudFormation stack that takes in some parameters and launches EC2 instances along with other AWS resources. The parameters are fed into the user data of the EC2 instance and based on that changes are made dynamically to the web…
17
votes
2 answers

Use Reserved instance and autoscaling group

I would like to know if it would possible to create an architecture with both reserved instance (RI) and auto-scaling group to serve web pages. The idea would be to have one RI serving 24/7 and launching on demand instances in an auto-scaling group…
Julien
  • 185
  • 1
  • 6
17
votes
5 answers

How can I prevent EC2 instance termination by Auto Scaling?

I would like to prevent EC2 instance termination by Auto Scaling feature if that instance is in the middle of some sort of processing. Background: Suppose I have an Auto Scaling group that currently has 5 instances running. I create an alarm on…
SunnyDark
  • 291
  • 1
  • 3
  • 9
16
votes
3 answers

How to edit the Configurations of a Helm Chart?

Hi everyone, I have deployed a Kubernetes cluster based on kubeadm and for the purpose of performing HorizontalPodAutoscaling based on the Custom Metrics, I have deployed prometheus-adpater through Helm. Now, i want to edit the configuration for…
16
votes
1 answer

Can I use AWS code Deploy for pulling application code while autoscaling?

I want to autoscale the infrastructure when load gets high. I am running my infrastructure on AWS. I have a requirement where I need to pull the application code from Github when autoscaling happens. As the code changes frequently, we can't take an…
16
votes
2 answers

How does Auto Scaling "place" instances when used with multiple availability zones?

Consider the case when an Auto Scaling group is configured to span multiple availability zones (such as in this scenario). When a new Amazon EC2 instance should be added to the scaling group (scale out) based on demand, how does Auto Scaling decide…
pmaugeri
  • 319
  • 2
  • 8
15
votes
2 answers

Dynamic naming for AWS EC2 instances launched by Terraform aws_autoscaling_group

I have successfully created an autoscaling group using Terraform. I would like to now find a way to dynamically name the provisioned instances based on index value. For an aws_instance type, it can be easily done by: resource "aws_instance" "bar" { …
Timothy T.
  • 1,031
  • 1
  • 12
  • 25
15
votes
2 answers

Scaling Down Auto Scaling Group in AWS causes perpetual alarm

I am currently using two scaling policies which are attached to my auto scaling group: A A scale up policy which is invoked when a CloudWatch alarm is invoked. This CloudWatch alarm uses the CPUUtilization metric and fires at CPU over 80%. The…
Nick S.
  • 2,203
  • 3
  • 19
  • 21
15
votes
1 answer

EC2 autoscaling does not put name (tag) on servers

I've created an autoscaling group on EC2 and it's working just fine. Servers scale up and down depending on load. I'd like to have a little more info on the management side and am wondering if there's a way to get the autoscaling group to…
joelpoloney
  • 419
  • 4
  • 13
14
votes
5 answers

Kubernetes autoscaler - NotTriggerScaleUp' pod didn't trigger scale-up (it wouldn't fit if a new node is added)

I'd like to run a 'job' per node, one pod on a node at a time. I've scheduled a bunch of jobs I have a whole bunch of pending pods now I'd like these pending pods to now trigger a node scaling up event (which does NOT happen) Very much like this…
Chris Stryczynski
  • 30,145
  • 48
  • 175
  • 286