Questions tagged [autoscaling]

282 questions
4
votes
1 answer

What is the best metric for auto-scaling GPU instances for machine learning inference in the cloud?

We have an API in AWS with a GPU instance that does inference. We have an auto-scaler setup with the minimum and maximum number of instances, but aren’t sure which metric (GPU/CPU usage, RAM usage, average latency, etc) or combination of metrics…
elwray14
  • 41
  • 2
4
votes
1 answer

AWS Beanstalk Auto-scaling Scaling Triggers with UnhealthyHost count and Health Checking

I have my scaling triggers set to look for an un-healthy host count but it does not seem to be working. Now to test this - I am SSHing into one of my instances and halting the HTTPD service. Then, when I navigate to the health overview, I will…
rugbert
  • 83
  • 2
  • 10
3
votes
1 answer

How to dynamically set tag values in an AWS auto-scaling group

I'm trying to create an AWS auto-scaling group, and I want my instances to have a tag with Key=Name and Value=processor%i, where the "%i" is dynamically replaced with a 1, 2, 3, etc as the instances are added. Is this possible? I can't find any…
Cerin
  • 3,600
  • 19
  • 61
  • 79
3
votes
1 answer

How to share a EFS/s3 with code in a auto scaling group?

the main problem is: Infrastructure: Autoscaling (min 1 - max 3) RDS ELB elasticache (redis) elasticsearch I want to share a volume (EFS / S3) with the code of my application; The size of it is about 1.3 Gb. With EFS: The first attempt I tried to…
sysalam0
  • 71
  • 1
  • 5
3
votes
1 answer

Host discovery in AWS VPC

I just have a basic question. What is the recommended way to handle automatic host discovery in an AWS VPC? I have these Servers running and I want to figure out a neat way to skip having to configure the /etc/hosts file on every machine to get them…
3
votes
2 answers

Nosql autoscaling of compute

Most of the nosql autoscaling faces issue due to the fact the data have to be migrated during peak load. What if data is stored in a shared storage like CLVM which has less overhead(compared to NFS or shared file system). Now if each bucket/shard is…
kalyan
  • 249
  • 1
  • 3
  • 11
3
votes
1 answer

Searching AWS AutoScalingGroup resources by tags

I'm searching for autoscaling groups by tags - all of our AWS resources have a number of tags applied to them. However I only want to find auto scaling groups, not other types of resources. The should be possible using the tagging API; from the…
3
votes
1 answer

AWS auto-scaling policy - scaling-up too often

We have AWS auto-scaling policy configured for our Linux machine, but for some reason when I checked the scaling activity I realised that it triggers multiple 2-3 machines during very short span of minutes as you can see here: Activity…
3
votes
1 answer

Sequential hostnames and auto-scaling?

Is there a logical way to have sequentially numbered hostnames with servers that are autoscaling but not sequentially killed? So if server13 is killed and server14/15 are alive, and the cluster upscales with 2 servers it will add a server with…
ujjain
  • 3,983
  • 16
  • 53
  • 91
3
votes
2 answers

Running crontab on AWS auto scaling group

I have the following setup: 1 server with application X, that runs a crontab once an hour. The crontab connects to the database and runs some heavy calculations and exports data to a special data file and restarts the application. The export can run…
dobber
  • 31
  • 1
  • 4
3
votes
2 answers

Auto-heal an EC2 instance with an Auto Scaling Group?

I'm trying to setup an auto-healing EC2 instance using an Auto Scaling Group and a user-data startup script. If the current server has an issue where it is no longer reachable, the instance should terminate and a new one take its place. This is easy…
James Simpson
  • 1,651
  • 2
  • 15
  • 31
3
votes
2 answers

Available Instance types for marketplace ami's

I based my autoscaling AMI's on the Turnkey Linux nginx AMI from the marketplace. I am now unable to select any of the newer generation instance types; for instance, my autoscaling uses m3.large type but I'd really like it to use the c3.xlarge type…
Christian
  • 789
  • 1
  • 13
  • 31
3
votes
1 answer

Is memory usage the main cause of instance churn on Google App Engine?

I am experiencing sharp drops and spikes in instance count every 30 minutes although the request rate has been stable for 2h at 2.4k RPS. Periodically there are a lot of warmup requests after many instances get shutdown at the same time. This also…
3
votes
1 answer

ELB Instance Out of service

After implementing ELB and Autoscalling on Amazon AWS. There's a wierd issue happens and we don't know either the reason or the fix. We installed Auto Scalling configuration to add instance of AMI ID of Windows 2008 R2 x64, SQL Express and IIS. We…
3
votes
2 answers

EC2 Autoscale policy to terminate instances if not needed but before 1 hour

I have setup EC2 Autoscale policies along CPU monitors and it's working well. I have occasional bursts that triggers autoscale and more servers are added. If the servers are experiencing a burst that endures 30 minutes and two more instances are…
Claudio Poli
  • 285
  • 3
  • 10
1 2
3
18 19