Questions tagged [horizontal-scaling]

horizontal scaling refers to partitioning data in such a way, that parts of data are placed on different servers (which is opposed to vertical scaling, when the same server gets more capacity)

In horizontal scaling data is partitioned in such a way, that parts of data are placed on different servers. This is required when it is no longer possible to fit data into the same machine / server / db. In search applications this type of scaling is often referred to as sharding. This is opposite to vertical scaling, when the servers already in use are made more powerful (by adding more CPU cores / RAM / disks etc).

Horizontal scaling is aimed at copying with growing data set, while vertical scaling is aimed at growing user base (more users asking content from the same server / shard).

Popular questions:

Related links:

Related tags:

240 questions
62
votes
1 answer

Scaling Postgres horizontally

Let we say you are running your business on top of postgresql database. After some time you get so much traffic that it cannot be handled by single postgresql instance, so you want to add more instances (scale it horizontally) to be able to handle…
user232343
  • 2,008
  • 5
  • 22
  • 34
62
votes
6 answers

Can relational database scale horizontally

After some googling I have found: Note from mysql docs: MySQL Cluster automatically shards (partitions) tables across nodes, enabling databases to scale horizontally on low cost, commodity hardware to serve read and write-intensive workloads,…
Maksym
  • 4,434
  • 4
  • 27
  • 46
14
votes
3 answers

Microservices: database and microservice instances

Lets say we have a microservice A and a B. B has its own database. However B has to be horizontally scaled, thus we end up having 3 instances of B. What happens to the database? Does it scale accordingly, does it stays the same (centralized)…
nobitta
  • 195
  • 1
  • 11
11
votes
6 answers

How to avoid concurrency issues when scaling writes horizontally?

Assume there is a worker service that receives messages from a queue, reads the product with the specified Id from a document database, applies some manipulation logic based on the message, and finally writes the updated product back to the database…
zafeiris.m
  • 4,339
  • 5
  • 28
  • 41
10
votes
1 answer

Is there a way in Kubernetes to check when hpa happened?

I have hpa configured for one of my deployment in Kubernetes. Is there any way to check if HPA scaling happened to the deployment and when it happened? I don't have prometheus or any monitoring solutions deployed.
karthikeayan
  • 4,291
  • 7
  • 37
  • 75
8
votes
2 answers

How does Kubernetes Horizontal Pod Autoscaler calculate CPU Utilization for Multi Container Pods?

Question 1.) Given the scenario a multi-container pod, where all containers have a defined CPU request: How would Kubernetes Horizontal Pod Autoscaler calculate CPU Utilization for Multi Container pods? Does it average them? (((500m cpu req + 50m…
neoakris
  • 4,217
  • 1
  • 30
  • 32
8
votes
1 answer

How to distribute persistent connection across a Elixir/Phoenix cluster?

So this question is bugging me, and I actually don't need this kind of scale, I'm just curious. Recently Chris Mccord got 2 million persistent connections in a single box using Elixir/Phoenix. If I have 3 boxes in a cluster, each one handling 2…
8
votes
2 answers

Scaling out Windows Services

I am looking for some input on how to scale out a Windows Service that is currently running at my company. We are using .NET 4.0 (can and will be upgraded to 4.5 at some point in the future) and running this on Windows Server 2012. About the…
Artless
  • 4,522
  • 1
  • 25
  • 40
8
votes
3 answers

Horizontal scaling of JSF 2.0 application

Given that JavaServer Faces is inherently stateful on the server side, what methods are recommended for horizontally scaling a JSF 2.0 application? If an application runs multiple JSF servers, I can imagine the following scenarios: Sticky Sessions:…
7
votes
1 answer

Scaling GKE pods based on number of active connections per pod

I have a running GKE cluster with an HPA using a target CPU utilisation metric. This is OK but CPU utilisation is not the best scaling metric for us. Analysis suggests that active connection count is a good indicator of general platform load and…
benjimix
  • 609
  • 7
  • 18
7
votes
1 answer

Kubernetes horizontal pod autoscaling initial delay?

Is there a configuration in Kubernetes horizontal pod autoscaling to specify a minimum delay for a pod to be running or created before scaling up/down? For example with something like: # I am looking for a flag like…
Pierre B.
  • 11,612
  • 1
  • 37
  • 58
7
votes
2 answers

Using Horizontal Pod Autoscaling along with resource requests and limits

Say we have the following deployment: apiVersion: apps/v1 kind: Deployment metadata: ... spec: replicas: 2 template: spec: containers: - image: ... ... resources: requests: cpu:…
mittelmania
  • 3,393
  • 4
  • 23
  • 48
7
votes
1 answer

Node.js scaling out on Kubernetes

I built an app on node.js using Docker and I'm not sure how to scale it on a Kubernetes cluster so that I take the most out of my cluster hardware. From a performance perspective which of the following is better: clusterize my node app and run as…
Thiago Caramelo
  • 309
  • 3
  • 9
7
votes
2 answers

Horizontal scaling and cron jobs

I was recently forced to move my app to Amazon and use auto-scaling, I have stumbled on to a issue with cron jobs and automatic scaling. I have a cron job running every 15 minutes which checks if subscriptions should be charged, the query selects…
7
votes
1 answer

Horizontally scaling Scrapyd

What tool or set of tools would you use for horizontally scaling scrapyd adding new machines to a scrapyd cluster dynamically and having N instances per machine if required. Is not neccesary for all the instances to share a common job queue, but…
gerosalesc
  • 2,983
  • 3
  • 27
  • 46
1
2 3
15 16