Questions tagged [healthcheck]

108 questions
3
votes
1 answer

HAProxy health check for a single backend

I have a HAProxy configuration with a single backend something very similar to this: backend mybackend option httpchk get /ping http-check expect ! rstatus ^5 server mybackend-0 192.168.1.1:9041 weight 1 The /ping endpoint always return 200. My…
Zeeshan
  • 33
  • 1
  • 3
3
votes
1 answer

How to health check hosts over HTTPS behind the load-balancer?

I have NGINX serving my site over HTTPS on EC2 hosts behind Elastic Load Balancer. I need to set up a proper health-check in ELB. However, I cannot use https://www.example.com because I need to health-check a HOST, not my site. And on the other…
curiousboy
  • 183
  • 1
  • 5
3
votes
2 answers

Varnish configure a backend to have multiple probes?

Is it possible in Varnish 3 to configure a backend to have multiple probes? I have multiple varnish servers and multiple backend servers running Drupal. I've configured a basic 1 second interval healthcheck PHP file to verify the health of the…
Highway of Life
  • 506
  • 1
  • 7
  • 14
3
votes
3 answers

What to check during a periodic system health check

I have been tasked with preparing a list of checks to do as a part of a weekly system health check routine which my team is supposed to do. The problem is neither I nor any of my colleagues have ever been a professional system administrator and the…
3
votes
2 answers

MySQL Slave Health Check

I am working on a project like creating database config on fly , got an array of read replicas for mysql instance and keeping one open connection to each and keeping their statics in that service so when a client tries to connect a read replica than…
Kevin Lee
  • 93
  • 1
  • 5
3
votes
2 answers

Health Tests on NVMe

On the servers I have, with HDD or SSD, I have a cron that periodically runs: /usr/sbin/smartctl --test=short/long /dev/sd1 (for each disk) While it runs, it just looks at the output of /usr/sbin/smartctl -c /dev/sd1, looping until it no longer…
Nuno
  • 553
  • 2
  • 8
  • 26
3
votes
1 answer

How do I exclude health checks from Google Cloud Stackdriver traces

I'm running my app in Google Kubernetes Engine, and the Stackdriver Traces are very useful to determine the health and speed of my app. However the large majority of request coming into my app are the health checks from the load-balancer to make…
2
votes
1 answer

AWS Elastic Load Balancer and target group health check fail for no apparent reason

I have completed my AWS ELB architecture for our website and successfully created the Launch Configuration and Target Groups for which instances are created behind the Load Balancer. My configuration is as follows : Target…
2
votes
0 answers

Elastic Beanstalk environment and security group config for ELB health checker?

I have an Elastic Beanstalk environment that I can't get Health Checks to work on. Here's my Elastic Beanstalk's Load Balancer configuration: My healthcheck URL is /status/ My EC2 security group (screenshot below) allows all HTTP traffic (for debug…
2
votes
1 answer

Determine load balancer health check timeout

How do you determine reasonable health check timeouts for load balancers? My application is failing load balancer health checks. I'm using the default health check timeout of 5 seconds, but I've noticed that average latency graphs on CloudWatch jump…
2
votes
0 answers

Is RST, ACK expected in a TCP Health Check

I've been experiencing some issues with two servers that I have with my VPS hosting provider where I've been getting intermittent connection errors. I've done LAN traces from the server and I spotted what I believe is an issue between the load…
Boardy
  • 258
  • 1
  • 4
  • 23
2
votes
2 answers

Broken GKE Backend Health Check Default

I have read this and I understand (I think) the differences between Kubernetes livenessProbes, and the GKE LoadBalancer health checks. My problem is this: I am exposing most of my Kube services via NodePort, which by default creates a new Backend…
2
votes
1 answer

Haproxy http response timeout check

We use haproxy as http load balancer. Sometimes one of our servers stop responding while accepting http connection requests. So stats page displays servers as green "accessible" but our nagios server says "CRITICAL - Socket timeout after 20 seconds"…
Mustafa
  • 21
  • 1
  • 2
2
votes
3 answers

How to make HAProxy wait for successful health check before bringing server out of maintenance mode

I am using HAProxy socket communication to put my backend into maintenance mode before bringing the web server down for maintenance via this command: echo "disable server cluster-01/app-01-4" | sudo socat stdio /tmp/haproxy.sock The problem is that…
WickedElephant
  • 123
  • 1
  • 4
2
votes
2 answers

How to monitor and measure the health of Compact Flash cards used as server storage?

Context The company sells access to a sort of cash register web application. Access to the application is given through a VPN. The VPN entrypoint for the clients is a Soekris board running Voyage Linux (a trimmed down version of Debian). These…
Antoine Benkemoun
  • 7,314
  • 3
  • 42
  • 60