Questions tagged [kubernetes-health-check]

215 questions
0
votes
1 answer

Unexplained outage due to faulty (?) Readiness check results

TL;DR: At an unexpected point in time, all the "web" pods handling traffic coming from our ingress became unhealthy. This lasted about an hour or so, and then everything was healthy again. I'm trying to figure out what happened, as it does not seem like any of…
0
votes
1 answer

Kubernetes can't detect unhealthy node

I am shutting down my k8s node manually to see if this affects the master. After shutdown I check the status of the nodes: kubectl get nodes. The node which went down is still shown as Ready in Status. As a consequence k8s still tries to schedule pods on this…
Barry Scott
  • 111
  • 1
  • 5
0
votes
1 answer

Understanding healthchecks for backend services on GKE when using ingress

I am using the following code in statefulset.yml apiVersion: apps/v1 kind: StatefulSet metadata: name: geth namespace: prod spec: serviceName: geth-service replicas: 2 selector: matchLabels: app: geth-node template: …
kosta
  • 4,302
  • 10
  • 50
  • 104
0
votes
2 answers

Mutually exclude 2 pods

I have two pods, pod-a and pod-b, and they belong to different applications inside an OpenShift project. Both pods expose a health check HTTP URL which I can use to check if a given pod is up and running. I don't want both pods to be up together at…
Mandark
  • 798
  • 1
  • 12
  • 33
0
votes
2 answers

Kubernetes health check public access

Let's say you have microservices running on many nodes. Every node exposes services to the internet, and they also have health REST services that are used internally but should be private. How do you make your health checks private in Kubernetes…
0
votes
1 answer

How to trigger pod restart on code error

I had an incident on one of my pods where code got corrupted. One pod would output a PHP error message instead of the correct page content ("unexpected end of file"), but since there were numerous correct pods, most of the time a visitor would get…
Steve
  • 145
  • 1
  • 8
0
votes
2 answers

How to reduce the "unhealthy" delay during pod startup?

I am using Kubernetes to start Java pods. The pod startup delay varies between 10 seconds and about a minute depending on the load of the node, the time Flyway took to migrate the tables, ... To avoid having Kubernetes kill the pods that are…
poussma
  • 7,033
  • 3
  • 43
  • 68
0
votes
1 answer

Kubernetes : Understanding Nodes resources usage, How to handle

I'm very new to Kubernetes; I just deployed a few services on a 2-node cluster. Following are the metrics. Nodes usage: kubectl -n services-namespace top nodes NAME CPU(cores) CPU% MEMORY(bytes) …
Rams
  • 2,141
  • 5
  • 33
  • 59
0
votes
0 answers

Not able to health check of service which is outside the cluster in prometheus

I have set up Prometheus and blackbox to monitor the Kubernetes cluster, and it is working fine for service monitoring, that is, probe success is fine for all services, but I am not able to monitor the services which are outside the cluster. For ex:…
0
votes
1 answer

kubelet was unhealthy after install k8s from Rancher catalog

I have three hosts to run a Rancher cluster. Rancher: 1.6.10 Kubernetes: 1.7.7 I installed k8s from the catalog on the master host and set the orchestration=true and etcd=true labels on two Rancher agent hosts. After the k8s stack finished, only the kubelet went wrong.…
online
  • 4,919
  • 10
  • 32
  • 47
0
votes
1 answer

kubernetes-dashboard CrashLoopBackOff: Couldn't read CA certificate: open : no such file or directory

I just installed a single-node Kubernetes cluster on CentOS 7 using kubeadm according to this manual, then installed the kubernetes-dashboard extension. But the pod status is CrashLoopBackOff. I have checked the logs of the dashboard docker…
lfree
  • 1,880
  • 3
  • 24
  • 39
0
votes
1 answer

Kubernetes Deployments across the Datacenters

Is it possible to fail over the traffic from a MySQL k8s deployment running in one datacenter to a deployment running in another datacenter along with its storage? If yes, do we need to spread the same k8s cluster across multiple datacenters or we…
Ijaz Ahmad
  • 11,198
  • 9
  • 53
  • 73
0
votes
1 answer

How to add kubernetes liveness probe

I am writing a simple YAML file to apply a liveness probe using a TCP port on CentOS 6. I pulled a centos:6 image from a public repository and started a container using the image, installed MySQL and started it to verify an open port (3306), committed to…
jazzsir
  • 609
  • 2
  • 7
  • 14
0
votes
1 answer

Using fabric8io kubernetes-client; is it possible to remove a [custom] watcher?

Watcher: private final class CustomKubeWatcher implements Watcher { ... } Added using: kubeClient.pods().inNamespace(kubeNamespace).watch(customKubeWatcher); How do I stop the watcher? Does it get cleaned up when my client instance is GC'd?…
mehmetg
  • 197
  • 1
  • 4
  • 16
0
votes
0 answers

Cluster reconciliation in the event of node loss

I have a cluster of 3 nodes that I'd like to recover fast after a single node loss. By recovering I mean that I resume communication with my service after a reasonable amount of time (preferably configurable). Following are various details: k8s…
deemok
  • 2,735
  • 19
  • 11