Questions tagged [prometheus]

The Prometheus monitoring system.

136 questions
1
vote
2 answers

Prometheus not connected to alert manager in GKE

I installed kube-prometheus-stack 15.3.1 into a GKE cluster using helm (in "monitoring" namespace). I used the values.yaml to open up ingresses on some of the components and to add SMTP info and reciever details into the alert manager. For the most…
1
vote
1 answer

Prometheus auto scrape metrics from multiple kube-state-metrics in kubernetes?

I want to use a kubernetes(cluster-0)with multiple kube-state-metrics to monitor multiple other kubernetes cluster(cluster-1,2,3,4) In the (cluster-0), I split into multiple namespaces like this: namespace: monitor (here I will run prometheus to…
Lê Minh Quân
  • 301
  • 2
  • 13
1
vote
1 answer

Prometheus: Get CPU Usage % for one specific process from windows_exporter

I am trying to develop one query to show the CPU Usage(%) for one specific process in one windows server. After digging into the metrics of windows exporter, I found Metric: process.windows_process_cpu_time_total should be the one. I tried several…
Sphinx
  • 141
  • 1
  • 3
  • 8
1
vote
2 answers

Breaking down one prometheus.yml file?

I am using Prometheus for our monitoring and I have a lot of configs (our prometheus.yml main config file is 8000+ lines long). I would like to divide this out into logical groupings so that it becomes much readable. I came to know that Prometheus…
1
vote
2 answers

Missing metrics for "kubelet_volume_*" in Prometheus

I setup latest https://github.com/coreos/kube-prometheus/ in an AWS EKS cluster in which I'm using the Amazon EBS CSI driver for persistent volume claims, but I don't see any "kubelet_volume_*" metrics being available in prometheus. According to…
Catalin
  • 21
  • 5
1
vote
0 answers

kube-state-metrics doesn't show hardware utilization

I installed this yamls https://github.com/kubernetes/kube-state-metrics/tree/master/examples/standard I can see CPU per pod utilization only from system pods such as: calico-node, coredns,…
user227685
  • 11
  • 3
1
vote
1 answer

Gitlab prometheus error after upgrade with apt-get upgrade

after upgrading my gitlab from 10.9.2 to version 10.10 on my Ubuntu 18.04 it was not possible for the apt-get upgrade process to end successfully. It ended with following message: Ensuring Prometheus is updated:Traceback (most recent call last): …
eragon-2006
  • 127
  • 3
1
vote
0 answers

Monitoring SLA/SLO/SLI using Prometheus

I have done much research about monitoring SLI metrics with Prometheus. I have found only how to monitor a cluster using Kubernetes. I'm hoping to find a response here for simple monitoring. I also want to know which is better for that using the…
1
vote
1 answer

Promethius, group_left, and "on" vs "ignoring"

In Issue #2204, one of the Prometheus developers says: ...in principle you should be favouring ignoring over on to produce generic shareable rules... I'm confused how the use of ignoring would lead to more generic rules. For example, consider a…
larsks
  • 43,623
  • 14
  • 121
  • 180
1
vote
1 answer

Filtering interface results from prometheus snmp-exporter by interface name

I'm collecting interface statistics from some switches using the following configuration: modules: if_mib: walk: - ifTable auth: community: monitor The data returned from the exporter is indexed by ifIndex. That is, a query…
larsks
  • 43,623
  • 14
  • 121
  • 180
0
votes
0 answers

Some Prometheus alerts mislabelled

We've been using Prometheus with fairly solid success so far. However, we've recently noticed that one of the sets of alerts we've set up ends up mislabelled: It shows up as a different environment than it is. This happens with both environments…
Keilaron
  • 111
  • 2
0
votes
1 answer

Monitoring EKS Kubernetes LoadBalancer service Type

I have created few services in Kubernetes with type: LoadBalancer. Platform: EKS. Is there a way to get number of 4xx or 5xx errors from this LoadBalancer? I have tried the following: Prometheus - Does not seem to be any metric collected for…
0
votes
1 answer

OpenShift 3.11: Prometheus operator update

According to the docs the Prometheus operator on a OpenShift 3.11 cluster is self-upgrading. However, i have upgraded the cluster to 3.11.141 yesterday, but the operator is still stuck on 3.11.117. There are prometheus images for 3.11.141 available,…
simonszu
  • 373
  • 7
  • 14
0
votes
1 answer

Run apt-get upgrade -s in container and expect same result as when running on the host

I would like to prepare the environment for a Docker container such that when apt-get upgrade -s runs inside the container its result is the same as if the command were run outside on the host. This is in the context of and in preparation for adding…
rookie099
  • 375
  • 3
  • 14
0
votes
0 answers

Analysing latency for a single request

I have the following (very simplified) network : (ingress) -> DMZ (nginx) -> HA Proxy -> Reverse proxy (nginx) -> Application Server (tomcat) All the layers send information to Prometheus and then we use Grafana to monitor them. We had some latency…