The Prometheus monitoring system.
Questions tagged [prometheus]
136 questions
1
vote
2 answers
Prometheus not connected to alert manager in GKE
I installed kube-prometheus-stack 15.3.1 into a GKE cluster using helm (in the "monitoring" namespace). I used the values.yaml to open up ingresses on some of the components and to add SMTP info and receiver details to the alert manager. For the most…

Toby 1 Kenobi
- 287
- 2
- 13
1
vote
1 answer
Prometheus auto scrape metrics from multiple kube-state-metrics in kubernetes?
I want to use a Kubernetes cluster (cluster-0) with multiple kube-state-metrics to monitor multiple other Kubernetes clusters (cluster-1, 2, 3, 4).
In cluster-0, I split things into multiple namespaces like this:
namespace: monitor
(here I will run prometheus to…

Lê Minh Quân
- 301
- 2
- 13
1
vote
1 answer
Prometheus: Get CPU Usage % for one specific process from windows_exporter
I am trying to write a query that shows the CPU usage (%) for one specific process on a Windows server.
After digging into the windows_exporter metrics, I found that the metric process.windows_process_cpu_time_total should be the one.
I tried several…

Sphinx
- 141
- 1
- 3
- 8
1
vote
2 answers
Breaking down one prometheus.yml file?
I am using Prometheus for our monitoring and I have a lot of configs (our prometheus.yml main config file is 8000+ lines long).
I would like to divide this into logical groupings so that it becomes much more readable. I learned that Prometheus…

PRS
- 11
- 2
1
vote
2 answers
Missing metrics for "kubelet_volume_*" in Prometheus
I set up the latest https://github.com/coreos/kube-prometheus/ in an AWS EKS cluster in which I'm using the Amazon EBS CSI driver for persistent volume claims, but I don't see any "kubelet_volume_*" metrics available in Prometheus.
According to…

Catalin
- 21
- 5
1
vote
0 answers
kube-state-metrics doesn't show hardware utilization
I installed these YAML files:
https://github.com/kubernetes/kube-state-metrics/tree/master/examples/standard
I can see per-pod CPU utilization only for system pods such as: calico-node, coredns,…

user227685
- 11
- 3
1
vote
1 answer
GitLab Prometheus error after upgrading with apt-get upgrade
After upgrading my GitLab from 10.9.2 to version 10.10 on Ubuntu 18.04, the apt-get upgrade process could not finish successfully. It ended with the following message:
Ensuring Prometheus is updated:Traceback (most recent call last):
…

eragon-2006
- 127
- 3
1
vote
0 answers
Monitoring SLA/SLO/SLI using Prometheus
I have done a lot of research on monitoring SLI metrics with Prometheus, but I have only found how to monitor a cluster using Kubernetes. I'm hoping to find an answer here for simple monitoring.
I also want to know which is better for that using the…

Hasagiii
- 111
- 1
1
vote
1 answer
Prometheus, group_left, and "on" vs "ignoring"
In Issue #2204, one of the Prometheus developers says:
...in principle you should be favouring ignoring over on to produce generic shareable rules...
I'm confused how the use of ignoring would lead to more generic rules. For example, consider a…

larsks
- 43,623
- 14
- 121
- 180
1
vote
1 answer
Filtering interface results from prometheus snmp-exporter by interface name
I'm collecting interface statistics from some switches using the following configuration:
modules:
  if_mib:
    walk:
      - ifTable
    auth:
      community: monitor
The data returned from the exporter is indexed by ifIndex. That is, a query…

larsks
- 43,623
- 14
- 121
- 180
0
votes
0 answers
Some Prometheus alerts mislabelled
We've been using Prometheus with fairly solid success so far. However, we've recently noticed that one of the sets of alerts we've set up ends up mislabelled: It shows up as a different environment than it is. This happens with both environments…

Keilaron
- 111
- 2
0
votes
1 answer
Monitoring EKS Kubernetes LoadBalancer service Type
I have created a few services in Kubernetes with type: LoadBalancer.
Platform: EKS.
Is there a way to get the number of 4xx or 5xx errors from this LoadBalancer? I have tried the following:
Prometheus - Does not seem to be any metric collected for…
0
votes
1 answer
OpenShift 3.11: Prometheus operator update
According to the docs, the Prometheus operator on an OpenShift 3.11 cluster is self-upgrading. However, I upgraded the cluster to 3.11.141 yesterday, but the operator is still stuck on 3.11.117. There are Prometheus images for 3.11.141 available,…

simonszu
- 373
- 7
- 14
0
votes
1 answer
Run apt-get upgrade -s in container and expect same result as when running on the host
I would like to prepare the environment for a Docker container such that when apt-get upgrade -s runs inside the container its result is the same as if the command were run outside on the host. This is in the context of and in preparation for adding…

rookie099
- 375
- 3
- 14
0
votes
0 answers
Analysing latency for a single request
I have the following (very simplified) network:
(ingress) -> DMZ (nginx) -> HA Proxy -> Reverse proxy (nginx) -> Application Server (tomcat)
All the layers send information to Prometheus and then we use Grafana to monitor them.
We had some latency…

ehi84636
- 1