Questions tagged [prometheus]

The Prometheus monitoring system, including the server, alertmanager, push gateway, exporters, client libraries and other components.

Prometheus is a go-based open-source monitoring system inspired by Google's approach to monitoring.

Prometheus itself is a time-series storage server that periodically pulls metrics from monitored entities. These metrics can then be queried and alerted-upon using simple query and alert languages.

Prometheus comes with a suite of supporting tools like alertmanager, exporters and client libraries, designed to work together in providing a highly scalable and complete monitoring solution. Metric visualizations is usually done with Grafana, which seamlessly connects with Prometheus.

Prometheus's official site.

Read more about Prometheus's approach here:

  • This article explains Prometheus metric gathering approach for beginners.

  • This chapter from the online book "Site Reliability Engineering" describes Google's monitoring system Borgmon. Prometheus is mentioned in this chapter and was designed with Google's approach in mind.

6591 questions
2
votes
3 answers

How to export Grafana datasource with basic auth details and import it again?

I am not able to export data sources in Grafana with basic auth details. As I searched on internet, people are downloading datasource JSON by URL /api/datasources and uploading same way with another API. I tried doing that. Everything works…
Moksh
  • 185
  • 1
  • 6
  • 16
2
votes
1 answer

Dynamically update prometheus scrape config based on pod labels

I'm trying to enhance my monitoring and want to expand the amount of metrics pulled into Prometheus from our Kube estate. We already have a stand alone Prom implementation which has a hard coded config file monitoring some bare metal servers, and…
damian
  • 36
  • 1
  • 3
2
votes
1 answer

Format PromQL values

i use this query rule for alert: - alert: HostOutOfMemory expr: (1 - node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes) * 100 > 90 for: 5m labels: severity: warning annotations: summary: "{{ $labels.name…
Bình Bùi
  • 67
  • 8
2
votes
2 answers

Is io.prometheus.client.Gauge can be used for thread safety counters

I am using io.prometheus.client.Gauge to implement a thread safety counter to calculate the number of events processed in a time frame. Now there are several threads that are processing the events. And all can update the counter when it finishes the…
2
votes
1 answer

How to push Kuberntes metrics in Prometheus instead of pull?

I have configured Prometheus in our infra for monitoring. In our infra, we have an EKS clusters running. I have to collect EKS metrics in Prometheus. By default, Prometheus work on a pull-based mechanism. Here I have a question on how to collect…
Vishal Patil
  • 77
  • 1
  • 1
  • 7
2
votes
3 answers

Prometheus WAL Keeps on Growing Indefinitely

Currently, I have a Prometheus v2.20.0 running, and it has an issue that the WAL keeps on growing indefinitely and consuming disk space. Actually the disk space is not the issue now, but that the WAL folder is not getting cleaned, so if any time…
Wazery
  • 15,394
  • 19
  • 63
  • 95
2
votes
1 answer

Azure Kubernetes - Prometheus Automatic Service Discovery?

I have installed and configured Prometheus on my Azure Kubernetes as per the following configuration prometheus_values.yaml ====================== global: scrape_interval: 1m scrape_timeout: 10s evaluation_interval: 1m # Forcing Kubelet…
One Developer
  • 99
  • 5
  • 43
  • 103
2
votes
0 answers

Promql when a PromQL + duration would be firing

I was wondering if it's possible to acheive with PromQL: Given an promql, and duration produce a graph of when it was active for the duration. This is so you could retroactively see when a promQl would have produced alerts. I thought doing something…
aclowkay
  • 3,577
  • 5
  • 35
  • 66
2
votes
1 answer

How to configure Prometheus to get metrics of a service with 2 replicas through exposed route?

We have a service with 2 replicas deployed in OpenShift. The service uses DropWizard metrics and exposes them through spring-actuator on /actuator/prometheus endpoint. There is an exposed route for actuator's port 8082. Prometheus is configured to…
2
votes
1 answer

What does remote read means in Prometheus?

We would like to use VictoriaMetric as storage DB for time-series data that is collected from Prometheus. But it's said in the doc that this integration is only supported in write mode and not the read one. Does it mean that prometheus will only be…
Izbassar Tolegen
  • 1,990
  • 2
  • 20
  • 37
2
votes
1 answer

Using alert manager to send data to external API

Is there a way to use the alert manager to send an alert to external API, I mean instead using excage server or slack etc, I will provide an endpoint like https://mypullendpoint.host.com and the alert manager will send the alert to this URL ?
NSS
  • 755
  • 1
  • 11
  • 30
2
votes
1 answer

system_cpu_usage value much less than expected

I am monitoring a spring boot application in promethus with metrics generated by micrometer. For CPU usage, there is metrics 'system_cpu_usage'. I observe that its value is mostly under 1. Is it expected? Same application when monitored in VisualVM,…
Mandroid
  • 6,200
  • 12
  • 64
  • 134
2
votes
1 answer

and operator on two metrics with one same label - promql

in prometheus, I have two metrics like this: metricA: metrica{label1="value1",aa="aa",bb="bb"} metricB: metricb{label1="value1",cc="cc",dd="dd"} I want to and these two metrics, but each time it shows No data. How can I and these two metrics with…
Hedeesa
  • 148
  • 1
  • 12
2
votes
0 answers

using blackbox exporter with auth

Assume that I've API which I need to probe via blackboxexporter, is there a way to use Any auth with the call, lets assume I call to an URL which require user and password or some OAuth , is it possible? if yes please provide a reference and if not…
NSS
  • 755
  • 1
  • 11
  • 30
2
votes
0 answers

How can I send CSV rows to Prometheus Pushgateway?

I have a simple csv file with date columns and numbers like this: date | count 2020-09-01 | 3 2020-09-02 | 4 2020-09-03 | 2 ... There is a python script that reads the csv row by row. What I would like to do, using pushgateway, pushgateway…
monstereo
  • 830
  • 11
  • 31