Questions tagged [prometheus]

The Prometheus monitoring system, including the server, Alertmanager, Pushgateway, exporters, client libraries and other components.

Prometheus is a Go-based open-source monitoring system inspired by Google's approach to monitoring.

Prometheus itself is a time-series storage server that periodically pulls metrics from monitored entities. These metrics can then be queried and alerted on using simple query and alert languages.
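To make the pull model concrete, here is a minimal sketch of what a monitored entity might expose for Prometheus to scrape. It uses only the Python standard library rather than an official client library; the helper name render_metrics, the handler class, and the /metrics path served this way are assumptions for illustration, and the sample metric name and value are taken from one of the questions listed below.

```python
from http.server import BaseHTTPRequestHandler

# Hypothetical in-process counters, keyed by (metric name, label pairs).
# The sample series is illustrative only.
COUNTERS = {
    ("requests_processed_total", (("service", "news"),)): 1097,
}


def render_metrics(counters):
    """Render counters in the Prometheus text exposition format."""
    lines = []
    seen = set()
    for (name, labels), value in sorted(counters.items()):
        if name not in seen:
            # Emit one TYPE line per metric name, before its first sample.
            lines.append(f"# TYPE {name} counter")
            seen.add(name)
        label_str = ",".join(f'{k}="{v}"' for k, v in labels)
        series = f"{name}{{{label_str}}}" if label_str else name
        lines.append(f"{series} {value}")
    return "\n".join(lines) + "\n"


class MetricsHandler(BaseHTTPRequestHandler):
    """Answers scrape requests; the server pulls, the target only responds."""

    def do_GET(self):
        if self.path != "/metrics":
            self.send_error(404)
            return
        body = render_metrics(COUNTERS).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)
```

The handler could be served with http.server.HTTPServer and listed as a target under scrape_configs. In practice an official client library is the more robust choice, since it also handles escaping, HELP lines and the other metric types.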

Prometheus comes with a suite of supporting tools such as Alertmanager, exporters and client libraries, designed to work together to provide a highly scalable and complete monitoring solution. Metric visualization is usually done with Grafana, which supports Prometheus as a data source.

Prometheus's official site.

Read more about Prometheus's approach here:

  • This article explains Prometheus's metric-gathering approach for beginners.

  • This chapter from the online book "Site Reliability Engineering" describes Google's monitoring system Borgmon. Prometheus is mentioned in this chapter and was designed with Google's approach in mind.

6591 questions
2
votes
3 answers

Prometheus service failing on an invalid argument error

Prometheus is failing on this error: Aug 04 11:27:36 rockpi prometheus[12548]: level=info ts=2020-08-04T11:27:36.098Z caller=main.go:549 msg="Scrape discovery manager stopped" Aug 04 11:27:36 rockpi prometheus[12548]: level=info…
Ciasto piekarz
  • 7,853
  • 18
  • 101
  • 197
2
votes
1 answer

Node level metrics are not available in Prometheus (installed with Istio 1.6)

I installed Istio 1.6, using istioctl install --set profile=demo. But I could only see a couple of metrics related to Kubernetes nodes. I can see configuration related to Kubernetes nodes: kubernetes_sd_configs: - role: node relabel_configs: - action:…
Pragmatic
  • 3,093
  • 4
  • 33
  • 62
2
votes
1 answer

Using rate instead of sum/count in micrometer

At this link, it states the reason why rate must be used for micrometer metrics. Representing a counter without rate normalization over some time window is rarely useful, as the representation is a function of both the rapidity with which the…
Mandroid
  • 6,200
  • 12
  • 64
  • 134
2
votes
2 answers

How to drop multiple metrics in Prometheus

I want to reduce the number of metrics that are scraped under Kube-state-metrics. When I use the following configuration: metric_relabel_configs: - source_labels: [__name__] separator: ; regex:…
Tomer Leibovich
  • 467
  • 1
  • 5
  • 13
2
votes
3 answers

Prometheus: filter query based on another metric

Say I have two metrics in prometheus, both counters: requests_processed_total requests_failed_total They both have a matching service label. Example: requests_processed_total{service="news"} 1097 requests_processed_total{service="store"}…
aspyct
  • 3,625
  • 7
  • 36
  • 61
2
votes
1 answer

GKE 1.16.9 Prometheus, grafana per pod details not working?

I have a GKE cluster running kubernetes 1.16.9. I'm trying to get a monitoring system working using Prometheus, and Grafana. The dashboard I'm using is the standard "Kubernetes Cluster Monitoring" https://grafana.com/grafana/dashboards/315 When I…
clse
  • 648
  • 1
  • 7
  • 10
2
votes
2 answers

Custom metric is not exposed in prometheus

I'm writing a Go application and I need to log some custom metrics using Prometheus. I have a local instance of Prometheus and this is my prometheus.yml file: scrape_configs: - job_name: myapp scrape_interval: 10s static_configs: - targets: …
mrasoolmirza
  • 787
  • 1
  • 6
  • 22
2
votes
3 answers

Inhibition with exceptions (negative matchers)

I have a problem with inhibition rules because of exceptions to inhibition rules. For example, we have 50 teams, and every team handles their alerts. When a data center goes down (e.g. because of network problems), we want to inhibit all alerts…
Denis
  • 83
  • 2
  • 14
2
votes
2 answers

How do I add sidecar containers on the prometheus stable Helm Charts values.yaml file?

I'm currently using the prometheus (NOT prometheus-operator, due to historical reasons) Helm charts in our on-premise Kubernetes cluster, and I'd like to deploy the Thanos sidecar. How do I add the sidecar containers in the values.yaml file? Are…
jrlonan
  • 45
  • 1
  • 5
2
votes
1 answer

Prometheus configuration for Spring MVC 5

In my company's web application we are using Spring 5, but the application is written in Spring MVC and we are not gonna change it anytime soon. I need to connect the application to Prometheus but I could not find any webpage explaining using…
yyy
  • 437
  • 2
  • 9
  • 23
2
votes
2 answers

How many label values can Prometheus manage?

I have hundreds of customers and I have several metrics I'd like to keep for each one of them. Let's say I have a metric orders_count. I have two possibilities: use a label - i.e. order_count{customer=customer_name} have a different counter for…
IsaacLevon
  • 2,260
  • 4
  • 41
  • 83
2
votes
1 answer

Prometheus error " server returned HTTP status 401 Unauthorized" for remote_write and remote_read

I have installed Prometheus in Kubernetes and I am trying to use remote_write and remote_read option to InfluxDB. I have created a user in DB with read and write privilege also. But I am receiving an error like…
Jayanth
  • 485
  • 3
  • 9
  • 17
2
votes
1 answer

How to use promql group by without using aggregate functions in Grafana

I am trying to create a query that groups the data by "Api" field and selects a value field by using prometheus and grafana. My sample query (promql) is max (application_apidbacesscount_total) by (Api) [30m:1m]. This works for getting max value with…
umutcanturan
  • 78
  • 1
  • 8
2
votes
1 answer

Alerting in case of exceptions

I developed a few services and I would like to be able to react fast in case of a bug or failure. These services expose metrics to prometheus and I get alerts through slack. Logs are available in kibana. I can see bugs and exceptions in logs through…
sziolkow
  • 173
  • 1
  • 12
2
votes
1 answer

Network Transfer Speeds (in MB/s) using Prometheus & Grafana

I am trying to create a graph in Grafana to display the machine's network transfer speeds (in MB/s or similar) by using the following Prometheus query rate(node_network_receive_bytes_total[1m]) * 8 However, this is giving me a very flat graph and…
Nyxynyx
  • 61,411
  • 155
  • 482
  • 830