Questions tagged [prometheus-alertmanager]

The Alertmanager handles alerts sent by client applications such as the Prometheus server. It takes care of deduplicating, grouping, and routing them to the correct receiver integration such as email, PagerDuty, or OpsGenie. It also takes care of silencing and inhibition of alerts.

The Alertmanager handles alerts sent by client applications such as the Prometheus server. It takes care of deduplicating, grouping, and routing them to the correct receiver integration such as email, PagerDuty, or OpsGenie. It also takes care of silencing and inhibition of alerts.

Source code is available here: https://github.com/prometheus/alertmanager

753 questions
0
votes
1 answer

Verify certain metric on different instances in Prometheus alert rules

I have multiple targets in prometheus which generate multiple metrics. I need to verify the values generated by a certain metric on multiple instances and trigger an alert incase the values are not equal to each other. metric_name: …
0
votes
1 answer

Adding custom webhook configuration in alertmanager

I have a custom webhook URL and I need to configure the same in alertmanager for getting alert notification. But my configuration is not working as expected. getting following error in alertmanager, level=error ts=2019-04-22T09:31:46.038681545Z…
0
votes
2 answers

Alternative to configuration files in Prometheus?

I want to implement a monitoring system in a small organization. I have experience with Zabbix, but I'm considering Prometheus as it seems there's a big community behind it, and it's pretty much the standard today (please correct me if I'm…
roeezab
  • 19
  • 3
0
votes
0 answers

Stacklight Prometheus Function not Working

I am trying to get Prometheus Alerting UI to report in on two metrics procstat_memory_vms{process_name="ovs-vswitchd"} / mem_total{host=~"cmp.*"} From the above I get the good ol' "No data points found", however I need to try and find out how to…
R. Barrett
  • 685
  • 11
  • 34
0
votes
1 answer

Prometheus not started after adding rule file into it

I added simple rule to get free disk space from node exporter, after adding rule file to prometheus.yml file, prometheus is not starting, could be some prob with rule file. # my global config global: scrape_interval: 60s # Set the scrape…
0
votes
1 answer

Prometheus counting issue

I am trying to count how many value == 0 in past one hour in prometheus and try to create the alert rules. I come up with the rules count_over_time(instance==0 [1h])/count_over_time(instance) I got error shows I have to follow Prometheus…
qing zhang
  • 125
  • 1
  • 4
  • 13
0
votes
1 answer

How to count fired alerts in Prometheus alerts and append it to alerting labels?

For example, I have 2 hosts, alert will be fired when CPU > 90%. I can open Prometheus UI & Alert Manager UI, and I see Alerts firing for the 2 hosts. I use webhook as receiver, I want to get the alert count for CPU>90% to each hosts, how can I…
xiaodong
  • 1
  • 3
0
votes
1 answer

How to add smtp settings to prometheus-operator using helm chart?

I am new to the Kubernetes and especially using helm. I installed the charts and it works fine with default values. I want to the add smtp server setting in the values.yml file for the chart. I am confused on how to inject the values while…
0
votes
1 answer

Custom alert rule for PODS and Clusters

I am trying to set up some alerts in Prometheus. I am able to create the alerts for nodes for the following category (network utilization, CPU usage, memory usage). I am stuck with the pods. Which metrics should I use for PODs/Containers/clusters…
gamechanger17
  • 4,025
  • 6
  • 22
  • 38
0
votes
1 answer

Error ICP 3.1.1 Grafana Prometheus Kubernetes Status Pods Always 'Init'

I Was Complete Installing ICP with VA. Using 1 Master, 1 Proxy, 1 Management, 1 VA, and 3 Workers with GlusterFS Inside. This List Kubernetes Pods Not Running Storage - PersistentVolume GlusterFS on ICP This Describe Kubernetes Pods Error…
0
votes
1 answer

Alert triggered node name is not specifying in the alert being received when more than one is triggered

I have Configured an alert for node memory usage in prometheus. My alert template is as follows: - alert: NodeMemory Usage(development) annotations: description: '{{$labels.instance}} Memory usage is critical (current value is: {{…
0
votes
2 answers

What is the Alert Manager alert rule for notifying about Docker container restarting?

I'm trying to add an alert rule that would allow Alert Manager to notify me whenever a particular docker container restarts. Does such rule exist? Any help would be greatly appreciated!
Matthew
  • 411
  • 6
  • 22
0
votes
1 answer

How to disable the annotaion description from the alerts in Prometheus

I have a running Prometheus and configured alerts by alertmanager to my slack. and I am able to get the triggered alerts with its Description. For that I have added the following in my config file. Summary: '{{ range .Alerts }}{{…
manu thankachan
  • 433
  • 3
  • 9
  • 19
0
votes
1 answer

alertmanger cannot load configuration file

I am getting the following error message on starting Alertmanager: /alertmanager/config.yml err="yaml: unmarshal errors:\n line 48: cannot unmarshal !!map into []*config.WebhookConfig" The only difference to a working config file is adding the…
Christian Schyma
  • 190
  • 1
  • 2
  • 16
0
votes
1 answer

Dynamic label values in Promethues alerting rules

I am a bit of a novice with the monitoring world. Here is my question. I want to fire an alert only for a set of assets based on asset-id. My metrics looks like the below. test_value{asset_id="123"} 0.215 My alert manager rules looks like the…
TheMonkWhoSoldHisCode
  • 2,182
  • 3
  • 26
  • 40