Questions tagged [alerting]
38 questions
1
vote
2 answers
What are some good patterns for cleaning up noisy logging alerts?
In addition to traditional logging from applications going into e.g. Elasticsearch, an organisation may have an alerting system "Sentry" that receives log messages/exception events sent by applications over HTTP, and notifies developers of potential…

Will Sheppard
- 131
- 3
1
vote
0 answers
Is it possible for a Prometheus alert to depend on the status of another alert?
I want to define an ALERT whose IF expression depends on the status of another alert. Is that possible at all? I know about alert inhibition with Alertmanager, but in some cases I need certain alerts not to fire at all.
Prometheus…

Shtlzut
- 290
- 3
- 9
1
vote
1 answer
Supermicro SYS-5019S-M and IPMI temperature threshold
I have a Supermicro SYS-5019S-M server configured to send an SMTP alert (an email) when IPMI detects that temperatures exceed some defined threshold.
However, it seems that, at least for some components, the threshold is way too high: for example,…

shodanshok
- 47,711
- 7
- 111
- 180
1
vote
0 answers
Sending alert emails from a bunch of servers to G Suite
We have an ever-changing infrastructure (DigitalOcean droplets and AWS EC2 instances that are created and deleted). We use Ansible to bootstrap VMs and to install netdata on each VM. The goal is to have emails coming from each machine with the from…

Elisiário Couto
- 11
- 1
1
vote
1 answer
Methods for monitoring and alerting data issues when dealing with complex dependencies
In this hypothetical example, we have a data flow across multiple engineering teams in an ecommerce company. These teams deliver services, produce data, and consume data at different points of the flow.
For example:
'Team Orders' maintains the…

John Honan
- 11
- 1
1
vote
0 answers
Is there a way to set up alerts for a failed DFS topology connection?
I have a large DFS structure set up between multiple remote sites and multiple hub servers in a full mesh topology. Each remote site has its own namespace or namespaces and replication group. I have an automated PowerShell script that collects the…

Dave H.
- 125
- 1
- 2
- 10
1
vote
1 answer
SCCM 2012: Is it possible to create an alert for when a site or component logs a critical message?
I'm investigating some issues on my SCCM2012 single site installation, and am heavily relying on the 'site status' and 'component status' to tell me what's wrong. I've configured the site server to be able to send emails, and would like to be able…

growse
- 8,020
- 13
- 74
- 115
1
vote
1 answer
Filter Zenoss Alerting Rules for certain Data Source only (or certain Data sources active only at a given time)
We're working to monitor a couple of services with Zenoss over SNMP and have implemented the SNMP part.
We have configured data sources (to various OIDs) and thresholds for these values and hosts to monitor.
For alerting rules, some of the data…

Touko
- 241
- 3
- 7
1
vote
2 answers
Freeform monitoring tools
Can you suggest some sort of freeform monitoring tool?
I have been trying Graphite, which allows you to add any metrics you like, and graph them on the spot. You don't have to define a new host, service, whatever. If all you want is to graph…

Mathieu Longtin
- 199
- 3
- 13
1
vote
0 answers
Using CloudWatch to monitor concurrent users
I'm looking for a good way to monitor concurrent users on my server. The server has a WebSocket connection, so it's easy to determine whether a user is connected or not. After looking into CloudWatch for a bit, I have a plan:
Every minute, each…

Waneck
- 145
- 4
1
vote
1 answer
Email alerts about RAID hard disk errors from HPE ProLiant server with iLO
I am looking for a way to get email alerts about possible RAID hard disk failures from the HPE ProLiant server running the free version of ESXi.
Has anyone used iLO successfully for this with the advanced license or even with the standard version some…

jm90
- 11
- 2
1
vote
1 answer
Can Performance Co-Pilot serve as a standalone NMS solution?
I was wondering whether Performance Co-Pilot (PCP) can be set up so that it has all the features of an NMS application, e.g. Nagios or Zabbix. I was using it to extract metrics from a host and plot values on a dashboard, but I would like to know how…

mac13k
- 133
- 7
0
votes
1 answer
Creating custom alerting policies using data streamed to the cloud from a sensor
I am looking for information and help on how to use data acquired from a sensor device and uploaded to my Google Cloud project as a metric for an alerting policy on Stackdriver Monitoring.
Currently my team and I are using BigQuery to query the…
0
votes
1 answer
Set Nagios to alert if a threshold is exceeded over time?
I have a server with a memory value that I want to monitor. If the value is over a certain amount over the course of an hour, I'd like to have an alert sent, but until then I'd like to have it ignore the plugin alert.
Is it possible to have Nagios…

Bart Silverstrim
- 31,172
- 9
- 67
- 87
0
votes
2 answers
Need an alerting system if my cloning script fails
Possible Duplicate:
How to repeatedly call rsync until files are successfully transferred
I've configured a nightly rsync to mirror one server to a standby offsite backup server. The total datastore on the primary is 1.5TB. In the course of…

rahum
- 121
- 1