Questions tagged [nagios]

Nagios is a monitoring system that enables organizations to identify and resolve IT infrastructure problems before they affect critical business processes.

Nagios watches hosts and services that you specify, alerting you when things go bad and when they get better. Some of its many features include monitoring of network services (SMTP, POP3, HTTP, NNTP, ICMP, etc.), monitoring of host resources (processor load, disk usage, etc.), and contact notifications when service or host problems occur and get resolved (via email, pager, or user-defined method).

Nagios is extremely customizable. It checks the status of hosts and services by running standalone programs that communicate with the Nagios master via their standard output and return codes. Adding a new status check therefore takes little more than writing the standalone program and configuring Nagios to call it.
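As a rough illustration of such a standalone program, here is a minimal sketch of a check in Python. It assumes the conventional plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and a single line of status text on standard output, optionally followed by performance data after a `|`. The `DISK` label, the 80/90 thresholds, and the function names are hypothetical choices made for this example, not part of Nagios itself.

```python
#!/usr/bin/env python3
"""Hypothetical Nagios-style check: disk usage of one filesystem (a sketch)."""
import shutil

# Exit codes that Nagios conventionally maps to service states.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def classify(used_pct, warn, crit):
    """Map a usage percentage to a state code using two thresholds."""
    if used_pct >= crit:
        return CRITICAL
    if used_pct >= warn:
        return WARNING
    return OK


def format_status(code, used_pct, warn, crit):
    """Build the one-line status text; the part after '|' is performance data."""
    return (f"DISK {LABELS[code]} - {used_pct:.1f}% used"
            f" | used_pct={used_pct:.1f}%;{warn};{crit}")


def main(path="/", warn=80, crit=90):
    """Run the check, print the status line, and return the exit code."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        print(f"DISK UNKNOWN - cannot read {path}: {exc}")
        return UNKNOWN
    if usage.total == 0:
        print(f"DISK UNKNOWN - {path} reports zero total size")
        return UNKNOWN
    used_pct = 100.0 * usage.used / usage.total
    code = classify(used_pct, warn, crit)
    print(format_status(code, used_pct, warn, crit))
    return code
```

To use this as an actual check, the script would need to end by passing the result of `main()` to `sys.exit()` so that the state reaches Nagios as the process return code. The thresholds are hard-coded here only to keep the sketch short; a real check would normally take them as command-line arguments.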

1294 questions
5
votes
3 answers

nagios nrpe unable to read output

I'm trying to use check_nrpe to check on a remote server, but it's not working and I can't figure out what I'm missing... # /usr/lib/nagios/plugins/check_nrpe -H XXX -c check_load -a 6,5,4 8,7,6 NRPE: Unable to read output # remote's server…
alexus
  • 13,112
  • 32
  • 117
  • 174
5
votes
1 answer

Notify when some grouped services fail in Nagios

I have three DNS services, each on different hosts configured in Nagios. Each service notifies me whenever it is failing (DNS on host A down). Is it possible to configure Nagios so that I'm only getting a (customized) notification when all three DNS…
user211195
  • 63
  • 3
5
votes
3 answers

Nagios: Disable notifications from command line on Windows/Linux

Our routine maintenance requires services to be brought down. The problem is that these services have Nagios checks, and we either have to go to the web interface and disable the check or get the emails. Is there a way from command line on a Windows…
andyhky
  • 2,732
  • 2
  • 25
  • 26
5
votes
2 answers

What's the best method to monitor an OpenVPN server?

I have an OpenVPN server machine which resides on an isolated network; a NAT rule in the company's firewall redirects all traffic on port 1194 (tcp) from the public IP I chose to the internal address of the OpenVPN server machine. I'd like to…
Itai Ganot
  • 10,644
  • 29
  • 93
  • 146
5
votes
6 answers

nrpe: Host is not allowed to talk to us

I installed nrpe on new servers. Nagios is already running and checking other servers (which I didn't install). I've got several error lines in my syslog file: Oct 31 15:17:01 myservername nrpe[41848]: Host nagios_ip is not allowed to talk to us! My…
5
votes
5 answers

Nagios: NRPE: Unable to read output, Can't find the reason, can you?

I have a Nagios server and a monitored server. On the monitored server: [root@Monitored ~]# netstat -an |grep :5666 tcp 0 0 0.0.0.0:5666 0.0.0.0:* LISTEN [root@Monitored ~]# locate…
Itai Ganot
  • 10,644
  • 29
  • 93
  • 146
5
votes
1 answer

Simple monitoring solution for a single server

I have a little experience with server monitoring products from my previous job (mainly Nagios), but never had the chance to set up a monitoring scheme from scratch. Every time there was a number of machines that needed to be monitored, running the…
dlyk1988
  • 1,674
  • 4
  • 24
  • 36
5
votes
2 answers

Nagios: Service dependency on same host for an entire hostgroup

I have a Nagios system with a large number of hosts categorised in different hostgroups. Each of these hosts has two services monitored on it, one of which is intended to be dependent on the other - if service1 is WARNING or CRITICAL, service2 on…
jreid9001
  • 161
  • 2
  • 6
5
votes
3 answers

Nagios - Define a service for all the members of a group *except* some hosts?

We're using nagios to check our ssh server is running. We have the check_ssh group defined for all members of a certain group. We have other services defined for that group as well. Everything is fine. However for 2 of the members of this group,…
Amandasaurus
  • 31,471
  • 65
  • 192
  • 253
5
votes
3 answers

Check if puppet agent is up to date

I would like to check with my nagios monitoring whether every node uses the current catalog version provided by the puppetmaster. In my situation, there are: puppetmaster host1 host2 hostX I would like to create an nrpe plugin on host1, host2, hostX to: Check…
Tomasz Olszewski
  • 898
  • 1
  • 9
  • 20
5
votes
2 answers

Monitoring tool which does not average data over time

I am looking for a monitoring system like Cacti which doesn't lose data over time; all the tools I have found use rrd files, which average the data as time goes by. I would like to be able to go back to (for example) April 1 at 12:00 and see…
Epaphus
  • 1,021
  • 6
  • 8
5
votes
2 answers

The best nagios plugin for SMART?

I've tried to find the best Nagios plugin for SMART monitoring. There are some, but they only monitor temperature, while smartctl can report much more data. Do you know of any better plugins that cover all the data from SMART?
Rafał Kamiński
  • 187
  • 2
  • 5
  • 15
5
votes
1 answer

In Icinga (Nagios), how do I configure hosts with multiple IPs?

I'm setting up Icinga (Nagios fork) and I have some machines with multiple interfaces. Some services are only listening on one of them, and to check them correctly I'd like to know if it's possible to have multiple IP addresses configured for a single…
gertvdijk
  • 3,504
  • 4
  • 30
  • 46
5
votes
1 answer

Nagios best practices for setting up hosts/services?

I am having a hard time trying to get my head around some of the more basic concepts of Nagios that I thought would be straightforward. Basically, as well as the standard local infrastructure, I need to monitor a bunch of 3rd party web services,…
William Hilsum
  • 3,536
  • 6
  • 29
  • 39
5
votes
3 answers

Nagios (Return code of 141 is out of bounds) on random services

We have had Nagios running on one of our servers without any problems for a while, but lately we get (Return code of 141 is out of bounds). The load on the server rose because we went online with our service, but it's still not really high (load…
PortKnox.net
  • 51
  • 1
  • 3