From: John K. <jk...@cr...> - 2007-08-30 18:43:35
I am just glad that we don't have to send up/down messages to our CEO.... He would think we can't keep anything running.... (or paper/toner loaded in a printer) :)

-----Original Message-----
From: nag...@li... [mailto:nag...@li...] On Behalf Of Dennis Huenseler
Sent: Thursday, August 30, 2007 1:22 PM
To: Patrick M.; nag...@li...
Subject: Re: [Nagios-users] Critical Plugin Timed Out

Hi Patrick,

if you think there is a problem with the machine running nagios i would monitor localhost too :-) Or another way to get information about the performance of the Nagios-host is to get load and memory values through snmp and create load and memusage graphs with mrtg or cacti for example. That's the way I take a look @ the performance of my linux hosts

Kind regards,
Dennis Hünseler

-----Original Message-----
From: nag...@li... [mailto:nag...@li...] On Behalf Of Patrick M.
Sent: Thursday, August 30, 2007 8:19 PM
To: nag...@li...
Subject: [Nagios-users] Critical Plugin Timed Out

Hi all,

I've been running Nagios 2.6 for about 6 months now, and every now and then we get critical pages about a machine being down, or at least Nagios can't connect to it. It causes the CEO to freak out and believe something is up with our network.

To me, it seems like the box is getting stressed out during the tests and is causing the plugins to time out.

Here's some of the alerts from this morning:

#######################################
[08-30-2007 09:24:10] HOST ALERT: tu.xyz.com;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:24:00] SERVICE ALERT: seismo.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:24:00] SERVICE ALERT: p.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - popen timeout received, but no child process
[08-30-2007 09:24:00] SERVICE ALERT: ap.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - popen timeout received, but no child process
[08-30-2007 09:24:00] SERVICE ALERT: cry.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:24:00] SERVICE ALERT: wns.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:24:00] SERVICE ALERT: qke.xyz.com;/work;CRITICAL;SOFT;1;CHECK_NRPE: Socket timeout after 10 seconds.
[08-30-2007 09:24:00] SERVICE ALERT: hl-hayes-br.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - popen timeout received, but no child process
[08-30-2007 09:24:00] SERVICE ALERT: pl.xyz.com;SMTP;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[08-30-2007 09:24:00] SERVICE ALERT: qke.xyz.com;/home2;CRITICAL;SOFT;1;CHECK_NRPE: Socket timeout after 10 seconds.
[08-30-2007 09:24:00] SERVICE ALERT: qke.xyz.com;/;CRITICAL;SOFT;1;CHECK_NRPE: Socket timeout after 10 seconds.
[08-30-2007 09:24:00] SERVICE ALERT: o.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - popen timeout received, but no child process
[08-30-2007 09:24:00] SERVICE ALERT: o.xyz.com;SSH;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[08-30-2007 09:24:00] SERVICE ALERT: o.xyz.com;DNS;CRITICAL;SOFT;1;CRITICAL - Plugin timed out while executing system call
[08-30-2007 09:23:41] SERVICE ALERT: sgull.xyz.com;PING;CRITICAL;HARD;3;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: hister.xyz.com;PING;CRITICAL;HARD;3;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: hs1.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: nbridged.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: h1.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: dfied-1.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: pes.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: ruits.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: nge-routed.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: eng-1.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: pe.xyz.com;FTP;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: g1.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: gb1.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: jith.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: pule.xyz.com;PING;WARNING;SOFT;1;PING WARNING - Packet loss = 44%, RTA = 3.64 ms
[08-30-2007 09:23:40] SERVICE ALERT: gd2.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: hx1.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: g2.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] SERVICE ALERT: eo.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:40] HOST ALERT: e.xyz.com;UP;SOFT;3;PING OK - Packet loss = 0%, RTA = 4.58 ms
[08-30-2007 09:23:40] HOST ALERT: e.xyz.com;DOWN;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:20] HOST ALERT: e.xyz.com;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:10] SERVICE ALERT: cr.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:02] SERVICE ALERT: wm.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:23:00] SERVICE ALERT: t.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[08-30-2007 09:22:52] SERVICE ALERT: smo.xyz.com;PING;CRITICAL;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
#######################################

The machine is a p4 2.4 ghz with 1gb ram.
I'm not sure how to troubleshoot this - any ideas? What can I provide you folks in order to help me out?

Thanks in advance.

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Nagios-users mailing list
Nag...@li...
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
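
One way to check Patrick's suspicion that the Nagios box itself is the bottleneck is to count how many timeout alerts land in the same minute: dozens of unrelated hosts timing out together points at the monitoring host rather than the network. The following is only a rough sketch in Python, assuming the alert lines have the `[timestamp] HOST|SERVICE ALERT: field;field;...` shape quoted in this thread. The function names (`parse_alert`, `is_timeout`, `timeouts_per_minute`) and the `SAMPLE` list are made up for illustration and are not part of Nagios.

```python
import re
from collections import Counter

# Assumed line shape, taken from the alerts quoted in this thread:
#   [MM-DD-YYYY HH:MM:SS] SERVICE ALERT: host;service;state;type;attempt;output
#   [MM-DD-YYYY HH:MM:SS] HOST ALERT: host;state;type;attempt;output
ALERT_RE = re.compile(
    r"^\[(?P<ts>[^\]]+)\] (?P<kind>HOST|SERVICE) ALERT: (?P<body>.+)$"
)


def parse_alert(line):
    """Return a dict describing one alert line, or None if it does not match."""
    match = ALERT_RE.match(line.strip())
    if match is None:
        return None
    fields = match.group("body").split(";")
    if match.group("kind") == "SERVICE":
        if len(fields) < 6:
            return None
        host, service, state, state_type, attempt = fields[:5]
        output = ";".join(fields[5:])  # plugin output may itself contain ';'
    else:
        if len(fields) < 5:
            return None
        host, state, state_type, attempt = fields[:4]
        service = None  # host alerts carry no service name
        output = ";".join(fields[4:])
    return {
        "ts": match.group("ts"),
        "kind": match.group("kind"),
        "host": host,
        "service": service,
        "state": state,
        "state_type": state_type,
        "attempt": int(attempt),
        "output": output,
    }


def is_timeout(alert):
    """True if the plugin output looks like a timeout ('timed out' or 'timeout')."""
    text = alert["output"].lower()
    return "timed out" in text or "timeout" in text


def timeouts_per_minute(lines):
    """Count timeout alerts per minute, keyed by 'MM-DD-YYYY HH:MM'."""
    counts = Counter()
    for line in lines:
        alert = parse_alert(line)
        if alert is not None and is_timeout(alert):
            counts[alert["ts"][:16]] += 1  # drop the ':SS' part of the timestamp
    return counts


# A few lines copied from the alerts in this thread, for illustration only.
SAMPLE = [
    "[08-30-2007 09:24:10] HOST ALERT: tu.xyz.com;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds",
    "[08-30-2007 09:24:00] SERVICE ALERT: p.xyz.com;PING;CRITICAL;SOFT;2;CRITICAL - popen timeout received, but no child process",
    "[08-30-2007 09:23:40] SERVICE ALERT: pule.xyz.com;PING;WARNING;SOFT;1;PING WARNING - Packet loss = 44%, RTA = 3.64 ms",
    "[08-30-2007 09:23:40] SERVICE ALERT: pe.xyz.com;FTP;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds",
]

if __name__ == "__main__":
    for minute, count in sorted(timeouts_per_minute(SAMPLE).items()):
        print(minute, count)
```

Fed the full alert list, a large count in a single minute would fit the "stressed box" theory. Comparing those minutes against load and memory graphs of the Nagios host, as Dennis suggests, would be the natural next step.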