From: Wilson P. J. <wp....@te...> - 2004-08-26 18:18:36
>> Hi all.
>>
>> I've observed my nagios restart every perfparse.sh run. I use
>> perfparse-0.99.07
>>
>> [26-08-2004 12:35:00] Nagios 1.2 starting... (PID=12443)
>> [26-08-2004 12:35:00] Caught SIGHUP, restarting...
>>
>> Is this ok? If so, why is needed nagios restarting?

> If you are using --delete* option, yes, perfparse sends some SIGHUP signal
> to nagios to ask for restart and use a new serviceperf.log file. Otherwise,
> nothing to see with perfparse.

Yes, I use --delete (the default in perfparse.sh). Is there any advantage or
disadvantage to using this option? The problem I see with restarting nagios
every 10 minutes (in my case with pp) is that the nagios log grows too much,
since after each restart I get around 700 service and host alerts saying "OK"
(I monitor 100 hosts and 600 services).

>> I'm trying to figure out why my nagios stops working after few hours and
>> I'd like to know if this "nagios restarting" has some relation to "nagios
>> stopping".

> perfparse does not ask nagios to stop, but only to start writing to a new
> serviceperf.log file, which also mean restarting, but not stopping.
> In your log, strange that nagios starts and restarts at the same time.
> SIGHUP caught by chance at that time ?

Every 10 minutes I get the same situation (newest entries first):

[26-08-2004 14:40:11] SERVICE ALERT: bbrdcsrv1;RADIUS;OK;HARD;1; Radius.exe: Running
[26-08-2004 14:40:11] SERVICE ALERT: bbrdcsrv1;Drive_C;OK;HARD;1;C::\ - total: 2.93 Gb - used: 1.07 Gb (36%) - free 1.87 Gb (64%)
[26-08-2004 14:40:11] HOST ALERT: bbrdcsrv1;UP;HARD;1;(Host assumed to be up)
[26-08-2004 14:40:11] SERVICE ALERT: bbr002363sis;HTTP;OK;HARD;1;HTTP OK HTTP/1.1 200 OK - 3970 bytes in 0.316 seconds
[26-08-2004 14:40:11] HOST ALERT: bbr002363sis;UP;HARD;1;(Host assumed to be up)
[26-08-2004 14:40:01] Nagios 1.2 starting... (PID=12443)
[26-08-2004 14:40:01] Caught SIGHUP, restarting...
[26-08-2004 14:39:41] SERVICE ALERT: bbrsrdir1;Drive_E;WARNING;SOFT;5;E::\ - total: 19.53 Gb - used: 19.00 Gb (97%) - free 0.53 Gb (3%)
[26-08-2004 14:39:41] SERVICE ALERT: spossra;PING;CRITICAL;SOFT;1;PING CRITICAL - Packet loss = 0%, RTA = 20.24 ms
[26-08-2004 14:39:31] SERVICE ALERT: rjo1601a;CPU5m-Cisco;WARNING;SOFT;3;SNMP WARNING - *32*

As you can see, "Caught SIGHUP, restarting..." and "Nagios 1.2 starting..."
are logged at the same time. And then nagios starts to log the 700 "OK" alerts.

>> When nagios stops its checks, it always has a zombie process.

> How long do they live ? (just asking for if others can help :)

nagios runs for 2 to 6 hours, then stops checking:

Process Status Information
Process Status: WARNING
Check Command Output: Nagios problem: located 14 processes, status log updated 260136 seconds ago

So, with ps -aux I see a nagios <defunct> process. If I don't run
killall -9 nagios, the defunct process remains for days and nagios doesn't
check anything.

I don't know how to troubleshoot this situation. Could anybody help me?

> Yves

>>
>> Regards,
>>
>> Wilson
>>
>>
>
>
> -------------------------------------------------------
> SF.Net email is sponsored by Shop4tech.com-Lowest price on Blank Media
> 100pk Sonic DVD-R 4x for only $29 -100pk Sonic DVD+R for only $33
> Save 50% off Retail on Ink & Toner - Free Shipping and Free Gift.
> http://www.shop4tech.com/z/Inkjet_Cartridges/9_108_r285
> _______________________________________________
> Perfparse-users mailing list
> Per...@li...
> https://lists.sourceforge.net/lists/listinfo/perfparse-users
>
>

--
- Homepage - http://ymettier.free.fr - http://www.logicacmg.com -
- GPG key - http://ymettier.free.fr/gpg.txt -
- Maitretarot - http://www.nongnu.org/maitretarot/ -
- GTKtalog - http://www.nongnu.org/gtktalog/ -
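
To put numbers on how much of the log growth comes from the SIGHUP restarts, something like the following could be run over the nagios log. This is only a minimal sketch: it assumes every entry has the "[DD-MM-YYYY HH:MM:SS] message" shape seen in the excerpts above, and the function name `summarize` and the counter keys are made up for illustration, not part of nagios or perfparse.

```python
import re
from collections import Counter

# Assumed line shape, taken from the excerpts in this thread:
# [26-08-2004 14:40:01] Caught SIGHUP, restarting...
LINE_RE = re.compile(r"^\[(\d{2}-\d{2}-\d{4} \d{2}:\d{2}:\d{2})\] (.*)$")


def summarize(lines):
    """Count SIGHUP restarts and alert lines per state (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip anything that does not look like a log entry
        message = match.group(2)
        if message.startswith("Caught SIGHUP"):
            counts["sighup_restarts"] += 1
        elif message.startswith("SERVICE ALERT:"):
            fields = message.split(";")  # host;service;STATE;...
            if len(fields) > 2:
                counts["alert_" + fields[2]] += 1
        elif message.startswith("HOST ALERT:"):
            fields = message.split(";")  # host;STATE;...
            if len(fields) > 1:
                counts["alert_" + fields[1]] += 1
    return counts


# A few lines copied from the excerpt above, used as sample input.
sample = [
    "[26-08-2004 14:40:11] SERVICE ALERT: bbrdcsrv1;RADIUS;OK;HARD;1; Radius.exe: Running",
    "[26-08-2004 14:40:11] HOST ALERT: bbrdcsrv1;UP;HARD;1;(Host assumed to be up)",
    "[26-08-2004 14:40:01] Nagios 1.2 starting... (PID=12443)",
    "[26-08-2004 14:40:01] Caught SIGHUP, restarting...",
    "[26-08-2004 14:39:41] SERVICE ALERT: spossra;PING;CRITICAL;SOFT;1;PING CRITICAL - Packet loss = 0%, RTA = 20.24 ms",
]
summary = summarize(sample)
for key in sorted(summary):
    print(key, summary[key])
```

On a real log, passing an open file object instead of `sample` should work the same way, since the function only iterates over lines. Comparing the OK/UP alert count with the restart count gives a rough idea of how many lines each restart adds.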