From: Buchan M. <bg...@st...> - 2009-01-22 16:08:14
|
On Friday 09 January 2009 14:43:37 Thomas Kähn wrote: > Hi Buchan, > > On Fri, Jan 09, 2009 at 01:17:22PM +0200, Buchan Milne wrote: > > > one of our devmon servers turned purple this morning. It stopped here: > > > > > > [09-01-09@07:24:36] DEBUG TEMPLATES: running post_template_load() > > > [09-01-09@07:24:36] DEBUG CFG: running read_hosts > > > [09-01-09@07:24:36] DEBUG SNMP: running poll_devices() > > > [09-01-09@07:24:36] Starting snmp queries > > > [09-01-09@07:24:36] Getting device status from hobbit at > > > xx.xx.xx.xx:1984 > > > > > > It is possible, that this system couldn't reach the hobbit server > > > at that time. However it didn't recover from this status. > > > > Hmm, try as I might I can't get hobbitd to die between "my $sock = > > IO::Socket::INET->new" (or "if(defined $sock)" ) and while(<$sock>). > > > > However, simply adding a timeout to the socket may be enough to fix this, > > can you try with this patch ? > > I've added this patch to our devmon servers. Let's see what happens. Did you add the patch? Has it improved matters? Regards, Buchan |