From: Nathan H. <na...@ma...> - 2007-12-17 03:59:52
Attachments:
Devmon_checkpoint-splat.tar.gz
dm_tests.pm.diff
|
I've attached a small patch to negate the threshold regexp with the "!" symbol. This patch is necessary for the Checkpoint Firewall-1 (SPLAT) template I have also attached. Cheers, Nathan Hand |
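A minimal sketch of how a leading "!" could negate a threshold regexp. This is not the code from dm_tests.pm.diff; the helper name `threshold_matches` and the choice to anchor the pattern to the whole value are assumptions made for illustration only.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT the attached patch: returns true when $value
# satisfies the threshold pattern $spec. A leading "!" inverts the result.
# Anchoring the pattern to the whole value is an assumption of this sketch.
sub threshold_matches {
    my ($value, $spec) = @_;
    my $negate = ($spec =~ s/^!//) ? 1 : 0;    # strip the "!" and remember it
    my $hit    = ($value =~ /^(?:$spec)$/) ? 1 : 0;
    return $negate ? !$hit : $hit;
}

for my $case (['active', 'active|standby'],
              ['down',   '!active|standby'],
              ['active', '!active|standby']) {
    my ($value, $spec) = @$case;
    printf "%-8s against %-18s : %s\n", $value, $spec,
        threshold_matches($value, $spec) ? 'match' : 'no match';
}
```

With negation, a template can alarm on "anything except these states" without listing every bad state explicitly.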
From: xbgmsharp <xbg...@gm...> - 2007-12-18 10:11:00
|
Hello,

First, thanks for helping. I have committed your change.

Regards,
Francois

Nathan Hand wrote:
> I've attached a small patch to negate the threshold regexp with the "!"
> symbol. This patch is necessary for the Checkpoint Firewall-1 (SPLAT)
> template I have also attached.
>
> Cheers,
> Nathan Hand
>
> -------------------------------------------------------------------------
> SF.Net email is sponsored by:
> Check out the new SourceForge.net Marketplace.
> It's the best place to buy or sell services
> for just about anything Open Source.
> http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
> _______________________________________________
> Devmon-support mailing list
> Dev...@li...
> https://lists.sourceforge.net/lists/listinfo/devmon-support |
From: Morsiani, M. <mas...@gi...> - 2007-12-18 10:22:06
|
Hi all,

instead of day-by-day patching, is it possible to have a stable release? Or even a devmon roadmap?
I'm a little bit confused because it is not clear how devmon is evolving now. Is Devmon 3 beta3 (the one available on the Kaya wiki) stable?

Buchan, Francois,
do you plan to release a single package (devmon core + templates) on a regular basis? Or do you plan to release two distinct packages (devmon core, templates), just like the old owner?
And finally, do you want to release everything using SVN only?

Thank you for your work.

Regards.

Massimo Morsiani
Information Technology Dept.
------
Gilbarco S.p.a.
via de' Cattani, 220/G
50145 Firenze, Italy
tel: +39-055-30941
fax: +39-055-318603
email: mas...@gi...
web: http://www.gilbarco.it

This message (including any attachments) contains confidential and/or proprietary information intended only for the addressee. Any unauthorized disclosure, copying, distribution or reliance on the contents of this information is strictly prohibited and may constitute a violation of law. If you are not the intended recipient, please notify the sender immediately by responding to this e-mail, and delete the message from your system. If you have any questions about this e-mail please notify the sender immediately. |
From: Buchan M. <bg...@st...> - 2007-12-18 12:23:45
|
On Tuesday 18 December 2007 12:21:54 Morsiani, Massimo wrote:
> Hi all,
>
> instead of day-by-day patching, is it possible to have a stable release?

Yes, after an upcoming beta.

> Or even a devmon roadmap?

Later (IMHO, we should get another beta out first, before a roadmap).

However, for the actual 0.3.3 release, I would like to see at least:
- working graphs for as much as is possible without significant work
- fix known bugs related to template features
- ship a few more templates for common hardware (that exhibits the graphs)
- try and understand and/or resolve the "devmon tests go purple" issue.

> I'm a little bit confused because it is not clear how devmon is evolving
> now. Is Devmon 3 beta3 (the one available on the Kaya wiki) stable?

Why would a beta be stable?

> Buchan, Francois,
> do you plan to release a single package (devmon core + templates) on a
> regular basis? Or do you plan to release two distinct packages (devmon
> core, templates), just like the old owner?

I think splitting the software and the templates is fine. However, in some cases this may not lead to a working solution (as the patches/templates in this thread show).

In fact, one feature I would like to add is for a template I would like to get working ...

> And finally, do you want to release
> everything using SVN only?

Of course not, but it's better to have a single source repository accessible to everyone, with history (revision control), than for each user to have separate source trees and mail patches around (well, at least without using an RCS that supports this).

Regards,
Buchan |
From: Morsiani, M. <mas...@gi...> - 2007-12-18 15:07:56
|
Hi Buchan,

thanks for your prompt reply.
It's clearer now.

I made a mistake in my last email when I wrote

> Is Devmon 3 beta3 (the one available on the Kaya wiki) stable?

What I meant was

> Is Devmon 3 beta 3 usable (frozen) or still in progress?

That is: are you working on beta 4, or still working on beta 3?

Moreover, I agree with you on using two distinct packages (devmon core, templates).

What do you think about using the devmon-devel mailing list to submit/update templates?
Thank you.

Regards.

Massimo Morsiani |
From: xbgmsharp <xbg...@gm...> - 2007-12-18 16:38:38
|
Morsiani, Massimo wrote:
> Hi Buchan,
>
> thanks for your prompt reply.
> It's clearer now.
>
> I made a mistake in my last email when I wrote
>
>> Is Devmon 3 beta3 (the one available on the Kaya wiki) stable?
>
> What I meant was
>
>> Is Devmon 3 beta 3 usable (frozen) or still in progress?
>
> That is: are you working on beta 4, or still working on beta 3?

I use it on my own set of machines, so it is stable enough for me.
Devmon 3 beta 3 only includes bug fixes.

Planned for Devmon 3 beta 4:
- working graphs for as much as is possible without significant work
- ship a few more templates for common hardware (that exhibits the graphs)

I still say that I do not have this bug with Devmon 3 beta 3, or with what is on my wiki, which is available in svn trunk:
- try and understand and/or resolve the "devmon tests go purple" issue. |
From: Nathan H. <na...@ma...> - 2007-12-18 21:29:42
|
On 19/12/2007, at 3:39 AM, xbgmsharp wrote:
> I still say that I do not have this bug with Devmon 3 beta 3, or with what
> is on my wiki, which is available in svn trunk:
>
> - try and understand and/or resolve the "devmon tests go purple" issue.

I have some information regarding this bug.

Most switches support the CPUTotal5min oid which devmon happily displays in the cpu column. I have a 2900 switch running an older IOS which does not return any value for CPUTotal5min. When I use the 2900 template on this old switch I get clear and/or purple messages for if_dsc, if_stat, if_load and if_err in addition to a clear and/or purple message for the cpu column.

If I modify the template to remove the CPUTotal5min oid from the cpu/oids file, and remove the matching name from the cpu/message file, then all the other columns go green.

Similarly I have a number of HP servers with hpasm. I'm using the hp-server template to monitor power, fans, disks, raid, etc. I also have an older HP Windows 2000 server which doesn't support one of the oids - from memory I think it was resmem - and I get purple alerts for power, fans, disks, raid, etc. I had to remove resmem from the template before the other columns would go green.

Short summary: If any single oid fails to retrieve then devmon stops collecting the rest of the oids in the template.

I've confirmed this hypothesis with -f -p -vvvvvvvvvvv. Devmon says something about "too many failures" and "skipping device". Perhaps devmon should only skip that single column and still test the others. |
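The suggestion at the end of the message above (skip only the affected column, keep testing the others) can be sketched as follows. This is an illustration, not devmon code: the data layout, the function name `testable_columns`, and every OID name other than CPUTotal5min are assumptions.

```perl
use strict;
use warnings;

# Hypothetical data, not devmon's internal structures: the OIDs each test
# (column) needs, and the OIDs the device actually answered. Only the
# column names and CPUTotal5min come from the message above.
my %oids_for_test = (
    cpu     => ['CPUTotal5min'],
    if_stat => ['ifOperStatus', 'ifAdminStatus'],
    if_load => ['ifInOctets',   'ifOutOctets'],
);
my %answered = map { $_ => 1 }
    qw(ifOperStatus ifAdminStatus ifInOctets ifOutOctets);

# Decide per test rather than per device: a test is skipped only when one
# of its own OIDs is missing.
sub testable_columns {
    my ($needs, $got) = @_;
    my (@ok, @skipped);
    for my $test (sort keys %$needs) {
        my @missing = grep { !$got->{$_} } @{ $needs->{$test} };
        if (@missing) { push @skipped, $test }
        else          { push @ok,      $test }
    }
    return (\@ok, \@skipped);
}

my ($ok, $skipped) = testable_columns(\%oids_for_test, \%answered);
print "run tests for:   @$ok\n";
print "send clear for:  @$skipped\n";
```

In this arrangement the old 2900 switch would lose only its cpu column, while if_stat and if_load would still receive a status on every cycle.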
From: xbgmsharp <xbg...@gm...> - 2007-12-18 22:06:05
|
> Short summary: If any single oid fails to retrieve then devmon stops
> collecting the rest of the oids in the template.
>
> I've confirmed this hypothesis with -f -p -vvvvvvvvvvv. Devmon
> says something about "too many failures" and "skipping device".
> Perhaps devmon should only skip that single column and still test the
> others.

In the code, once more than two queries have failed, it stops checking the device and sends a clear status.

In modules/dm_snmp.pm (lines 535-543):

    # We dont want to do every table if we are failing alot of walks
    if($failed_query > 2) {
        my $error_str =
            "Failed too many queries on $dev, aborting query";
        $data_out{'error'}{$error_str} = 1;
        send_data($sock, \%data_out);
        $session->close();
        next DEVICE;
    }

I have the same problem with memory on my PIX.

But it never made the test go purple.

For now I have increased the failure limit to 5. Many of the failures are due to snmpget not behaving the way snmpwalk does.
For my memory test on some PIX devices I had to make the result a table.

--
Thanks for using xbgm# / Devmon.
http://xbgm.sourceforge.net/
http://devmon.sourceforge.net/
Please feedback. |
From: Nathan H. <na...@ma...> - 2007-12-18 23:17:01
|
On Tue, 18 Dec 2007 23:05:48 +0100, "xbgmsharp" <xbg...@gm...> said:
> In the code, once more than two queries have failed, it stops checking
> the device and sends a clear status.
>
> I have the same problem with memory on my PIX.
>
> But it never made the test go purple.

The interesting thing is it makes _other_ tests go purple.

My guess is that because it aborts the device query it doesn't send any message for subsequent tests on that device, not even clear messages. So after an hour the other tests go purple. |
From: xbgmsharp <xbg...@gm...> - 2007-12-19 09:42:05
|
You are right.

It only sends a clear status for the test with the error. It doesn't send a status for the other tests, so Hobbit turns them purple.

If you use the options -f -p -vvvvvvvvvvv --debug, you will see:

    [07-12-19@10:34:53] DEBUG SNMP: Dethawing data for device
    [07-12-19@10:34:53] ERROR: snmpget device (Received SNMP response with error code)

Here is the code this error refers to:

    # Looks like we got some data
    my $hashref = thaw($data_in);
    my %returned;
    if (defined $hashref) {
        do_log("DEBUG SNMP: Dethawing data for $dev",0) if $g{'debug'};
        %returned = %{ thaw($data_in) };
        # If we got good data, reset the fail counter to 0
        $g{'fail'}{$dev} = 0;
    }
    else {
        print "failed thaw on $dev\n";
        next;
    }

The problem is that I still don't understand the hash structure returned by the thaw function very well.

Francois

Nathan Hand wrote:
> The interesting thing is it makes _other_ tests go purple.
>
> My guess is that because it aborts the device query it doesn't send any
> message for subsequent tests on that device, not even clear messages. So
> after an hour the other tests go purple. |
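On the question of what thaw hands back: a small round trip with Perl's Storable module shows that `thaw` returns a reference to a copy of whatever structure was passed to `freeze`. The payload below is made up for illustration; only the `error` key mirrors the `$data_out{'error'}{$error_str} = 1` line quoted earlier in the thread, and it is an assumption that devmon serializes its data this way on the sending side.

```perl
use strict;
use warnings;
use Storable qw(freeze thaw);
use Data::Dumper;

# Hypothetical payload; the real keys devmon sends are not shown in this
# thread apart from the 'error' entry.
my %data_out = (
    sysDescr => { val => 'example device' },
    error    => { 'Failed too many queries on example, aborting query' => 1 },
);

my $frozen   = freeze(\%data_out);   # serialized string (assumed wire format)
my $hashref  = thaw($frozen);        # a hash reference again
my %returned = %{$hashref};          # thaw once, then dereference

# Dump the structure to see exactly which keys came back
$Data::Dumper::Sortkeys = 1;
print Dumper(\%returned);
```

Dumping the thawed hash this way in the debug branch is one way to see which keys a device actually returned before the tests run.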
From: xbgmsharp <xbg...@gm...> - 2007-12-19 10:32:21
|
Re,

Could you add this line (the do_log call):

    my ($bindings) = $session->decode_get_response($response);
    if(!defined $bindings or $bindings eq '') {
        my $snmp_err;
        do_log("DEBUG SNMP MSG: $SNMP_Session::errmsg",0) if $g{'debug'};
        ($snmp_err = $SNMP_Session::errmsg) =~ s/\n.*//s;

at line 474 of the file modules/dm_snmp.pm.

With it I get this error:

    SNMP Error:
    Received SNMP response with error code
    error status: tooBig
    index 0
    SNMPv2c_Session (.......

The error code "error status: tooBig" is explained here:
http://www.juniper.net/security/auto/vulnerabilities/vuln2591.html

It seems very weird.
If I remove a test, everything is fine. Also, if I poll the device by hand with snmpwalk or snmpget, it works.
Could you also test and tell me whether you get the same error message?

Here is a shell command which lets you test all the OIDs from a template:

    find . -type f -name "oids" | xargs cat | cut -d ':' -f 2 | tr -d ' ' |
      grep -v "^$" | awk '{ print "snmpwalk -t 5 -On -v2c -c COMMUNITY IP " $1 }' | sh

Francois. |
From: xbgmsharp <xbg...@gm...> - 2007-12-19 10:35:08
|
From this:
http://www.cisco.com/en/US/docs/net_mgmt/ciscoworks_ciscoview/4.1/quick/guide/cvgstrcv.html

SNMP Error Messages
tooBig:
The request you made cannot fit into a single packet. Generally, CiscoView splits requests for physical view status until the device can respond. In certain cases, CiscoView assumes that if an agent times out on 20 or more variables, the agent might not be able to respond because the request is too big; it splits the request and resends it. Check that the MTU size on the SNMP interface is as large as possible so that CiscoView does not waste bandwidth by sending more than one request. |
From: xbgmsharp <xbg...@gm...> - 2007-12-19 15:18:58
|
Hi all,

After a day of debugging, it works.
I modified the file dm_snmp.pm, which does the SNMP polling.
Instead of polling all leaf OIDs in one request, each OID now gets its own request. Now that I am polling each OID separately, devmon should no longer go purple.
I tried sending them in batches of 5 or 10, but devmon loses track when matching the returned data to the tests done in dm_tests.pm.

I will let it run against some PIX and F5 devices and commit by the end of the week.

All of this is to avoid the SNMP error "error status: tooBig".
Because of this error, all my leaf tests were going clear.

Comments?

Regards,
Francois |
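A rough sketch of the batching idea, showing one way to keep each returned value paired with the OID that was requested. Nothing here is taken from dm_snmp.pm: `fetch_batch` is a hypothetical stand-in for a single SNMP get request, the OIDs are made up, and the sketch assumes the agent returns values in the same order as the request.

```perl
use strict;
use warnings;

# Hypothetical stand-in for one SNMP get request: takes a list of OIDs and
# returns one value per OID, in request order. A real implementation would
# call the SNMP library here.
sub fetch_batch {
    my @oids = @_;
    return map { "value-of-$_" } @oids;
}

# Poll leaf OIDs in batches of $size and key every value by its own OID,
# so later code never depends on the position inside a batch.
sub poll_in_batches {
    my ($size, @oids) = @_;
    my %value_for;
    while (my @batch = splice(@oids, 0, $size)) {
        my @values = fetch_batch(@batch);
        @value_for{@batch} = @values;    # hash slice pairs OID with value
    }
    return \%value_for;
}

my @leaf_oids = map { ".1.3.6.1.4.1.9999.1.$_.0" } 1 .. 12;   # made-up OIDs
my $result = poll_in_batches(5, @leaf_oids);
printf "%s => %s\n", $_, $result->{$_} for sort keys %$result;
```

Calling `poll_in_batches(1, @leaf_oids)` gives the one-OID-per-request behaviour described in the message above; a larger size trades fewer round trips against the risk of another tooBig response.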
From: Stewart, T. L. <Tom...@la...> - 2007-12-19 16:03:40
|
All,

As I sometimes have a problem with Devmon hanging and not reporting anything (I used beta-2 and upgraded to beta-3 on a Solaris 10 box), I have a cron job that stops and restarts devmon every 15 minutes to prevent losing too much data. As I run this on the Hobbit monitor, I am now seeing that the memory usage has constantly gone up since I started Devmon (first of October). The Hobbit server is now starting to do some swapping as physical memory is all used up.

I have devmon set to use the standard 10 and most likely could reduce the number of procs, but this won't help in the long run as I keep adding more systems.

Has anyone else seen this problem?

Thank you,
Tom |
From: Buchan M. <bg...@st...> - 2007-12-19 17:26:04
|
On Wednesday 19 December 2007 17:19:34 xbgmsharp wrote:
> All of this is to avoid the SNMP error "error status: tooBig".
> Because of this error, all my leaf tests were going clear.
>
> Comments?

I have a number of servers using the compaq-server template, and on some, all the tests are always green. On the others, they are all clear, and I have error messages such as this:

    Missing repeater data for primary OID cpqHeFltTolFanIndex

There is no apparent difference between the servers. One DL380 works, one does not. Two DL580s work, three don't.

I'm wondering if there is a similar issue here. If I snmpwalk the whole Compaq OID, or each branch, I get the data I expect on the servers that aren't working.

I think that is one of the last few issues I'd like to see fixed before 0.3.0 goes out.

Regards,
Buchan |
From: Morsiani, M. <mas...@gi...> - 2007-12-19 18:07:17
|
Hi all,

in order to monitor HP servers (DL360, DL380, DL580, etc.), what do I need to install on our machines?
SNMP agents? Where can I find them?
Thank you.

Regards.

Massimo Morsiani |
From: Nathan H. <na...@ma...> - 2007-12-19 19:05:03
|
On Linux, install the "hpasm" package. On Windows, install the Systems Insight Manager package.

On 20/12/2007, at 5:06 AM, Morsiani, Massimo wrote:
> Hi all,
>
> in order to monitor hp servers (DL360, DL380, DL580, etc) what do I
> need to install on our machines?
> SNMP agents? Where can I find them?
> Thank you.
|
From: xbgmsharp <xbg...@gm...> - 2007-12-20 09:54:57
|
Hi all,

To figure out whether it is the same bug, I would suggest the following. In the template directory, execute:

    find . -type f -name "oids" | xargs cat | grep leaf | cut -d ':' -f 2 | tr -d ' ' | awk '{ print "snmpget -t 20 -On -v2c -c COMMUNITY IP " $1 }' | sh

This should work; it does one snmpget per leaf OID, as my version does.

To do it the way devmon currently does it:

    find . -type f -name "oids" | xargs cat | grep leaf | cut -d ':' -f 2 | tr -d ' ' | tr '\n' ' ' | awk -F '|' '{ print "snmpget -t 20 -On -v2c -c COMMUNITY IP " $1 }' | sh

This requests all leaf OIDs in a single request. If you get the tooBig error, my version fixes the bug. If not, send the error message.

If the problem is due to snmpwalk, I would suggest adding this line:

    do_log("DEBUG SNMP MSG: $SNMP_Session::errmsg",0) if $g{'debug'};

before lines 474 and 524 in the file modules/dm_snmp.pm.

Then running devmon in debug mode will print the error message received when polling.

Anyone having this kind of problem, please try this.

Regards,
Francois
|
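The two one-liners in this message can also be written as small functions, which makes the difference between the two request styles easier to see. This is a hedged sketch: it assumes each line of an "oids" file has the form "name : numeric-OID : type", and the function names are made up for the example. It only prints the snmpget commands; nothing is sent to a device unless the output is piped to sh.

```shell
#!/bin/sh
# Sketch: collect the leaf OIDs from every "oids" file below a template directory.
# Assumes lines of the form "name : numeric-OID : type" (an assumption about the format).
list_leaf_oids() {   # list_leaf_oids TEMPLATE_DIR
    find "$1" -type f -name oids -exec cat {} + |
        awk -F ':' '{ gsub(/[ \t]/, "") } $3 == "leaf" { print $2 }'
}

# One snmpget per leaf OID (the per-OID polling of the modified dm_snmp.pm).
print_single_gets() {   # print_single_gets TEMPLATE_DIR COMMUNITY HOST
    list_leaf_oids "$1" | xargs -n 1 echo snmpget -t 20 -On -v2c -c "$2" "$3"
}

# All leaf OIDs in a single snmpget (the behaviour that can trigger tooBig).
print_combined_get() {   # print_combined_get TEMPLATE_DIR COMMUNITY HOST
    list_leaf_oids "$1" | xargs echo snmpget -t 20 -On -v2c -c "$2" "$3"
}
```

For example, `print_combined_get . COMMUNITY IP | sh` run from the template directory would issue the single large request, while `print_single_gets . COMMUNITY IP | sh` would issue one request per OID.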
From: Buchan M. <bg...@st...> - 2007-12-21 07:16:35
|
On Thursday 20 December 2007 11:55:38 xbgmsharp wrote:
> Hi all,
>
> To figure out whether it is the same bug, I would suggest the following.
> In the template directory, execute:
> find . -type f -name "oids" | xargs cat | grep leaf | cut -d ':' -f 2 |
> tr -d ' ' | awk '{ print "snmpget -t 20 -On -v2c -c COMMUNITY IP " $1
> }' | sh
>
> This should work; it does one snmpget per leaf OID, as my version does.

Except I am having problems with branch OIDs.

> To do it the way devmon currently does it:
> find . -type f -name "oids" | xargs cat | grep leaf | tr -d ' ' | tr
> '\n' ' ' | awk -F '|' '{ print "snmpget -t 20 -On -v2c -c COMMUNITY
> IP " $1 }' | sh
>
> This requests all leaf OIDs in a single request.
> If you get the tooBig error, my version fixes the bug. If not, send the
> error message.

Can you commit your fix in svn? If you commit it today I can test (I will be on leave after today until 2 January).

> If the problem is due to snmpwalk, I would suggest adding this line:
> do_log("DEBUG SNMP MSG: $SNMP_Session::errmsg",0) if $g{'debug'}
>
> before lines 474 and 524 in the file modules/dm_snmp.pm.
>
> Then running devmon in debug mode will print the error message received
> when polling.

On some devices I get something like this:

ERROR: snmpget xxx (Received SNMP response with error code)

So, I would prefer to test your fix.

Regards,
Buchan
|
From: Buchan M. <bg...@st...> - 2008-01-14 17:36:41
|
On Wednesday 19 December 2007 19:25:55 Buchan Milne wrote:
> I have a number of servers using the compaq-server template, and on some,
> all the tests are always green. On the others, they are all clear, and I
> have error messages such as this:
>
> Missing repeater data for primary OID cpqHeFltTolFanIndex
>
> There is no apparent difference between the servers. One DL380 works, one
> does not. Two DL580s work, three don't.
>
> I'm wondering if there is a similar issue here. If I snmpwalk the whole
> Compaq OID, or each branch, I get the data I expect on the servers that
> aren't working.

I have resolved most of the issues here. In some cases it may have been that the Insight agent for the device type was not running (I have two servers that currently aren't showing any info on their RAID controllers).

In the other case, the IML log was empty, which results in a "No Such Object available on this agent at this OID" for the parent of all the branches in the log test.

When the IML log is emptied with the "hpasmcli -s 'clear iml'" command, the IML is left completely empty, and this error occurs. When the log is emptied from the iLO (Integrated Lights-Out) card's web interface, one entry is put in the log, as follows:

Informational Unknown 2008-1-14 13:14 2008-1-14 13:14 1 IML Cleared (iLO user:admin)

So, I have updated the text on the log page to indicate that the IML should be cleared from the web interface. Note that if the IML log is empty and the log test is enabled on the server, most of the other tests will most likely be clear (in the case where I upped the limit for failed branches in the code, around line 560 in dm_snmp) or purple. I'm undoing the change I had made to the "Failed too many queries" limit. However, it would have been nicer if we could just skip the test in question (rather than the whole device).

Regards,
Buchan
|
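The empty-IML case described in this message can be detected from the agent's reply before the log test is evaluated. The sketch below is an illustration only, not something devmon does: it reads snmpwalk output on stdin and looks for the message quoted above. The function names are hypothetical, and the IML branch OID is deliberately left out because the thread does not give it.

```shell
#!/bin/sh
# Sketch: succeed (exit 0) when the walked branch does not exist on the agent,
# i.e. the snmpwalk output on stdin contains the "No Such Object" message.
branch_is_missing() {
    grep -q 'No Such Object available on this agent at this OID'
}

# Hypothetical helper: say whether the log test should be polled or skipped.
log_test_status() {
    if branch_is_missing; then
        echo "skip"    # e.g. the IML was cleared with hpasmcli and has no entries
    else
        echo "poll"
    fi
}
```

It would be used as `snmpwalk -v2c -c COMMUNITY HOST IML_BRANCH_OID | log_test_status`, where COMMUNITY, HOST and IML_BRANCH_OID are placeholders to fill in.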
From: xbgmsharp <xbg...@gm...> - 2008-01-14 18:04:14
|
...
> So, I have updated the text on the log page to indicate that the IML
> should be cleared from the web interface. Note that if the IML log is
> empty and the log test is enabled on the server, most likely most of the
> other tests will be clear (in the case where I upped the limit for failed
> branches in the code, around line 560 in dm_snmp) or purple. I'm undoing
> the change on the "Failed too many queries" limit I had done. However, it
> would have been nicer if we could just skip the test in question (rather
> than the whole device).
>
> Regards,
> Buchan

I think it would be a nice feature to skip the test in question rather than the whole device. In fact, it was done this way in order not to block other devices, because sometimes a badly configured device will show all tests as clear instead of their real colors.

A limit on the number of errors per device is useful, I think. So maybe we should implement both a skip-test error counter and a skip-device error counter.

Regards,
Francois.

--
Thanks for using xbgm# / Devmon / BBwin.
http://xbgm.sourceforge.net/
http://devmon.sourceforge.net/
http://bbwin.sourceforge.net/
Please send feedback.
|
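The idea of keeping two separate limits, one per test and one per device, can be sketched as follows. Everything here is hypothetical: the input format (one line per test with its number of failed queries), the function name and the limit values are invented for illustration and do not reflect how dm_snmp.pm counts failures.

```shell
#!/bin/sh
# Sketch of two independent failure limits.
# stdin: one line per test, "test-name failed-query-count".
# A test is skipped when its own count exceeds TEST_LIMIT; the whole device is
# skipped only when the total over all tests exceeds DEVICE_LIMIT.
apply_failure_limits() {   # apply_failure_limits TEST_LIMIT DEVICE_LIMIT
    awk -v test_limit="$1" -v device_limit="$2" '
        {
            total += $2
            print $1, (($2 + 0) > (test_limit + 0) ? "skip" : "poll")
        }
        END { print "device", (total > (device_limit + 0) ? "skip" : "poll") }
    '
}

# Example: only the log test exceeds the per-test limit of 5, and the total
# of 7 failures stays under the device limit of 10.
printf '%s\n' 'log 6' 'fans 0' 'temp 1' | apply_failure_limits 5 10
```

With limits like these, an empty IML would take only the log test out of the page, while a device that fails across the board would still be skipped as a whole.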