From: Steve B <rec...@gm...> - 2015-09-10 09:11:29
|
Hi All, A question, in the last couple of days I have witnessed some devmon shutdowns. I have been making some new templates and in some cases, I have been using some full MIB names, example: cpqRackPowerSupplyEnclosureSerialNum <http://www.circitor.fr/Mibs/Html/CPQRACK-MIB.php#cpqRackPowerSupplyEnclosureSerialNum> Is this too long? Normally I do not use such long names and abbreviate them so I was wondering if this could be the culprit of Devmon stopping working. Here you can see in the devmon.log, a shut down at 10:27 (it shut down by itself) which brings purples to all devmon checks: [15-09-10@10:27:53] No SNMP data found for ifDescr [15-09-10@10:27:53] No SNMP data found for ifDescr [15-09-10@10:27:53] No SNMP data found for ifDescr [15-09-10@10:27:53] No SNMP data found for ifDescr [15-09-10@10:27:53] Shutting down [15-09-10@10:57:54] ---Initilizing devmon... [15-09-10@10:57:54] Node 0 reporting to <servername> [15-09-10@10:57:54] Running under process id: 2363 [15-09-10@10:57:54] Entering poll loop [15-09-10@10:57:54] Undefined oid 'cpqRackEncTmpGcond' referenced in [15-09-10@10:57:54] Attempting to redefine HPG2 /RT3000/batt-capacity template when reading data from [15-09-10@10:57:54] Attempting to redefine HPG2 /RT3000/pwr template when reading data from [15-09-10@10:57:54] Attempting to redefine HPG2 /RT3000/temp template when reading data from [15-09-10@10:57:54] Undefined oid 'upsSecsOnBatt' referenced in [15-09-10@10:59:25] Fork 20 (2514) exceeded poll time polling [15-09-10@10:59:25] Fork 20 (2514) exceeded poll time polling [15-09-10@10:59:52] Fork 15 (2506) exceeded poll time polling [15-09-10@10:59:52] Fork 15 (2506) exceeded poll time polling [15-09-10@11:00:07] Fork 11 (2500) exceeded poll time polling [15-09-10@11:00:07] Fork 11 (2500) exceeded poll time polling [15-09-10@11:00:47] Fork 18 (2510) exceeded poll time polling [15-09-10@11:00:47] Fork 18 (2510) exceeded poll time polling [15-09-10@11:01:02] Fork 12 (2502) exceeded poll time polling [15-09-10@11:01:02] Fork 12 (2502) exceeded poll time polling [15-09-10@11:01:03] Fork 19 (2511) exceeded poll time polling [15-09-10@11:01:03] Fork 19 (2511) exceeded poll time polling At 10:00ish I had introduced the new oids (with long Mib names). I had this same issue yesterday and last week. I started Devmon up again at 10:57. I do have poll time issues from time to time as I am polling devices all over the world and some places are less responsive due to setup but typically my thresholds are ok and my checks are responsive. I have Devmon updating in a 5 minute cycle in Xymon. Any ideas about the length of the oids + subsequent shutdown? Thanks S |