From: Andy S. <ab...@sh...> - 2015-04-22 21:12:45
Andy Smith wrote:
> All,
>
> With this patch installed, I cannot get xymond_client to run in the
> production system. The pre-prod system (identical platform but with no
> load) it runs fine, but OTOH we dont have any Windows clients in pre-prod.
>
> Our windows clients in prod are all BBWin.
>
> I managed to encourage it to run with debug enabled for a couple of
> seconds, apparently OK with the UNIX clients until it confronted this
> message :-
>
> 5923 2015-04-22 16:44:06.007725 xymond_client: Got message 12462
> @@client#12462/WWWWWWWWWWW|1429717445.990578|NN.NNN.NN.NN|WWWWWWWWWWWW|bbwin|win32|
> 5923 2015-04-22 16:44:06.007750 startpos 571640, fillpos 620792, endpos -1
> 5923 2015-04-22 16:44:06.007781 Client report from host WWWWWWWWWWW
> 5923 2015-04-22 16:44:06.008566 Uptime check host WWWWWWWWWWW
> 5923 2015-04-22 16:44:06.008610 uptimestr [6409428]
> 2015-04-22 16:44:06.119740 Peer at 0.0.0.0:0 failed: Broken pipe
> 2015-04-22 16:44:06.119927 Child process 5923 died: Signal 6
>
> 2015-04-22 16:44:06.227366 Peer not up, flushing message queue
> .......................276 similar lines ................
> 2015-04-22 16:45:04.592976 Peer not up, flushing message queue
>
> We definitely have the memory leak issue with 4.3.19, which is why I was
> keen to get this patch installed.
>
> Is there a way to debug this safely in production since that is the only
> place I can reproduce it?
> Thanks

bash-3.2$ gdb /apps/xymon/server/bin/xymond_client-r7629 /apps/xymon/server/tmp/core.5923
GNU gdb Fedora (6.8-27.el5)
Copyright (C) 2008 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...

warning: core file may not match specified executable file.

warning: Can't read pathname for load map: Input/output error.
Reading symbols from /lib/librt.so.1...done.
Loaded symbols for /lib/librt.so.1
Reading symbols from /apps/contrib/ssl/lib/libssl.so.1.0.0...done.
Loaded symbols for /apps/contrib/ssl/lib/libssl.so.1.0.0
Reading symbols from /apps/contrib/ssl/lib/libcrypto.so.1.0.0...done.
Loaded symbols for /apps/contrib/ssl/lib/libcrypto.so.1.0.0
Reading symbols from /apps/contrib/lib/libpcre.so.1...done.
Loaded symbols for /apps/contrib/lib/libpcre.so.1
Reading symbols from /lib/libc.so.6...done.
Loaded symbols for /lib/libc.so.6
Reading symbols from /lib/libpthread.so.0...done.
Loaded symbols for /lib/libpthread.so.0
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
Reading symbols from /lib/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /apps/contrib/lib/libz.so.1...done.
Loaded symbols for /apps/contrib/lib/libz.so.1
Core was generated by `xymond_client --debug'.
Program terminated with signal 6, Aborted.
[New process 5923]
#0 0x00ea2402 in __kernel_vsyscall ()
(gdb) bt
#0 0x00ea2402 in __kernel_vsyscall ()
#1 0x002dedf0 in raise () from /lib/libc.so.6
#2 0x002e0701 in abort () from /lib/libc.so.6
#3 0x0806e8d3 in stackfclist (v_listhead=0x78652d70) at stackio.c:303
#4 <signal handler called>
#5 0x0805f4c7 in getrule (hostname=0x9bd9e08 "WWWWWWWW",
    pagename=<value optimized out>, classname=0xb7a5bfc7 "win32",
    hinfo=0x9bd9d80, ruletype=C_LOAD) at client_config.c:361
#6 0x08060463 in get_cpu_thresholds (hinfo=0x9bd9d80,
    classname=0xb7a5bfc7 "win32", loadyellow=0xbffa6070, loadred=0xbffa606c,
    recentlimit=0xbffa6068, ancientlimit=0xbffa6064, uptimecolor=0xbffa6060,
    maxclockdiff=0xbffa605c, clockdiffcolor=0xbffa6058)
    at client_config.c:1954
#7 0x08055321 in handle_win32_bbwin_client (
    hostname=0xb7a5bfb4 "WWWWWWWW", clienttype=0xb7a5bfc7 "win32",
    os=OS_WIN32_BBWIN, hinfo=0x9bd9d80, sender=0xb7a5bfa7 "NN.NNN.NN.NN",
    timestamp=1429717445,
    clientdata=0xb7a5bfce "client WWWWWWWW.DC.COM.bbwin win32\n[date")
    at client/bbwin.c:51
#8 0x0805cd44 in main (argc=2, argv=0xbffa6294) at xymond_client.c:2279
(gdb)

--
Andy