Re: [Dpcl-develop] Using 'diag' test

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Steve
This looks like the target program is what is crashing and you are ending 
up with a core file for that process. Is that correct? If not, can you 
clarify what is crashing and how you are getting the traceback? Where are 
you placing print statements in eut_diag.C? The callback functions 
dcall_back and dcall_back2 or somewhere else?

Assuming this is the target, nothing that eut_diag.C should be causing 
timing problems with the possible exception of the dcall_back and 
dcall_back2 functions, and even then I have a hard time seeing a problem 
with the printf slowing down things enough to influence target execution. 
The path from the target program thru the daemon back to the client is 
pretty long anyway, and buffering of messages which happen to arrive so 
fast they cause Ais_send to slow down should be buffered in the daemon 
anyway.

Basically, what eut_diag does is loads or attaches to the application, 
suspends it, finds all the function entry and exit points, instruments 
them, then resumes target execution. 

The traceback you included does appear to be from within a call to 
Ais_send within the target after the target has been resumed. What is the 
target program logic? Are there functions being called in tight loops, 
such that the entry and exit probes are being called rapidly, or are there 
things like sleep() calls slowing down execution. Does that make a 
difference?  Are you loading or connecting to the application? Is the 
target application single threaded?

Those questions will help get us started. I will find some time next week 
to look at this further.

Dave

Steve Collins <sl...@sg...> 
Sent by: dpc...@ww...
05/28/2004 02:08 PM

To
dpc...@ww...
cc
sl...@sg...
Subject
[Dpcl-develop] Using 'diag' test

    Hello, DPCL'ers. I am using DPCL/Dyninst on ia64 and, for some
  time now, have been trying to get the 'diag' (eut_diag) sample DPCL
  test running. It seems to (non-deterministically) be dying in the
  daemon_RT shared memory code, to wit:

              #0  shmFObjectAllocV (buffer=0x2000000000f15000, shm_key=
                     {daemon_address = 0x2000000000e15000, process_address 
= 0x2000000000f15000}, 
                   object_number=1, object_holder=0x2000000000e263c8, 
rc=0x60000fffffff8100)
                   at ../src/os/linux/ShmManager.C:687
               687            *p_free_object = 
               (gdb) where
               #0  shmFObjectAllocV (buffer=0x2000000000f15000, shm_key=
                     {daemon_address = 0x2000000000e15000, process_address 
= 0x2000000000f15000}, 
                   object_number=1, object_holder=0x2000000000e263c8, 
rc=0x60000fffffff8100)
                   at ../src/os/linux/ShmManager.C:687
               #1  0x2000000000e11090 in shm_processObjectAllocV (shm_key=
                     {daemon_address = 0x2000000000e15000, process_address 
= 0x2000000000f15000}, 
                   object_number=0, object_holder=0x2000000000e263c8, 
rc=0x60000fffffff8100)
                   at ../src/os/linux/ShmManagerAPI_app.C:48
               #2  0x2000000000e119d0 in Ais_send 
(msg_handle_id=0x2000000000f225a4 "", 
                   message=0x200000000083e320, message_size=30) at 
../src/os/linux/ShmMessageAPI_app.C:319
               #3  0x2000000000e11670 in Ais_send_int 
(msg_handle_id=0x2000000000f225a4 "", 
                   message=0x200000000083e320, message_size=30) at 
../src/os/linux/ShmMessageAPI_app.C:85
               #4  0x200000000083e3e0 in DYNINSTstaticHeap_4M_anyHeap_1 ()
                  from /scratch/slc/dpcl-install/lib/libdyninstAPI_RT.so.1

  This walkback from gdb is, itself, suspicious, but about all I have 
available.
  Again, the failure doesn't happen every time. If I tweak 'diag' 
(eut_diag.C)
  in a small, seemingly innocuous, way, then things work. If I put any 
sort of
  debug prints in place, then the SEGV doesn't appear. It really seems 
like a
  timing thing, but I can't finger the culprit.

    This is admittedly a fishing expedition. Does 'diag' have some sort of 
known
   race condition, or flaky behaviour? Just fishing....

                                                 SteveC - SGI 
Tools/Compilers
_______________________________________________
Dpcl-develop mailing list
Dpc...@ww...
http://www-124.ibm.com/developerworks/oss/mailman/listinfo/dpcl-develop