Menu

#2654 clm: clm test asserts due to timeout in poll

never
not-reproducible
None
defect
clm
-
major
False
2018-08-30
2017-10-25
No

In CLM tests, immadm command is used in many places for executing CLM admin operations for locking, unlocking and shautting down nodes.
In overloaded system this can make problems, and the execution of immadm can take long time.
Since we have sanity check for executing immadm in another thread, this situation may go to timeout in poll in an overloaded system.

Thread 1 (Thread 0x7f3b61c0c740 (LWP 280)):
#0 0x00007f3b61007428 in __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:54
resultvar = 0
pid = 280
selftid = 280
#1 0x00007f3b6100902a in __GI_abort () at abort.c:89
save_stage = 2
act = {__sigaction_handler = {sa_handler = 0x4, sa_sigaction = 0x4}, sa_mask = {__val = {0, 0, 140732357794048, 47244640256, 139893019865088, 94726958067288, 865, 94726958071200, 0, 0, 139893007538572, 139893008635480, 139893008649136, 0, 139893008635480, 94726958067288}}, sa_flags = 1640067072, sa_restorer = 0x562756afae58}
sigs = {__val = {32, 0 <repeats 15 times>}}
#2 0x00007f3b60fffbd7 in __assert_fail_base (fmt=<optimized out>, assertion=assertion@entry=0x562756afae58 "ret == 1", file=file@entry=0x562756afb398 "src/clm/apitest/tet_saClmClusterTrack.c", line=line@entry=865, function=function@entry=0x562756afbda0 <__PRETTY_FUNCTION__.7254> "saClmClusterTrack_27") at assert.c:92
str = 0x562758c0d360 ""
total = 4096
#3 0x00007f3b60fffc82 in __GI___assert_fail (assertion=assertion@entry=0x562756afae58 "ret == 1", file=file@entry=0x562756afb398 "src/clm/apitest/tet_saClmClusterTrack.c", line=line@entry=865, function=function@entry=0x562756afbda0 <__PRETTY_FUNCTION__.7254> "saClmClusterTrack_27") at assert.c:101
No locals.
#4 0x0000562756af64c7 in saClmClusterTrack_27 () at src/clm/apitest/tet_saClmClusterTrack.c:865
fds = {{fd = 10, events = 1, revents = 0}}
thread8 = 139892944803584
__PRETTY_FUNCTION__ = "saClmClusterTrack_27"
#5 0x0000562756afa309 in run_test_case (suite=<optimized out>, tcase=<optimized out>) at src/osaf/apitest/utest.c:178
No locals.
#6 0x0000562756afa824 in test_run (suite=<optimized out>, tcase=<optimized out>) at src/osaf/apitest/utest.c:202
i = 7
j = 27
#7 0x00007f3b60ff2830 in __libc_start_main (main=0x562756af2910 <main>, argc=1, argv=0x7ffece31db98, init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7ffece31db88) at ../csu/libc-start.c:291
result = <optimized out>
unwind_buf = {cancel_jmp_buf = {{jmp_buf = {0, -8017775631807234294, 94726958036048, 140732357794704, 0, 0, -4393240863189713142, -4430610558499635446}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x7ffece31dba8, 0x7f3b61c1c168}, data = {prev = 0x0, cleanup = 0x0, canceltype = -835593304}}}
not_first_call = <optimized out>
#8 0x0000562756af3479 in _start ()

Discussion

  • Zoran Milinkovic

    • status: accepted --> review
     
  • Anders Widell

    Anders Widell - 2017-11-03
    • Milestone: 5.17.11 --> 5.18.01
     
  • Anders Widell

    Anders Widell - 2018-02-02
    • Milestone: 5.18.01 --> 5.18.04
     
  • Richa

    Richa - 2018-08-30

    Hi Zoran,
    The patch is already ACKed by Gary, can you please push it.
    Thanks

     

    Last edit: Richa 2018-08-30
  • Gary Lee

    Gary Lee - 2018-08-30

    Hi Richa

    Are clm tests failing for you?

    I think we saw some clmtest failures and thought this would help, but it turned out to be something else so we didn't need to push it.

     

    Last edit: Gary Lee 2018-08-30
  • Richa

    Richa - 2018-08-30

    Hi Gary,
    I checked that this ticket is in review for long time and patch has been ACKed, so just thought to remind.
    Thanks
    Richa

     
  • Gary Lee

    Gary Lee - 2018-08-30
    • status: review --> not-reproducible
     
  • Gary Lee

    Gary Lee - 2018-08-30

    It's not happening anymore

     
  • Richa

    Richa - 2018-08-30
    • Milestone: 5.18.04 --> never
     
  • Richa

    Richa - 2018-08-30

    Thanks Gary.

     

Log in to post a comment.