Menu

#745 ckpt retention duration set continuously timesout

4.4.1
fixed
None
defect
ckpt
-
4.4.M0
major
2014-07-24
2014-01-24
No

The issue is seen on cs4733 along with patch #561. This is observed only when 32bit application is running on 64bit opensaf.

The application does the following:

1) Open an asynchronous checkpoint
2) Set the retention duration of the checkpoint

The api continuously return ERR_TIMEOUT after which application exits.

Attached are the journals and traces.

1 Attachments

Related

Tickets: #745
Wiki: ChangeLog-4.4.1

Discussion

  • A V Mahesh (AVM)

    • Milestone: future --> 4.5.FC
     
  • Sirisha Alla

    Sirisha Alla - 2014-04-23

    This issue is also observed in 4.3.2 cs 5143. There is also core file formed during the test. When the issue was reported earlier core dump is not observed since cores are not generated when run with non root user. Now when the same test is repeated with root user, core dumps are observed.

    Thread 1 (Thread 0x7f6b677a5b00 (LWP 4053)):
    #0 0x00007f6b67356965 in ncs_decode_32bit (stream=0x7f6b677a0d08) at hj_dec.c:197
    #1 0x00007f6b673575eb in ncs_edp_uns32 (hdl=0x6381c8, edu_tkn=0x0, ptr=0x7f6b677a0e68, ptr_data_len=0x7f6b677a0e64, buf_env=0x7f6b677a2d00, op=EDP_OP_TYPE_DEC,
    o_err=0x7f6b677a2e6c) at hj_edp.c:566
    #2 0x00007f6b67359820 in ncs_edu_run_edp (edu_hdl=0x6381c8, edu_tkn=0x0, rule=0x7f6b677a1260, edp=0x403080 ncs_edp_uns32@plt, ptr=0x7f6b677a0e68, dcnt=0x7f6b677a0e64,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:481
    #3 0x00007f6b6735af04 in ncs_edu_prfm_dec_on_non_ptr (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1260, ptr=0x7f6b60003ab0, ptr_data_len=0x7f6b677a1434,
    buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:1149
    #4 0x00007f6b6735a04e in ncs_edu_perform_exec_action_on_non_ptr (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1260, optype=EDP_OP_TYPE_DEC,
    ptr=0x7f6b60003ab0, ptr_data_len=0x7f6b677a1434, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:810
    #5 0x00007f6b67359f97 in ncs_edu_perform_exec_action (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1260, optype=EDP_OP_TYPE_DEC, ptr=0x7f6b60003ab0,
    ptr_data_len=0x7f6b677a1434, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:783
    #6 0x00007f6b67359cf0 in ncs_edu_exec_rule (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1260, ptr=0x7f6b60003ab0, ptr_data_len=0x7f6b677a1434,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:630
    #7 0x00007f6b6735bb3e in ncs_edu_run_rules_for_dec (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, prog=0x7f6b677a1160, ptr=0x7f6b60003ab0, ptr_data_len=0x7f6b677a1434,
    buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c, instr_count=6) at hj_edu.c:1821
    #8 0x00007f6b67359a19 in ncs_edu_run_rules (edu_hdl=0x6381c8, edu_tkn=0x0, prog=0x7f6b677a1160, ptr=0x7f6b60003ab0, ptr_data_len=0x7f6b677a1434, buf_env=0x7f6b677a2d00,
    optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c, instr_count=6) at hj_edu.c:536
    #9 0x000000000042010a in cpsv_edp_CPSV_CKPT_RDSET_info (edu_hdl=0x6381c8, edu_tkn=0x0, ptr=0x7f6b677a1438, ptr_data_len=0x7f6b677a1434, buf_env=0x7f6b677a2d00,
    op=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at cpsv_edu.c:782
    #10 0x00007f6b67359820 in ncs_edu_run_edp (edu_hdl=0x6381c8, edu_tkn=0x0, rule=0x7f6b677a1980, edp=0x41fde9 <cpsv_edp_CPSV_CKPT_RDSET_info>, ptr=0x7f6b677a1438,
    dcnt=0x7f6b677a1434, buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:481
    #11 0x00007f6b6735af04 in ncs_edu_prfm_dec_on_non_ptr (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1980, ptr=0x7f6b60003aa0, ptr_data_len=0x7f6b677a2614,
    buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:1149
    #12 0x00007f6b6735a04e in ncs_edu_perform_exec_action_on_non_ptr (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1980, optype=EDP_OP_TYPE_DEC,
    ptr=0x7f6b60003aa0, ptr_data_len=0x7f6b677a2614, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:810
    ---Type <return> to continue, or q <return> to quit---
    #13 0x00007f6b67359f97 in ncs_edu_perform_exec_action (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1980, optype=EDP_OP_TYPE_DEC, ptr=0x7f6b60003aa0,
    ptr_data_len=0x7f6b677a2614, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:783
    #14 0x00007f6b67359cf0 in ncs_edu_exec_rule (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a1980, ptr=0x7f6b60003aa0, ptr_data_len=0x7f6b677a2614,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:630
    #15 0x00007f6b6735bb3e in ncs_edu_run_rules_for_dec (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, prog=0x7f6b677a1740, ptr=0x7f6b60003aa0, ptr_data_len=0x7f6b677a2614,
    buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c, instr_count=54) at hj_edu.c:1821
    #16 0x00007f6b67359a19 in ncs_edu_run_rules (edu_hdl=0x6381c8, edu_tkn=0x0, prog=0x7f6b677a1740, ptr=0x7f6b60003aa0, ptr_data_len=0x7f6b677a2614, buf_env=0x7f6b677a2d00,
    optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c, instr_count=54) at hj_edu.c:536
    #17 0x000000000042296d in cpsv_edp_CPND_EVT_info (edu_hdl=0x6381c8, edu_tkn=0x0, ptr=0x7f6b677a2618, ptr_data_len=0x7f6b677a2614, buf_env=0x7f6b677a2d00, op=EDP_OP_TYPE_DEC,
    o_err=0x7f6b677a2e6c) at cpsv_edu.c:1524
    #18 0x00007f6b67359820 in ncs_edu_run_edp (edu_hdl=0x6381c8, edu_tkn=0x0, rule=0x7f6b677a2a20, edp=0x4227ff <cpsv_edp_CPND_EVT_info>, ptr=0x7f6b677a2618, dcnt=0x7f6b677a2614,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:481
    #19 0x00007f6b6735af04 in ncs_edu_prfm_dec_on_non_ptr (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a2a20, ptr=0x7f6b60003a90, ptr_data_len=0x7f6b677a2d18,
    buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:1149
    #20 0x00007f6b6735a04e in ncs_edu_perform_exec_action_on_non_ptr (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a2a20, optype=EDP_OP_TYPE_DEC,
    ptr=0x7f6b60003a90, ptr_data_len=0x7f6b677a2d18, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:810
    #21 0x00007f6b67359f97 in ncs_edu_perform_exec_action (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a2a20, optype=EDP_OP_TYPE_DEC, ptr=0x7f6b60003a90,
    ptr_data_len=0x7f6b677a2d18, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c) at hj_edu.c:783
    #22 0x00007f6b67359cf0 in ncs_edu_exec_rule (edu_hdl=0x6381c8, edu_tkn=0x0, hdl_node=0x0, rule=0x7f6b677a2a20, ptr=0x7f6b60003a90, ptr_data_len=0x7f6b677a2d18,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:630
    #23 0x00007f6b6735bb3e in ncs_edu_run_rules_for_dec (edu_hdl=0x6381c8, edu_tkn=0x7f6b677a2c30, hdl_node=0x0, prog=0x7f6b677a2920, ptr=0x7f6b60003a90,
    ptr_data_len=0x7f6b677a2d18, buf_env=0x7f6b677a2d00, o_err=0x7f6b677a2e6c, instr_count=7) at hj_edu.c:1821
    #24 0x00007f6b67359a19 in ncs_edu_run_rules (edu_hdl=0x6381c8, edu_tkn=0x7f6b677a2c30, prog=0x7f6b677a2920, ptr=0x7f6b60003a90, ptr_data_len=0x7f6b677a2d18,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c, instr_count=7) at hj_edu.c:536
    #25 0x0000000000423062 in cpsv_edp_CPSV_EVT_info (edu_hdl=0x6381c8, edu_tkn=0x7f6b677a2c30, ptr=0x7f6b677a2f80, ptr_data_len=0x7f6b677a2d18, buf_env=0x7f6b677a2d00,
    op=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at cpsv_edu.c:1807
    #26 0x00007f6b67359820 in ncs_edu_run_edp (edu_hdl=0x6381c8, edu_tkn=0x7f6b677a2c30, rule=0x0, edp=0x422ef4 <cpsv_edp_CPSV_EVT_info>, ptr=0x7f6b677a2f80, dcnt=0x7f6b677a2d18,
    buf_env=0x7f6b677a2d00, optype=EDP_OP_TYPE_DEC, o_err=0x7f6b677a2e6c) at hj_edu.c:481
    #27 0x00007f6b6735d52d in ncs_edu_perform_dec_op (edu_hdl=0x6381c8, edp=0x422ef4 <cpsv_edp_CPSV_EVT_info>, buf_env=0x7f6b677a2d00, cnt=0x7f6b677a2d18, arg=0x7f6b677a2f80,
    o_err=0x7f6b677a2e6c, var_cnt=0 '\000', var_array=0x0) at hj_edu.c:2997
    #28 0x00007f6b67358efa in ncs_edu_ver_exec (edu_hdl=0x6381c8, edp=0x422ef4 <cpsv_edp_CPSV_EVT_info>, uba=0x7f6b600033c0, op=EDP_OP_TYPE_DEC, arg=0x7f6b677a2f80,
    o_err=0x7f6b677a2e6c, to_version=4, var_cnt=0 '\000') at hj_edu.c:234
    #29 0x0000000000412a9b in cpnd_mds_dec (cb=0x637f80, dec_info=0x7f6b677a2f70) at cpnd_mds.c:522
    #30 0x0000000000411e60 in cpnd_mds_callback (info=0x7f6b677a2f60) at cpnd_mds.c:232
    #31 0x00007f6b67387760 in mds_mcm_do_decode_full_or_flat (svccb=0x649830, cbinfo=0x7f6b677a2f60, recv_msg=0x7f6b600033b8, orig_msg=0x0) at mds_c_sndrcv.c:4933
    #32 0x00007f6b67386a5d in mds_mcm_process_recv_snd_msg_common (svccb=0x649830, recv=0x7f6b600033b8) at mds_c_sndrcv.c:4280
    #33 0x00007f6b67387015 in mcm_recv_normal_sndrsp (svccb=0x649830, recv=0x7f6b600033b8) at mds_c_sndrcv.c:4510
    #34 0x00007f6b67386613 in mds_mcm_ll_data_rcv (recv=0x7f6b600033b8) at mds_c_sndrcv.c:4106
    #35 0x00007f6b67377801 in mdtm_process_recv_message_common (flag=0 '\000', buffer=0x7f6b677a328a "\251", len=52, transport_adest=72075203100876831, seq_num_check=238,
    buff_dump=0x7f6b677a5228) at mds_dt_common.c:472
    #36 0x00007f6b673783bf in mdtm_process_recv_data (buffer=0x7f6b677a3282 "", len=60, transport_adest=72075203100876831, buff_dump=0x7f6b677a5228) at mds_dt_common.c:879
    #37 0x00007f6b67396cc0 in mdtm_process_recv_events () at mds_dt_tipc.c:706
    ---Type <return> to continue, or q <return> to quit---
    #38 0x00007f6b668e87b6 in start_thread () from /lib64/libpthread.so.0
    #39 0x00007f6b666449cd in clone () from /lib64/libc.so.6
    #40 0x0000000000000000 in ?? ()

    Attached the full bt

     
  • Sirisha Alla

    Sirisha Alla - 2014-04-23
    • Priority: minor --> major
     
  • A V Mahesh (AVM)

    • status: unassigned --> review
    • assigned_to: A V Mahesh (AVM)
    • Milestone: 4.5.FC --> 4.4.1
     
  • A V Mahesh (AVM)

    • status: review --> fixed
     
  • A V Mahesh (AVM)

    changeset: 5480:993cbf72f036
    branch: opensaf-4.4.x
    parent: 5473:0b35a99df663
    user: A V Mahesh mahesh.valla@oracle.com
    date: Thu Jul 24 10:58:01 2014 +0530
    summary: cpa: correct peer msg_fmt_ver in cpa_mds_enc function [#745]

    changeset: 5479:a26c875f712f
    user: A V Mahesh mahesh.valla@oracle.com
    date: Thu Jul 24 09:06:46 2014 +0530
    summary: cpa: correct peer msg_fmt_ver in cpa_mds_enc function [#745]

     

    Related

    Tickets: #745


Log in to post a comment.