
#1788 cpsv: saCkptCheckpointWrite() returns SA_AIS_ERR_NOT_EXIST after headless state

5.0.1
fixed
None
defect
ckpt
-
5.0
major
2016-09-09
2016-04-28
No

The problem happened in the following scenario:

  1. An application calls saCkptCheckpointOpen() to create a collocated checkpoint on SC-2. The checkpoint replica on SC-2 is the active replica.
  2. An application calls saCkptCheckpointOpen() to open the collocated checkpoint on PL-5.
  3. The application creates a section and accesses the checkpoint on PL-5.
  4. Both SCs go down.
  5. Both SCs come up again.
  6. The application accesses the checkpoint with saCkptCheckpointWrite(). The error code SA_AIS_ERR_NOT_EXIST is returned.

This problem occurs because the osafckptnd process ID on SC-2 is the same before and after the headless state, so its MDS destination is also the same. When SC-2 comes back up, there is a short window in which CPD has not yet assigned a new active replica. During that window the application sends its checkpoint access request to the CPND on SC-2, which no longer hosts the active replica, and the CPND returns SA_AIS_ERR_NOT_EXIST.
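
The effect of the unchanged destination can be illustrated with a small, self-contained model. This is only a sketch and is not OpenSAF code: `dest_t`, `cpnd_t`, `rc_t`, and every function below are hypothetical stand-ins, and it assumes that a destination is identified by a node ID plus a process ID, which is a simplification of the real MDS destination.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-ins for illustration only; not the OpenSAF types. */
typedef enum { RC_OK, RC_ERR_NOT_EXIST } rc_t;

/* Assumption: a destination is identified by node ID plus process ID. */
typedef struct {
    uint32_t node_id;
    uint32_t pid;
} dest_t;

/* Toy node director: its address and whether it hosts the active replica. */
typedef struct {
    dest_t dest;
    bool hosts_active_replica;
} cpnd_t;

static bool dest_equal(dest_t a, dest_t b)
{
    return a.node_id == b.node_id && a.pid == b.pid;
}

/* Client-side view: does the cached active-replica address still match
 * the node director that is currently running? */
bool cached_dest_looks_valid(dest_t cached, const cpnd_t *current)
{
    assert(current != NULL);
    return dest_equal(cached, current->dest);
}

/* A write succeeds only if this node director hosts the active replica. */
rc_t cpnd_write(const cpnd_t *nd)
{
    assert(nd != NULL);
    return nd->hosts_active_replica ? RC_OK : RC_ERR_NOT_EXIST;
}

/* A restarted node director has no active replica until one is assigned. */
cpnd_t cpnd_restart(uint32_t node_id, uint32_t new_pid)
{
    cpnd_t nd = { { node_id, new_pid }, false };
    return nd;
}
```

In this model, if the restarted node director comes back with the same PID, `cached_dest_looks_valid()` still returns true, so a client has no reason to distrust its cached address and `cpnd_write()` fails with `RC_ERR_NOT_EXIST`. If the PID differs, the mismatch is visible to the client before any request is sent.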

Related

Tickets: #1788
Wiki: ChangeLog-5.0.1

Discussion

  • Pham Hoang Nhat

    Pham Hoang Nhat - 2016-04-28
    • status: assigned --> accepted
     
  • Mathi Naickan

    Mathi Naickan - 2016-04-28
    • Milestone: 5.0.RC2 --> 5.0.GA
     
  • Pham Hoang Nhat

    Pham Hoang Nhat - 2016-05-04
    • Milestone: 5.0.GA --> 5.1.FC
     
  • Mathi Naickan

    Mathi Naickan - 2016-05-04
    • Milestone: 5.1.FC --> 5.0.1
     
  • Pham Hoang Nhat

    Pham Hoang Nhat - 2016-05-13
    • status: accepted --> review
     
  • A V Mahesh (AVM)

    • status: review --> fixed
     
  • A V Mahesh (AVM)

    changeset: 8036:62fe50517eaf
    user: Hoang Vo hoang.m.vo@dektech.com.au
    date: Fri Sep 09 15:00:35 2016 +0530
    summary: cpd: To reduce updating time out [#1788]

    changeset: 8037:435ec42e4847
    branch: opensaf-5.1.x
    parent: 8034:4d1aef39599a
    user: Hoang Vo hoang.m.vo@dektech.com.au
    date: Fri Sep 09 15:01:23 2016 +0530
    summary: cpd: To reduce updating time out [#1788]

    changeset: 8038:bc1bac48177a
    branch: opensaf-5.0.x
    tag: tip
    parent: 8027:bc9afc86a424
    user: Hoang Vo hoang.m.vo@dektech.com.au
    date: Fri Sep 09 15:02:40 2016 +0530
    summary: cpd: To reduce updating time out [#1788]

     


