
#2323 imm: CCB operations fail after SC absence (Headless)

5.0.2
fixed
None
defect
imm
nd
major
2017-03-08
2017-02-23
Hung Nguyen
No

Steps to reproduce:

1. Start SC-1
2. Commit some CCBs
    # immcfg -c Test test=0
    # immcfg -c Test test=1
    # immcfg -c Test test=2
    # immcfg -c Test test=3
3. Start PL-3
4. Restart SC-1
5. When SC-1 is back, adding operations to a CCB fails
    # immcfg -c Test test=10
    error - saImmOmCcbObjectCreate_2 FAILED with SA_AIS_ERR_FAILED_OPERATION (21)
    OI reports: IMM: Resource abort: CCB is not in an expected state
    error - saImmOmCcbApply FAILED: SA_AIS_ERR_FAILED_OPERATION (21)
    OI reports: IMM: Resource abort: CCB is not in an expected state

cb->mLatestCcbId was not updated on PL-3 when it joined the cluster, so it still had the value zero.

When SC-1 came back from headless, IMMND on PL-3 sent a re-introduce message to IMMD on SC-1 with cb->mLatestCcbId = 0.

Because the reported value was zero, IMMD did not update cb->ccb_id_count, so the next CCB to be created gets ID 0 + 1 instead of mLatestCcbId + 1.

That ID conflicts with a CCB already in sCcbVector, so the CCB operation fails.
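
The chain of events above can be illustrated with a small toy model. This is only a sketch, not OpenSAF code: the classes `Immnd` and `Immd`, their methods, and the `sync_latest_on_join` flag are hypothetical, and it assumes the four committed CCBs had IDs 1 to 4 and were still present in sCcbVector on PL-3. Only the attribute comments refer to identifiers named in this ticket. The `True` branch shows one plausible remedy suggested by the analysis (updating the latest CCB ID when the payload joins); the actual patch is not shown here.

```python
# Toy model of the CCB ID handshake after a headless period.
# All names are hypothetical stand-ins, not the real OpenSAF API.

class Immnd:
    """Node-local service that remembers which CCB IDs it has seen."""

    def __init__(self, sync_latest_on_join):
        self.latest_ccb_id = 0    # stands in for cb->mLatestCcbId
        self.ccb_vector = set()   # stands in for sCcbVector
        self.sync_latest_on_join = sync_latest_on_join

    def join(self, existing_ccb_ids):
        """Receive the CCBs already known to the cluster when joining."""
        self.ccb_vector = set(existing_ccb_ids)
        if self.sync_latest_on_join:
            # Assumed remedy: also record the highest ID seen so far.
            self.latest_ccb_id = max(existing_ccb_ids, default=0)

    def create_ccb(self, ccb_id):
        """Return False if the ID collides with a CCB already known."""
        if ccb_id in self.ccb_vector:
            return False
        self.ccb_vector.add(ccb_id)
        self.latest_ccb_id = max(self.latest_ccb_id, ccb_id)
        return True


class Immd:
    """Director that hands out cluster-wide CCB IDs."""

    def __init__(self):
        self.ccb_id_count = 0     # stands in for cb->ccb_id_count

    def reintroduce(self, reported_latest_ccb_id):
        """Adopt the highest CCB ID reported by a surviving node."""
        self.ccb_id_count = max(self.ccb_id_count, reported_latest_ccb_id)

    def next_ccb_id(self):
        self.ccb_id_count += 1
        return self.ccb_id_count


def first_ccb_after_headless(sync_latest_on_join):
    """Replay the scenario and return (new CCB ID, creation succeeded)."""
    committed = [1, 2, 3, 4]             # assumed IDs of the four CCBs
    payload = Immnd(sync_latest_on_join)
    payload.join(committed)              # PL-3 joins the cluster
    director = Immd()                    # SC-1 restarts, counter is reset
    director.reintroduce(payload.latest_ccb_id)
    new_id = director.next_ccb_id()
    return new_id, payload.create_ccb(new_id)


if __name__ == "__main__":
    print(first_ccb_after_headless(False))  # ID reused, creation rejected
    print(first_ccb_after_headless(True))   # ID continues after the old ones
```

In the model, the unsynchronised payload reports 0, the director hands out ID 1 again, and the payload rejects it because ID 1 is already in its vector. With the latest ID recorded at join time, the director continues from 5 and the creation succeeds.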

Logs and traces are attached.

1 Attachments

Related

Tickets: #2323
Wiki: ChangeLog-5.0.2
Wiki: ChangeLog-5.1.1

Discussion

  • Hung Nguyen

    Hung Nguyen - 2017-03-01
    • status: accepted --> review
     
  • Anders Bjornerstedt

    • summary: imm: CCB operations fail after SC absence --> imm: CCB operations fail after SC absence (Headless)
     
  • Anders Bjornerstedt

    Added "headless" clarification because "SC absence" can be misunderstood as just one (out of normally two) SCs being absent.

     

    Last edit: Anders Bjornerstedt 2017-03-01
  • Hung Nguyen

    Hung Nguyen - 2017-03-08
    • status: review --> fixed
     
