When Standby IMMD is up at the same time with a IMMND exiting, the info of that IMMND might not be removed from immnd_tree of the Standby IMMD.
Details of the problem is explained in the sequence diagram below
sequence diagram
SC-5 was Active, SC-2 was Standby, IMMND on SC-1 was exiting
18:35:03 SC-1 osafimmnd[441]: exiting for shutdown
18:35:03 SC-2 osafrded[413]: NO RDE role set to STANDBY
18:35:03 SC-2 osafimmd[430]: NO MDS event from svc_id 25 (change:3, dest:568511936070075)
18:35:03 SC-2 osafimmd[430]: NO MDS event from svc_id 25 (change:3, dest:567412424442298)
18:35:03 SC-2 osafimmd[430]: NO MDS event from svc_id 25 (change:3, dest:566312912814523)
18:35:03 SC-2 osafimmd[430]: NO MDS event from svc_id 25 (change:3, dest:565213401186744)
18:35:03 SC-5 osafimmd[433]: NO MDS event from svc_id 25 (change:4, dest:564113889558969)
Down event for IMMND@SC-1 was received on SC-5 but not on SC-2.
The symptoms:
18:35:11 SC-2 osafimmd[430]: WA IMMND coordinator at 2050f apparently crashed => electing new coord
No more logs about newly elected coordinator were printed out.
18:35:29 SC-1 osafimmnd[441]: NO SERVER STATE: IMM_SERVER_ANONYMOUS --> IMM_SERVER_CLUSTER_WAITING
18:35:29 SC-1 osafimmnd[441]: NO This IMMND is now the NEW Coord
18:35:29 SC-1 osafimmnd[441]: ER 3 > 0, exiting
I the defect only occurs in a headless system, then I think the ticket slogan, or at least the description sholud say so.
5.17.08 (develop) [code:85c90b]
5.17.06 (release) [code:c1a37f]
default (mercurial) [staging:dc6067]
Related
Commit: [dc6067]
Tickets:
#2418Commit: [85c90b]
Commit: [c1a37f]
Re-open this ticket since the new active IMMD (switches from STANDBY role) has problem with dead IMMND in the immnd_tree. The dead IMMND should be cleanup before switching to ACTIVE.
5.17.08 (develop) [code:ff044b]
5.17.06 (release) [code:b6d724]
Related
Commit: [b6d724]
Commit: [ff044b]