#2150 amfd: standby amfd crash while decoding node info during cold sync period

never
invalid
None
defect
amf
d
major
2016-11-15
2016-10-31
No
  • Description:
    During a standby cold sync, there is a chance that node creation
    information is missed on the standby node. When the active node then sends
    information about that node to the standby (checkpointing), the standby amfd
    crashes while decoding it, because the node lookup returns null.
    One way to get over this situation is to create the node when the node is null.
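
The proposed workaround can be sketched as follows. This is a minimal illustration only, not the actual amfd code: `Node`, `NodeDb`, `NodeUpdate`, and `apply_node_update` are hypothetical names invented for this example, and the real checkpoint decode path is assumed to be considerably more involved. The idea is that when a checkpointed update refers to a node the standby has never seen, the standby creates a placeholder entry instead of dereferencing a null pointer.

```cpp
#include <cstddef>
#include <cstdint>
#include <map>
#include <memory>
#include <string>

// Hypothetical stand-in for an AMF node record (not the real amfd type).
struct Node {
  std::string name;
  uint32_t node_id = 0;
  bool created_from_checkpoint = false;  // true if created lazily on decode
};

// Hypothetical stand-in for the standby's in-memory node database.
class NodeDb {
 public:
  // Returns the node, or nullptr if the standby never learned about it.
  Node* find(const std::string& name) {
    auto it = nodes_.find(name);
    return it == nodes_.end() ? nullptr : it->second.get();
  }

  // Creates the node if it does not exist yet and returns it.
  Node* create(const std::string& name) {
    auto& slot = nodes_[name];
    if (!slot) {
      slot = std::make_unique<Node>();
      slot->name = name;
    }
    return slot.get();
  }

  std::size_t size() const { return nodes_.size(); }

 private:
  std::map<std::string, std::unique_ptr<Node>> nodes_;
};

// Hypothetical decoded checkpoint message carrying node information.
struct NodeUpdate {
  std::string name;
  uint32_t node_id;
};

// Applies a checkpointed node update on the standby. If the node creation
// was missed during cold sync, create the node here instead of crashing.
Node* apply_node_update(NodeDb& db, const NodeUpdate& msg) {
  Node* node = db.find(msg.name);
  if (node == nullptr) {
    node = db.create(msg.name);
    node->created_from_checkpoint = true;
  }
  node->node_id = msg.node_id;
  return node;
}
```

In this sketch, calling `apply_node_update` for an unknown node such as PL-6 adds one entry to the database, and a second update for the same name reuses that entry. A real fix would also have to decide how the remaining attributes of such a lazily created node get populated.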

  • Reproduction:
    1) Start a cluster (e.g. 5 nodes).
    2) On the standby controller, add a sleep (e.g. 5 seconds) to main.cc:
    3) Reboot the standby controller.
    4) Use the scale_opensaf script in the python/samples directory to add a node (e.g. PL-6) while the standby is rebooting.
    5) Observe a coredump on the standby node.

1 Attachments

Discussion

  • Long H Buu Nguyen

    • status: unassigned --> accepted
    • assigned_to: Long HB Nguyen
     
  • Long H Buu Nguyen

    • Description has changed:

    Diff:

    --- old
    +++ new
    @@ -2,7 +2,7 @@
       When standby cold sync occurs, there may have a chance that 
       node creation information is missed in standby node. Active node sending
       node information to standby node (checkpointing) will lead to a standby amfd crash.
    -  One way to get over this situation is to re-read node info when the node info is null.
    +  One way to get over this situation is to create node when the node is null.
    
     - Reproduction:
     1) Start a cluster (e.g. 5 nodes).
    
    • status: accepted --> review
     
  • Long H Buu Nguyen

    • status: review --> invalid
    • Milestone: 5.0.2 --> never
     
  • Long H Buu Nguyen

    With the patch of #2112, this issue does not occur.

     
