From: Bangalore R. R. <rak...@rb...> - 2024-07-17 12:18:48
|
Hi I tried stopping also , basically when quorum fails that's is 2 Outta 3 nodes are not reachable then node goes to reboot . If etcdctl command fails when quorum fails .which is causing issue . Regards Rakesh ________________________________ From: Thang Nguyen <tha...@en...> Sent: Wednesday, July 17, 2024 08:29 To: Bangalore Ramesh, Rakesh <rak...@rb...>; ope...@li... <ope...@li...> Subject: [EXTERNAL] RE: Regarding FMS_RELAXED_NODE_PROMOTION in fmd.conf Hi Rakesh, >From my understanding, if export the below in fmd.conf export FMS_RELAXED_NODE_PROMOTION=1 The SC will not be rebooted if it lost connection to etcd consensus service but still see the peer SC. Could you share the fmd.conf and all the syslog ? B.R/ Thang D Nguyen -----Original Message----- From: Bangalore Ramesh, Rakesh <rak...@rb...> Sent: Friday, July 12, 2024 1:49 PM To: ope...@li... Subject: [devel] Regarding FMS_RELAXED_NODE_PROMOTION in fmd.conf CAUTION - EXTERNAL EMAIL Hi All, I am trying to avoid opensaf bringing node down cause of no quorum with etcd cluster . I saw this config in fmd. eventhough i have enabled it , # Default behaviour is not to allow promotion of this node to Active # unless a lock can be obtained, if split brain prevention is enabled. # Uncomment the next line to allow promotion of this node at cluster startup, # if a peer SC can be seen and we have a lower node ID, in the event the # consensus service is not available. # Also if the consensus service is down, but a peer SC can be seen, # then an active SC may remain active. # This mode should not be used together with the roaming SC feature # Default is 0 #export FMS_RELAXED_NODE_PROMOTION=0 opensaf fails with error given below - Jul 11 23:24:15 rakeshgr01 osafamfd[6898]: NO (KeyValue::Execute): Executed '/opt/opensaf/osaf-etcd3.plugin unlock "SC-1"', returning 2 Jul 11 23:24:15 rakeshgr01 osafamfd[6898]: WA Unlock failed (6) Jul 11 23:24:15 rakeshgr01 osafamfd[6898]: ER Failed to demote this node from consensus service Jul 11 23:24:15 rakeshgr01 osafimmnd[5987]: WA Failed to retrieve search continuation, client died ? Jul 11 23:24:15 rakeshgr01 osafimmnd[5987]: NO Implementer disconnected 4 <21, 2010f> (safAmfService) Jul 11 23:24:15 rakeshgr01 osafimmnd[5987]: NO Implementer (applier) connected: 13 (@safAmfService2010f) <21, 2010f> Need help , how to use this config , so even if quorum is failed , I need the instance to not go from reboot . Regards, Rakesh Disclaimer This e-mail together with any attachments may contain information of Ribbon Communications Inc. and its Affiliates that is confidential and/or proprietary for the sole use of the intended recipient. Any review, disclosure, reliance or distribution by others or forwarding without express permission is strictly prohibited. If you are not the intended recipient, please notify the sender immediately and then delete all copies, including any attachments. _______________________________________________ Opensaf-devel mailing list Ope...@li... https://lists.sourceforge.net/lists/listinfo/opensaf-devel<https://lists.sourceforge.net/lists/listinfo/opensaf-devel> The information in this email is confidential and may be legally privileged. It is intended solely for the addressee. Any opinions expressed are mine and do not necessarily represent the opinions of the Company. Emails are susceptible to interference. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is strictly prohibited and may be unlawful. If you have received this message in error, do not open any attachments but please notify the Endava Service Desk on (+44 (0)870 423 0187), and delete this message from your system. The sender accepts no responsibility for information, errors or omissions in this email, or for its use or misuse, or for any act committed or omitted in connection with this communication. If in doubt, please verify the authenticity of the contents with the sender. Please rely on your own virus checkers as no responsibility is taken by the sender for any damage rising out of any bug or virus infection. Endava plc is a company registered in England under company number 5722669 whose registered office is at 125 Old Broad Street, London, EC2N 1AR, United Kingdom. Endava plc is the Endava group holding company and does not provide any services to clients. Each of Endava plc and its subsidiaries is a separate legal entity and has no liability for another such entity's acts or omissions. |