From: Hans F. <han...@er...> - 2013-04-05 13:21:03
More likely the same dtm bug I reported last week or so.

Neelakanta Reddy <red...@or...> wrote:

Hi,

1. The node_name, slot_id and nodes.cfg look correct.
2. A connection problem is observed on the given payload:

   Apr 5 09:27:07 payload3 opensafd: Starting OpenSAF Services
   Apr 5 09:27:07 payload3 osafdtmd[2857]: Started
   Apr 5 09:27:07 payload3 osafimmnd[2875]: Started
   Apr 5 09:27:07 payload3 osafimmnd[2875]: Director Service is up
   Apr 5 09:27:07 payload3 osafimmnd[2875]: MDTM:socket_recv() = 0, conn lost with dh server, exiting library
   Apr 5 09:27:08 payload3 osafdtmd: osafdtmd Process down, Rebooting the node

   The log above shows a connection issue. Please share the controller1
   messages so that we can see what state controller1 is in.
3. Check the network connectivity between the nodes.
4. When sharing the logs, include the following:
   a. syslog of all the nodes you have started
   b. dtmd traces, which are available at /var/log/opensaf/

   dtmd traces can be enabled in /etc/opensaf/

   # Uncomment the next line to enable trace
   args="--tracemask=0xffffffff"

Neel.

On Friday 05 April 2013 06:04 PM, Nivrutti Kale wrote:

Hi All,

Continuing the attachments.

Thanks & Regards

On Fri, Apr 5, 2013 at 5:26 PM, Neelakanta Reddy <red...@or...> wrote:

Hi,

The node names in /etc/opensaf/node_name and the imm.xml do not match.
Hans, you are correct; I overlooked this in the logs.

Please correct the node_name and slot_id according to the imm.xml (if you
are using the imm.xml generated by opensaf).

For example:

controller2:
   /etc/opensaf/node_name   SC-2
   /etc/opensaf/slot_id     2

payload1:
   node_name   PL-3
   slot_id     3

payload2:
   node_name   PL-4
   slot_id     4

Neel.

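The node_name and slot_id advice above can be checked on each node before starting OpenSAF. The following is only a rough sketch, not an OpenSAF tool: `check_node_identity` is a hypothetical helper, and it rests on two assumptions that are not confirmed in this thread: that a correctly configured node name appears literally somewhere in imm.xml, and that the slot id equals the numeric suffix of the node name (as in the SC-2 / 2 and PL-3 / 3 examples).

```python
import re
from pathlib import Path


def check_node_identity(conf_dir, imm_xml):
    """Return a list of human-readable problems; an empty list means none found."""
    conf_dir = Path(conf_dir)
    node_name = (conf_dir / "node_name").read_text().strip()
    slot_id = (conf_dir / "slot_id").read_text().strip()
    imm_text = Path(imm_xml).read_text()

    problems = []

    # Assumption: a correctly configured node name occurs literally in imm.xml.
    # The lookarounds stop "SC-2" from matching inside a longer name like "SC-20".
    pattern = r"(?<![\w-])" + re.escape(node_name) + r"(?![\w-])"
    if not node_name or not re.search(pattern, imm_text):
        problems.append(f"node_name {node_name!r} not found in {imm_xml}")

    # Assumption (taken from the examples in this thread): the slot id equals
    # the numeric suffix of the node name, e.g. SC-2 -> 2, PL-3 -> 3.
    suffix = re.search(r"(\d+)$", node_name)
    if not slot_id.isdigit():
        problems.append(f"slot_id {slot_id!r} is not a number")
    elif suffix and int(suffix.group(1)) != int(slot_id):
        problems.append(
            f"slot_id {slot_id} does not match the suffix of node_name {node_name!r}"
        )

    return problems
```

On a node it could be called as `check_node_identity("/etc/opensaf", "/etc/opensaf/imm.xml")`, with the imm.xml path adjusted if the file lives elsewhere. A non-empty result only flags something to inspect by hand; it does not prove the configuration is wrong.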
On Friday 05 April 2013 05:19 PM, Neelakanta Reddy wrote:
> On Friday 05 April 2013 04:32 PM, Hans Feldt wrote:
>> On 04/05/2013 01:09 PM, Neelakanta Reddy wrote:
>>> Hi,
>>>
>>> Depending on the requirement, whichever controller is to start first
>>> must have either imm.xml or imm.db. The other controllers or payloads
>>> will sync accordingly.
>>>
>>> In this case controller2 is failing because there is no imm.xml in
>>> the /etc/opensaf directory.
>> No, SC2 connects with SC1 and takes the standby role. IMM on SC2 is not
>> loading from file. But of course IMM on both controllers needs to have
>> access to either the same imm.xml via a shared file system or to local
>> identical copies.
>> /Hans
>>
> In this case, the user wants to bring up SC2 first, so SC2 will become
> active. In that case imm.xml must be present on SC-2.
>
> Neel.
>>> Neel.
>>>
>>> On Friday 05 April 2013 04:15 PM, Nivrutti Kale wrote:
>>>> Hi All,
>>>>
>>>> I have configured OpenSAF to run on 2 controllers and 6 payload
>>>> nodes.
>>>> I am using virtual machines on ESXi.
>>>> The OpenSAF release is 4.2.2.
>>>>
>>>> I am able to start OpenSAF on SC-1, but not on the other nodes.
>>>> Even if controller 1 is down, I am not able to start controller2 (SC-2).
>>>> I am attaching logs from my attempt to start controller2.
>>>>
>>>> If anyone has faced similar issues, let me know.
>>>>
>>>> Thanks & Regards
>>>> Nivrutti
>>>>
>>>> _______________________________________________
>>>> Opensaf-users mailing list
>>>> Ope...@li...
>>>> https://lists.sourceforge.net/lists/listinfo/opensaf-users