#690 Opensaf start failed when RDE could not RESPAWN.

4.2.5
fixed
None
defect
base
-
4.4
minor
2014-02-17
2013-12-24
manu
No

Changeset: 4733
OpenSAF is up and running.

1) Set the SCHED_BATCH policy (3) and a thread priority for MDS in nid.conf:

export OSAF_MDS_SCHED_PRIORITY=5
export OSAF_MDS_SCHED_POLICY=3

2) Restart OpenSAF on the node. The scheduling parameters for MDS fall back to their default values (priority 85, policy 2):

Dec 24 14:37:37 SLES_NEW2 osafckptnd[14637]: scheduling priority 85 for given policy 2 to the task OSAF_TMR
Dec 24 14:37:37 SLES_NEW2 osafckptnd[14637]: scheduling priority 5 for given policy 3 to the task OSAF_MDS is not within the range, setting to default values
Dec 24 14:37:37 SLES_NEW2 osafckptnd[14637]: scheduling priority 85 for given policy 2 to the task OSAF_MDS
Dec 24 14:37:37 SLES_NEW2 osafamfwd[14652]: Started

3) Set SCHED_BATCH again in nid.conf, this time with priority 0:

export OSAF_MDS_SCHED_PRIORITY=0
export OSAF_MDS_SCHED_POLICY=3

4) Restart OpenSAF on the node. This time OpenSAF does not come up, and the MDS scheduling parameters do not fall back to the default values:

Dec 24 14:40:02 SLES_NEW2 opensafd[15010]: ER Timed-out for response from RDE
Dec 24 14:40:02 SLES_NEW2 opensafd[15010]: ER
Dec 24 14:40:02 SLES_NEW2 opensafd[15010]: ER Going for recovery
Dec 24 14:40:02 SLES_NEW2 opensafd[15010]: ER Trying To RESPAWN /usr/lib64/opensaf/clc-cli/osaf-rded attempt #1
Dec 24 14:40:02 SLES_NEW2 opensafd[15010]: ER Sending SIGKILL to RDE, pid=15031
Dec 24 14:40:17 SLES_NEW2 osafrded[15095]: Started
Dec 24 14:40:17 SLES_NEW2 osafrded[15095]: scheduling priority 85 for given policy 2 to the task OSAF_TMR
Dec 24 14:42:17 SLES_NEW2 opensafd[15010]: ER Timed-out for response from RDE
Dec 24 14:42:17 SLES_NEW2 opensafd[15010]: ER Could Not RESPAWN RDE
Dec 24 14:42:17 SLES_NEW2 opensafd[15010]: ER
Dec 24 14:42:17 SLES_NEW2 opensafd[15010]: ER Trying To RESPAWN /usr/lib64/opensaf/clc-cli/osaf-rded attempt #2
Dec 24 14:42:17 SLES_NEW2 opensafd[15010]: ER Sending SIGKILL to RDE, pid=15090
Dec 24 14:42:32 SLES_NEW2 osafrded[15147]: Started
Dec 24 14:42:32 SLES_NEW2 osafrded[15147]: scheduling priority 85 for given policy 2 to the task OSAF_TMR
Dec 24 14:44:32 SLES_NEW2 opensafd[15010]: ER Timed-out for response from RDE
Dec 24 14:44:32 SLES_NEW2 opensafd[15010]: ER Could Not RESPAWN RDE
Dec 24 14:44:32 SLES_NEW2 opensafd[15010]: ER
Dec 24 14:44:32 SLES_NEW2 opensafd[15010]: ER FAILED TO RESPAWN

                                                                 failed

Dec 24 14:44:32 SLES_NEW2 kernel: [ 8481.612777] TIPC: Disabling bearer <eth:eth0>
Dec 24 14:44:32 SLES_NEW2 kernel: [ 8481.612783] TIPC: Left network mode
Dec 24 14:44:32 SLES_NEW2 kernel: [ 8481.612809] NET: Unregistered protocol family 30
Dec 24 14:44:32 SLES_NEW2 kernel: [ 8481.612812] TIPC: Deactivated
Dec 24 14:44:32 SLES_NEW2 opensafd: Starting OpenSAF failed
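
The difference between steps 2 and 4 can be illustrated with a priority range check. The sketch below is not the OpenSAF code; `priority_in_range` is a hypothetical helper that uses the standard `sched_get_priority_min()` and `sched_get_priority_max()` calls. On Linux, SCHED_BATCH only admits priority 0, so priority 5 is rejected (matching the "not within the range" log line in step 2), while priority 0 passes the range check. One plausible reading of step 4, under that assumption, is that the unsupported policy then reaches thread creation unchecked.

```c
#define _GNU_SOURCE   /* exposes SCHED_BATCH in <sched.h> on glibc */
#include <sched.h>
#include <stdbool.h>

/* Hypothetical helper (not from OpenSAF): returns true when `priority`
 * lies within the range the system reports for `policy`, and false when
 * it is outside that range or the policy is unknown to the system. */
bool priority_in_range(int policy, int priority)
{
    int lo = sched_get_priority_min(policy);
    int hi = sched_get_priority_max(policy);

    if (lo == -1 || hi == -1)
        return false;   /* policy not recognized */

    return priority >= lo && priority <= hi;
}
```

With this helper, `priority_in_range(SCHED_BATCH, 5)` is false and `priority_in_range(SCHED_BATCH, 0)` is true on Linux, so a range check alone does not catch the configuration from step 3.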

Related

Tickets: #690

Discussion

  • Anders Widell

    Anders Widell - 2014-01-09
    • Milestone: 4.4.FC --> 4.4.RC1
     
  • Anders Widell

    Anders Widell - 2014-01-17
    • status: unassigned --> accepted
    • assigned_to: Anders Widell
    • Component: rde --> base
    • Priority: major --> minor
    • Milestone: 4.4.RC1 --> 4.2.5
     
  • Anders Widell

    Anders Widell - 2014-01-17

    Here you are trying to set the scheduling policy to 3 (SCHED_BATCH), which is not a valid scheduling policy for threads. We can add a check that falls back to the default policy whenever an invalid policy is encountered.

     
  • Anders Widell

    Anders Widell - 2014-01-17
    • status: accepted --> review
     
  • Anders Widell

    Anders Widell - 2014-01-22
    • status: review --> fixed
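
The fallback proposed in the 2014-01-17 comment could look roughly like the sketch below. It is an illustration only, not the change from the changeset: `is_valid_thread_policy` and `sanitize_sched` are hypothetical names, and the defaults (policy 2, which is SCHED_RR on Linux, and priority 85) are taken from the syslog lines in the report. The set of accepted policies assumes the three that the `pthread_attr_setschedpolicy` manual page lists as supported (SCHED_OTHER, SCHED_FIFO, SCHED_RR).

```c
#include <sched.h>
#include <stdbool.h>

/* Defaults as they appear in the syslog output of the report. */
#define DEFAULT_POLICY   SCHED_RR   /* policy 2 on Linux */
#define DEFAULT_PRIORITY 85

/* Hypothetical helper: accept only the policies documented as supported
 * for thread attributes. SCHED_BATCH (3) is rejected here. */
bool is_valid_thread_policy(int policy)
{
    return policy == SCHED_OTHER || policy == SCHED_FIFO || policy == SCHED_RR;
}

/* Hypothetical helper: replace an invalid policy, or a priority outside
 * the policy's range, with the default policy and priority. */
void sanitize_sched(int *policy, int *priority)
{
    if (!is_valid_thread_policy(*policy)) {
        *policy = DEFAULT_POLICY;
        *priority = DEFAULT_PRIORITY;
        return;
    }

    int lo = sched_get_priority_min(*policy);
    int hi = sched_get_priority_max(*policy);

    if (*priority < lo || *priority > hi) {
        *policy = DEFAULT_POLICY;
        *priority = DEFAULT_PRIORITY;
    }
}
```

Checking the policy before the priority range means that both configurations from the report (policy 3 with priority 5, and policy 3 with priority 0) end up with the defaults.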
     
