
#426 AMF: IMM returns ERR_TRY_AGAIN for saImmOiRtObjectUpdate() in an IMM initiated callback

4.3.3
fixed
None
defect
amf
-
4.2.1
major
2015-02-02
2013-05-31
No

Migrated from http://devel.opensaf.org/ticket/2821

Changeset: 3730
Redundancy model: NWAY

Through an SMF campaign, performed lock and lock-instantiation (lock-in) of an SU, then deleted its components and the SupportedCsTypes objects of class CompCSType.

Observed that saImmOiRtObjectUpdate of the attributes below is being performed by AMF, which should not happen as these are not runtime-modifiable:
1. saAmfCompNumCurrActiveCSIs
2. saAmfSURestartCount
3. saAmfSUNumCurrStandbySIs
4. saAmfSUNumCurrActiveSIs

/var/log/messages shows:

Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safSupportedCsType=safVersion=4.0.0\,safCSType=NWAYCSBASETYPE_PI,safComp=COMP1SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompNumCurrActiveCSIs failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safSupportedCsType=safVersion=4.0.0\,safCSType=NWAYCSBASETYPE_PI,safComp=COMP2SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompNumCurrActiveCSIs failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safSupportedCsType=safVersion=4.0.0\,safCSType=NWAYCSBASETYPE_PI,safComp=COMP3SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompNumCurrActiveCSIs failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP1SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompRestartCount failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP1SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompCurrProxyName failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP2SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompRestartCount failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP2SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompCurrProxyName failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP3SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompRestartCount failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP3SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompCurrProxyName failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfSURestartCount failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfSUNumCurrStandbySIs failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfSUNumCurrActiveSIs failed with 6
Sep 24 17:59:30 SLES11-SLOT-1 osafimmnd[22297]: NO Ccb 140 COMMITTED (SMFSERVICE)
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: ER job_exec_imm_objupdate: update FAILED 12
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: ER job_exec_imm_objupdate: update FAILED 12
Sep 24 17:59:30 SLES11-SLOT-1 osafimmnd[22297]: NO Create of PERSISTENT runtime object 'smfRollbackData=00000001,smfRollbackElement=ccb_00000009,smfRollbackElement=ProcWrapup,safSmfProc=amfClusterProc-1,safSmfCampaign=Campaign,safApp=safSmfService' by Impl safSmfCampaign=Campaign,safApp=safSmfService
Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: ER job_exec_imm_objupdate: update FAILED 12

Changed 8 months ago by hafe
■version changed from 4.2.2 to 4.2.1
Kind of a duplicate of http://devel.opensaf.org/ticket/2227

Changed 8 months ago by hafe
Could you please rerun the test case with amfd trace on?

Changed 8 months ago by hrishikesh.chenna
■attachment osafamfd.tgz added
Attached the amfd trace file for the same timestamp as in the ticket log snippet.

Changed 8 months ago by hafe
■summary changed from Non modifiable runtime attributes are being modified by AMF and returns ERR_TRY_AGAIN. to IMM returns ERR_TRY_AGAIN for saImmOiRtObjectUpdate() in an IMM initiated callback
The AMF model contains classes with pure runtime attributes. What is shown in the trace is just a normal read of such an object, which results in a callback to avd to return values to IMM. Thus the ticket summary was wrong and has now been changed.

The problem here is that IMM responds with TRY_AGAIN in the context of an IMM callback! The complete syslog and immnd trace are not attached, so there is not much more that can be done. It would help if they were provided.

However, this log:

Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: ER job_exec_imm_objupdate: update FAILED 12

could be avoided, since it means that a deferred object update (queued due to an earlier TRY_AGAIN) fails because the object no longer exists in IMM. At object deletion, avd could walk the deferred IMM job list and prune the jobs that are affected.
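
A minimal sketch of that pruning idea, assuming a hypothetical singly linked deferred-job list (the real AMFD job queue structures differ; only SaNameT comes from the SAF headers):

    #include <stdlib.h>
    #include <string.h>
    #include <saAis.h>

    /* Hypothetical node describing one deferred rt-object update. */
    struct ImmUpdateJob {
            SaNameT object_name;               /* object the deferred update targets */
            struct ImmUpdateJob *next;
    };

    static struct ImmUpdateJob *job_list_head;  /* hypothetical deferred job list */

    /* On object deletion, unlink and free every queued update that refers
     * to the deleted object, so it can never fail later with ERR_NOT_EXIST
     * and trigger the ER log shown above. */
    static void prune_jobs_for_deleted_object(const SaNameT *deleted)
    {
            struct ImmUpdateJob **pp = &job_list_head;

            while (*pp != NULL) {
                    struct ImmUpdateJob *job = *pp;
                    if (job->object_name.length == deleted->length &&
                        memcmp(job->object_name.value, deleted->value,
                               deleted->length) == 0) {
                            *pp = job->next;    /* unlink and discard the stale job */
                            free(job);
                    } else {
                            pp = &job->next;
                    }
            }
    }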

Changed 8 months ago by anders
Severity of:

Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: ER job_exec_imm_objupdate: update FAILED 12

should not be ER unless AMFD is terminating due to this, which would seem excessive.

Severity of this:

Sep 24 17:59:30 SLES11-SLOT-1 osafamfd[22351]: WA saImmOiRtObjectUpdate of 'safComp=COMP3SU5NWAYAPP,safSu=SU5,safSg=SGONE,safApp=NWAYAPP' saAmfCompCurrProxyName failed with 6

should not be WA and should not even be logged, since the error is simply TRY_AGAIN.

When/if the service/OI has to give up its retry attempts, due to its
realtime behavior requirements and duties towards other tasks, then
it should simply give up and return ERR_NO_RESOURCES on the callback.
(Ideally it would return ERR_TRY_AGAIN, but that is not allowed according to
the spec.)
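
A minimal sketch of that give-up pattern, with a bounded retry loop around saImmOiRtObjectUpdate_2() (the retry count and back-off values are illustrative assumptions, not the actual AMFD code):

    #include <unistd.h>
    #include <saImmOi.h>

    /* Retry the rt update a bounded number of times on TRY_AGAIN, then give
     * up and report NO_RESOURCES so the OI callback can be failed instead of
     * blocking the OI thread indefinitely. */
    static SaAisErrorT rt_update_with_bounded_retry(SaImmOiHandleT oi_handle,
                                                    const SaNameT *object,
                                                    const SaImmAttrModificationT_2 **mods)
    {
            SaAisErrorT rc;
            int attempts = 0;

            do {
                    rc = saImmOiRtObjectUpdate_2(oi_handle, object, mods);
                    if (rc != SA_AIS_ERR_TRY_AGAIN)
                            break;
                    usleep(100 * 1000);  /* short back-off between attempts */
            } while (++attempts < 5);

            /* The spec does not allow TRY_AGAIN as a callback result, so map
             * a final TRY_AGAIN to NO_RESOURCES as suggested above. */
            return (rc == SA_AIS_ERR_TRY_AGAIN) ? SA_AIS_ERR_NO_RESOURCES : rc;
    }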

Changed 8 months ago by anders
■milestone changed from future_releases to 4.2.3
Changed 8 months ago by anders
The description for this ticket suggests that the local IMMND has crashed and restarted.
If that is the case, then that should of course be investigated.
If it is not a known and fixed problem, then a ticket should be written.

It could I suppose be an effect of a node being shut down, as part of the SMF campaign.

That would be covered by http://devel.opensaf.org/ticket/2099

Changed 8 months ago by hafe
Replying to anders:

The description for this ticket suggests that the local IMMND has crashed and restarted.
If that is the case, then that should of course be investigated.
If it is not a known and fixed problem, then a ticket should be written.

It could I suppose be an effect of a node being shut down, as part of the SMF campaign.

That would be covered by http://devel.opensaf.org/ticket/2099

From the traces I can see that IMMND has not crashed. It is alive and executing a CCB where objects are deleted. What is strange is that an object delete is followed by a read of that same object while the CCB is still not applied. The object contains pure runtime attributes, so AMFD's SaImmOiRtAttrUpdateCallbackT is invoked. The callback tries to return attribute values to IMMND with saImmOiRtObjectUpdate() but receives TRY_AGAIN. This is logged at Warning level since it is perceived as strange. The TRY_AGAIN causes the update to be put into AMFD's job queue. Later, when the job is processed, the object no longer exists (for real), but a logging bug in job_exec_imm_objupdate() causes this to be logged at Error level.
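
A sketch of how the deferred execution described above could treat a deleted object as benign (hypothetical helper and types, not the real job_exec_imm_objupdate()):

    #include <syslog.h>
    #include <saImmOi.h>

    /* Hypothetical descriptor for a deferred rt update parked in the job queue. */
    struct DeferredRtUpdate {
            SaImmOiHandleT oi_handle;
            SaNameT object_name;
            const SaImmAttrModificationT_2 **attr_mods;
    };

    /* Execute one deferred update.  ERR_NOT_EXIST just means the object was
     * deleted while the job waited, so drop it quietly; only unexpected
     * errors deserve Error-level logging. */
    static void exec_deferred_rt_update(struct DeferredRtUpdate *job)
    {
            SaAisErrorT rc = saImmOiRtObjectUpdate_2(job->oi_handle,
                                                     &job->object_name,
                                                     job->attr_mods);
            switch (rc) {
            case SA_AIS_OK:
            case SA_AIS_ERR_NOT_EXIST:      /* object gone: not an error */
                    break;
            case SA_AIS_ERR_TRY_AGAIN:
                    /* leave the job queued for the next attempt (not shown) */
                    break;
            default:
                    syslog(LOG_ERR, "deferred rt object update failed: %d", (int)rc);
                    break;
            }
    }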

So my conclusion:
1) AMFD's logging could be changed and corrected (but that would only hide this)

2) IMMND is returning TRY_AGAIN while at the same time calling an implementer's SaImmOiRtAttrUpdateCallbackT, which is weird. Why bother calling the implementer if it is not allowed to reply anyway?

3) Most likely SMF is constantly reading deleted (CCB not yet applied) objects! Why is that? If the read is really necessary, it is probably only interested in config attributes anyway.

Would it be possible to get SMFD and IMMND traces also?

Changed 5 months ago by anders
Went back to look at this unsolved ticket because it has similarities with #2922.
It could be (it's likely) that the TRY_AGAIN from IMMND on the rt update is due to the active IMMD being down.

Unfortunately, the only info attached to this ticket is the osafamfd trace.
But it appears that the AMFD gets stuck in its progress of performing the
switchover. The key question is whether the AMFD can get blocked on IMM requests
during the switchover. My understanding was that the AMFD, when getting TRY_AGAIN
on such a job, would park the imm-rt-update in the AMFD's job queue, so it should
not get stuck and the switchover should complete.
Perhaps that is indeed the case.
It is not totally clear what problem this ticket is reporting.

The ticket summary complains that an IMM downcall inside an IMM callback gets TRY_AGAIN
from the IMM. But that is the way it is: the cluster state may change in the time
between the IMMND sending the callback request to the IMMA client and the IMMA client
acting on that callback. This is a possibility and not an error.
The implementer of the callback could return an error on the callback as a way
to get rid of that job.

Changed 4 months ago by hafe
■milestone changed from 4.2.3 to future_releases
Removing milestone; the ticket is not accepted and thus not scheduled for 4.2.3.


Related

Tickets: #426
Wiki: ChangeLog-4.3.3
Wiki: ChangeLog-4.4.1

Discussion

  • Nagendra Kumar

    Nagendra Kumar - 2013-09-06
    • status: unassigned --> assigned
    • assigned_to: Nagendra Kumar
    • Milestone: future --> 4.4.FC
     
  • Nagendra Kumar

    Nagendra Kumar - 2013-12-02
    • assigned_to: Nagendra Kumar --> nobody
    • Milestone: 4.4.FC --> 4.2.5
     
  • Anders Bjornerstedt

    • summary: IMM returns ERR_TRY_AGAIN for saImmOiRtObjectUpdate() in an IMM initiated callback --> AMF: IMM returns ERR_TRY_AGAIN for saImmOiRtObjectUpdate() in an IMM initiated callback
     
  • Anders Bjornerstedt

    This is a very strange and complicated case and the information (syslogs)
    provided is incomplete.

    I suggest that the AMF gets out of this situation by abandoning the
    attempt to update the pure RTA. Instead it would just return
    ERR_NO_RESOURCES on the rtUpdate callback.

    The only function compromised by that would be some read performed by
    some OM client; it would get an error on that read.
    If (as I suspect) the root cause is an ongoing fail-over or switch-over
    and the unavailability of the IMMD (no functioning active director),
    then the case should be rare. A user getting an error on a search request
    during a failover/switchover should not result in anything catastrophic.

     
  • Anders Bjornerstedt

    • status: assigned --> unassigned
     
  • Praveen

    Praveen - 2014-02-17
    • Milestone: 4.2.5 --> 4.3.3
     
  • Nagendra Kumar

    Nagendra Kumar - 2014-06-16
    • status: unassigned --> assigned
    • assigned_to: Nagendra Kumar
     
  • Nagendra Kumar

    Nagendra Kumar - 2014-07-31

    I could reproduce it by adding a sleep in compcstype_ccb_completed_cb after CCBUTIL_DELETE:

        case CCBUTIL_DELETE:
                TRACE_ENTER2("Before nag CCB ID ");
                sleep(4);

    The following commands were run:
    immcfg -f /tmp/AppConfig-2N.xml
    From one terminal: immcfg -d safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1;
    From another terminal: immlist safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1;

    Jul 31 12:10:10 PM_SC-180 osafamfd[31053]: WA saImmOiRtObjectUpdate of 'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' saAmfSURestartCount failed with 6
    Jul 31 12:10:10 PM_SC-180 osafimmnd[30996]: NO Ccb 4 COMMITTED (immcfg_PM_SC-180_31433)
    Jul 31 12:10:10 PM_SC-180 osafamfd[31053]: WA saImmOiRtObjectUpdate of 'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' saAmfSUNumCurrStandbySIs failed with 12
    Jul 31 12:10:10 PM_SC-180 osafamfd[31053]: WA saImmOiRtObjectUpdate of 'safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' saAmfSUNumCurrActiveSIs failed with 12
    Jul 31 12:10:10 PM_SC-180 osafimmnd[30996]: ER Internal IMM server problem - failure from internal searchInit: 12

    Also:
    immcfg -d "safSupportedCsType=safVersion=1\,safC SType=AmfDemo1,safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1"
    immlist "safSupportedCsType=safVersion=1\,safCSType=AmfDemo1,safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1"

    Jul 31 11:48:04 PM_SC-180 osafimmnd[25894]: NO Ccb 2 COMMITTED (immcfg_PM_SC-180_26142)
    Jul 31 11:48:09 PM_SC-180 osafamfd[25952]: WA saImmOiRtObjectUpdate of 'safSupportedCsType=safVersion=1\,safCSType=AmfDemo1,safComp=AmfDemo,safSu=SU1,safSg=AmfDemo,safApp=AmfDemo1' saAmfCompNumCurrActiveCSIs failed with 6
    Jul 31 11:48:09 PM_SC-180 osafimmnd[25894]: NO Ccb 3 COMMITTED (immcfg_PM_SC-180_26146)
    Jul 31 11:48:09 PM_SC-180 osafimmnd[25894]: ER Internal IMM server problem - failure from internal searchInit: 12

    So, the fix as suggested by Anders Bj could be to return SA_AIS_ERR_FAILED_OPERATION from compcstype_rt_attr_callback when avd_saImmOiRtObjectUpdate_sync fails. Anyway, the object is being deleted, so the attribute update would not be visible to anybody.
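
    An illustrative sketch of that direction (the handler and helper names here are placeholders, not the committed change referenced below):

        #include <saImmOi.h>

        /* Hypothetical helper that builds the modification list for the
         * requested runtime attributes; the real AMFD code does this inline. */
        const SaImmAttrModificationT_2 **build_rt_attr_mods(const SaNameT *object,
                                                            const SaImmAttrNameT *names);

        /* SaImmOiRtAttrUpdateCallbackT-style handler: if the rt object update
         * fails (e.g. the object is being deleted), fail the whole callback
         * with FAILED_OPERATION instead of retrying against a dying object. */
        static SaAisErrorT comp_cs_type_rt_attr_cb(SaImmOiHandleT immOiHandle,
                                                   const SaNameT *objectName,
                                                   const SaImmAttrNameT *attributeNames)
        {
                const SaImmAttrModificationT_2 **mods =
                        build_rt_attr_mods(objectName, attributeNames);

                if (mods == NULL ||
                    saImmOiRtObjectUpdate_2(immOiHandle, objectName, mods) != SA_AIS_OK)
                        return SA_AIS_ERR_FAILED_OPERATION;

                return SA_AIS_OK;
        }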

     
  • Nagendra Kumar

    Nagendra Kumar - 2014-07-31
    • status: assigned --> review
     
  • Nagendra Kumar

    Nagendra Kumar - 2014-08-12

    changeset: 5562:a4124a505df9
    tag: tip
    user: Nagendra Kumar <nagendra.k@oracle.com>
    date: Tue Aug 12 20:43:13 2014 +0530
    summary: amfd: return FAILED_OP to RtAttrUpdateCallbackT if RtObjectUpdate_2 fails [#426]

    [staging:a4124a]

     

    Related

    Tickets: #426
    Commit: [a4124a]

  • Nagendra Kumar

    Nagendra Kumar - 2014-08-21
    • status: review --> fixed
     
  • Nagendra Kumar

    Nagendra Kumar - 2014-08-21

    changeset: 5628:6d4c5e78b31f
    branch: opensaf-4.3.x
    user: Nagendra Kumar <nagendra.k@oracle.com>
    date: Thu Aug 21 11:42:11 2014 +0530
    summary: amfd: return FAILED_OP to RtAttrUpdateCallbackT if RtObjectUpdate_2 fails [#426]

    changeset: 5629:caabfba704e8
    branch: opensaf-4.4.x
    tag: tip
    parent: 5626:ae283908b0d5
    user: Nagendra Kumar <nagendra.k@oracle.com>
    date: Thu Aug 21 11:42:39 2014 +0530
    summary: amfd: return FAILED_OP to RtAttrUpdateCallbackT if RtObjectUpdate_2 fails [#426]

    [staging:6d4c5e]
    [staging:caabfb]

     

    Related

    Tickets: #426
    Commit: [6d4c5e]
    Commit: [caabfb]

