When provision rhels7 cn. The postbootscript ospkgs will fail case cn status is failed.
The error message of ospkgs in /var/log/xcat/xcat.log:
Tue Aug 12 05:02:41 EDT 2014 Running postscript: ospkgs
There are no enabled repos.
Run "yum repolist all" to see the repos you have.
You can enable repos with yum-config-manager --enable <repo>
ospkgs: yum -y upgrade
Loaded plugins: product-id, subscription-manager
This system is not registered to Red Hat Subscription Management. You can use subscription-manager to register.
http://20.10.10.100/install/rhels7.0/ppc64/repodata/repomd.xml: [Errno 14] curl#7 - "Failed to connect to 20.10.10.100: Network is unreachable"
Trying other mirror.
http://20.10.10.100/install/rhels7.0/ppc64/repodata/repomd.xml: [Errno 14] curl#7 - "Failed to connect to 20.10.10.100: Network is unreachable"
Trying other mirror.
......</repo>
I add 'ifconfig' command at begin of ospkgs script and find no ip available when ospkgs executed.
I think maybe it's because the network is ready too late after cn reboot.
Some useful message(please pay attention to the time ):
1.[root@c931f07p02 ~]# systemctl status network.service -l
network.service - LSB: Bring up/down networking
Loaded: loaded (/etc/rc.d/init.d/network)
Active: failed (Result: exit-code) since Tue 2014-08-12 05:02:33 EDT; 33min ago
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: Bringing up loopback interface: Could not load file '/etc/sysconfig/network-scripts/ifcfg-lo'
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: Could not load file '/etc/sysconfig/network-scripts/ifcfg-lo'
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: Could not load file '/etc/sysconfig/network-scripts/ifcfg-lo'
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: Could not load file '/etc/sysconfig/network-scripts/ifcfg-lo'
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: [ OK ]
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: Bringing up interface eth0: Error: Connection activation failed: Connection 'System eth0' is not available on the device eth0 at this time.
Aug 12 05:02:33 c931f07p02.private.dns.zone network[3043]: [FAILED]
Aug 12 05:02:33 c931f07p02.private.dns.zone systemd[1]: network.service: control process exited, code=exited status=1
Aug 12 05:02:33 c931f07p02.private.dns.zone systemd[1]: Failed to start LSB: Bring up/down networking.
Aug 12 05:02:33 c931f07p02.private.dns.zone systemd[1]: Unit network.service entered failed state.
[root@c931f07p02 ~]#
2.MN log /var/log/message:
Aug 12 05:02:41 c931f07p01 xCAT[57446]: xCAT: Allowing rpower to test state for root from localhost
Aug 12 05:02:43 c931f07p01 dhcpd: DHCPDISCOVER from 6c:ae:8b:68:61:64 via enP1p128s0f1
Aug 12 05:02:43 c931f07p01 dhcpd: DHCPOFFER on 20.10.10.3 to 6c:ae:8b:68:61:64 via enP1p128s0f1
Aug 12 05:02:43 c931f07p01 dhcpd: DHCPREQUEST for 20.10.10.3 (20.10.10.100) from 6c:ae:8b:68:61:64 via enP1p128s0f1
Aug 12 05:02:43 c931f07p01 dhcpd: DHCPACK on 20.10.10.3 to 6c:ae:8b:68:61:64 via enP1p128s0f1
Aug 12 05:02:58 c931f07p01 xCAT[57473]: xCAT: Allowing getAllEntries for root from localhost
Aug 12 05:02:58 c931f07p01 xCAT[57475]: xCAT: Allowing getAllEntries for root from localhost
Aug 12 05:02:58 c931f07p01 rpc.mountd[41077]: authenticated mount request from 20.10.10.3:749 for /install/shared (/install/shared)
Aug 12 05:02:58 c931f07p01 named[40887]: sockmgr 0x3fffafb79010: maximum number of FD events (64) received
Aug 12 05:02:58 c931f07p01 named[40887]: sockmgr 0x3fffafb79010: maximum number of FD events (64) received
Aug 12 05:02:58 c931f07p01 xCAT[37925]: Get request for change status for nodes: $VAR1 = 'c931f07p02';
to status: failed.
Aug 12 05:02:58 c931f07p01 xCAT[37925]: PCM-xCAT plugin get current provisioning status for nodes: $VAR1 = 'c931f07p02';
have status: $VAR1 = {
'c931f07p02' => 'provisioning'
};
.
Aug 12 05:02:58 c931f07p01 xCAT[37925]: Node c931f07p02 's old provisioning status provisioning, update it to to failed
Aug 12 05:02:58 c931f07p01 xCAT[37925]: Calling setAppStatus to set nodes: $VAR1 = 'c931f07p02';
provisioning status to failed
fixed in 2.8.5:
commit 6daba3107cdbb2c2f15907e799e887d2cbe6f54c
Author: immarvin yangsbj@cn.ibm.com
Date: Mon Aug 25 21:28:21 2014 -0700
diff --git a/xCAT/postscripts/xcatinstallpost b/xCAT/postscripts/xcatinstallpost
index c0ac65f..03fd10b 100755
--- a/xCAT/postscripts/xcatinstallpost
+++ b/xCAT/postscripts/xcatinstallpost
@@ -12,6 +12,24 @@ fi
SLI=$(awk 'BEGIN{srand(); printf("%d\n",rand()*10)}')
sleep $SLI
+
+
+#the nics have not been configured when running the PBS sometimes, need to make sure...
+RETRY=0
+until ip -4 --oneline addr show|grep -v 127.0.0.1 >/dev/null
+do
+ sleep 2
+ RETRY=$[ $RETRY + 1 ]
+ if [ $RETRY -eq 15 ];then
+ logger -t xcat -p local4.err "Network not configured, please check..."
+ echo
date"xcatinstallpost: Network not configured, please check..." >> /var/log/xcat/xcat.log+ exit 1
+ fi
+
+done
+
+
+
cd /xcatpost;
PATH=/xcatpost:$PATH
export PATH
fixed in 2.9:
commit 9b5f4e2d05943a8f583318345993172e1a2fb9df
Author: immarvin yangsbj@cn.ibm.com
Date: Mon Aug 25 21:28:21 2014 -0700