Using a rhels6 SN to install a rhels6 CN:
An error occurs when the postscripts run, and the following messages appear in /var/log/messages on the MN:
*start*
Dec 20 21:50:39 xcatn06 xCAT: xcatd: possible BUG encountered by xCAT DB worker Describe failed during DBI::st=HASH(0x1000864a648)->FETCH(NAME) at /opt/xcat/lib/perl/xCAT/Table.pm line 2682, <gen154> line 8.
Dec 20 21:50:39 xcatn06 xCAT: xcatd: getpostscript plugin bug, pid 9533, process description: 'xCATd SSL: getpostscript for xcatn05: getpostscript instance' with error 'Describe failed during DBI::st=HASH(0x1000864a648)->FETCH(NAME) at /opt/xcat/lib/perl/xCAT/Table.pm line 2682, <gen154> line 8.#012'
Dec 20 20:53:16 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:19 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:22 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:25 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:27 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:30 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 21:50:57 xcatn06 xCAT: xCAT: Allowing getpostscript from xcatn05
Dec 20 21:50:57 xcatn06 xCAT: xcatd: possible BUG encountered by xCAT DB worker Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen199> line 8.
Dec 20 21:50:57 xcatn06 xCAT: xcatd: getpostscript plugin bug, pid 9547, process description: 'xCATd SSL: getpostscript for xcatn05: getpostscript instance' with error 'Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen199> line 8.#012'
Dec 20 20:53:33 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:36 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:39 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:42 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:45 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:47 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 21:51:14 xcatn06 xCAT: xCAT: Allowing getpostscript from xcatn05
Dec 20 21:51:15 xcatn06 xCAT: xcatd: possible BUG encountered by xCAT DB worker Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen244> line 8.
Dec 20 21:51:15 xcatn06 xCAT: xcatd: getpostscript plugin bug, pid 9562, process description: 'xCATd SSL: getpostscript for xcatn05: getpostscript instance' with error 'Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen244> line 8.#012'
Dec 20 20:53:53 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:53:56 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:00 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:03 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:06 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 21:51:31 xcatn06 xCAT: xCAT: Allowing getpostscript from xcatn05
Dec 20 21:51:31 xcatn06 xCAT: xcatd: possible BUG encountered by xCAT DB worker Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen289> line 8.
Dec 20 21:51:31 xcatn06 xCAT: xcatd: getpostscript plugin bug, pid 9565, process description: 'xCATd SSL: getpostscript for xcatn05: getpostscript instance' with error 'Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen289> line 8.#012'
Dec 20 20:54:08 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:11 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:14 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:17 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 21:51:44 xcatn06 xCAT: xCAT: Allowing getpostscript from xcatn05
Dec 20 21:51:44 xcatn06 xCAT: xcatd: possible BUG encountered by xCAT DB worker Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen334> line 8.
Dec 20 21:51:44 xcatn06 xCAT: xcatd: getpostscript plugin bug, pid 9580, process description: 'xCATd SSL: getpostscript for xcatn05: getpostscript instance' with error 'Can't call method "execute" on an undefined value at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen334> line 8.#012'
Dec 20 20:54:20 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:23 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:26 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:28 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:54:31 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0
Dec 20 20:56:16 hv32s5fp16 dhcpd: DHCPOFFER on 192.168.5.67 to 0a:91:a7:ca:af:06 via eth1
Dec 20 20:56:16 hv32s5fp16 dhcpd: Unable to add forward map from xcatn05.ppd.pok.ibm.com to 192.168.5.67: bad DNS key
Dec 20 20:56:16 hv32s5fp16 dhcpd: DHCPREQUEST for 192.168.5.67 (192.168.5.37) from 0a:91:a7:ca:af:06 via eth1
Dec 20 20:56:16 hv32s5fp16 dhcpd: DHCPACK on 192.168.5.67 to 0a:91:a7:ca:af:06 via eth1
Dec 20 20:56:17 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
Dec 20 20:56:20 hv32s5fp16 dhcpd: DHCPDISCOVER from 00:14:5e:b5:57:8d via eth0: network eth0: no free leases
*end*
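The xcatd errors in the log above are interleaved with unrelated dhcpd lines. As a rough sketch for reading such a log, the helpers below keep only the xcatd error lines and count them per Table.pm line number. The function names are hypothetical, the default path /var/log/messages is an assumption, and "grep -o" assumes GNU grep (it may not be available on AIX).

```shell
# Hypothetical helper: print only the xcatd error lines from a syslog file.
# Defaults to /var/log/messages (an assumption); pass another file as $1.
xcat_bug_lines() {
    grep -E 'xCAT: xcatd: (possible BUG|getpostscript plugin bug)' "${1:-/var/log/messages}"
}

# Hypothetical helper: count how often each Table.pm line number
# shows up in those error lines.
xcat_bug_summary() {
    xcat_bug_lines "$1" | grep -oE 'Table\.pm line [0-9]+' | sort | uniq -c
}
```

Running "xcat_bug_summary" on the MN would show at a glance whether the failures come from line 2681, line 2682, or both.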
Error when the postbootscripts run:
*begin*
Starting sshd: [ OK ]
Starting postfix: [ OK ]
Starting abrt daemon: [ OK ]
Starting xcatpostinit1++ uname
Red Hat Enterprise Linux Server release 6.0 (Santiago)
Kernel 2.6.32-71.el6.ppc64 on an ppc64
*end*
Minjun found that if xcatd is started with "xcatd -f", this error does not appear and the installation completes successfully.
He will investigate this problem.
The problem was found on AIX too.
Using an aix71B MN and an aix71B SN to install an aix71B CN:
A postscript error appears on the CN when the BOS installation finishes:
begin
AIX Version 7
Copyright IBM Corporation, 1982, 2010.
Console login: 0513-059 The ctrmc Subsystem has been started. Subsystem PID is 4915364.
Thu Dec 23 02:12:36 CST 2010 xcataixpost: run: cat /xcatpost/myxcatpost_hv32s38p21 | grep 'INSTALLDIR=' 2>&1 -
Use of uninitialized value $installdir in scalar chomp at /xcatpost/xcataixpost line 169.
Use of uninitialized value $installdir in substitution (s///) at /xcatpost/xcataixpost line 170.
Thu Dec 23 02:12:36 CST 2010 xcataixpost: run: cat /xcatpost/myxcatpost_hv32s38p21 | grep 'TIMEZONE=' 2>&1 -
end
The SN shows these messages:
begin
bash-3.2# xCAT: Allowing getpostscript from hv32s38p21
DBD::DB2::db prepare failed: [IBM][CLI Driver] SQL30081N A communication error has been detected. Communication protocol being used: "TCP/IP". Communication API being used: "SOCKETS". Location where the error was detected: "192.168.5.112". Communication function detecting the error: "recv". Protocol specific error code(s): "", "", "0". SQLSTATE=08001
DBD::DB2::st execute failed: [IBM][CLI Driver] CLI0108E Communication link failure. SQLSTATE=40003 at /opt/xcat/lib/perl/xCAT/Table.pm line 2681, <gen478> line 8.
xcatd: possible BUG encountered by xCAT DB worker Describe failed during DBI::st=HASH(0x23210de8)->FETCH(NAME) at /opt/xcat/lib/perl/xCAT/Table.pm line 2682, <gen478> line 8.
xcatd: getpostscript plugin bug, pid 6422630, process description: 'xCATd SSL: getpostscript for hv32s38p21: getpostscript instance' with error 'Describe failed during DBI::st=HASH(0x23210de8)->FETCH(NAME) at /opt/xcat/lib/perl/xCAT/Table.pm line 2682, <gen478> line 8.
end
In the end, the CN stays in booting status.
Lissa,
I spent a lot of time debugging in the RHELS6 environment. Here is what I found:
If the "xcatd" daemon on the service node is running in the foreground, "getpostscript.awk" works without any errors. But if the xCAT service is running in the background, "getpostscript.awk" fails as Fang Zheng described here.
I can recreate this issue without reinstalling the compute node. Here is what I did:
1) log in to the compute node;
2) run the command:
USEOPENSSLFORXCAT=1 XCATSERVER=192.168.5.78:3001 /xcatpost/getpostscript.awk
3) check the command's output; you will see the error message.
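The three steps above can be wrapped in a small script so the check in step 3 is automatic. This is only a sketch: the function names are hypothetical, the server address and script path are the ones quoted in this report, and the error patterns are taken from the xcatd messages shown earlier, on the assumption that the same text is returned to the compute node.

```shell
# Hypothetical check: succeed if stdin contains one of the xcatd error
# signatures quoted in this report.
looks_like_getpostscript_failure() {
    grep -qE "plugin bug|possible BUG|Describe failed|Can't call method"
}

# Hypothetical wrapper: run the reproduction command on the compute node
# and report whether the output carries a known error signature.
# XCATSERVER defaults to the service node address used in this report.
repro_getpostscript() {
    out=$(USEOPENSSLFORXCAT=1 XCATSERVER="${XCATSERVER:-192.168.5.78:3001}" \
          /xcatpost/getpostscript.awk 2>&1)
    if printf '%s\n' "$out" | looks_like_getpostscript_failure; then
        echo "FAIL: xcatd returned an error instead of a postscript"
        return 1
    fi
    echo "OK: no known error signature in the output"
}
```

Calling "repro_getpostscript" once with xcatd in the background and once with "xcatd -f" on the service node should make the difference described above easy to compare.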
Besides, I added several debug points in the xCAT code. The error is triggered at line 318 of perl/xCAT/Postage.pm, where "xCAT::Utils->isSN($node)" is called.
Then I traced into the "isSN()" subroutine. The error is triggered at line 3861 of perl/xCAT/Utils.pm, where "$servicenodetab->getAllNodeAttribs(['tftpserver']);" is called. Following this thread, I guess it is related to the database handling code.
I am curious about it because when "xcatd -f" is running, no xCAT command shows the issue.
Thank you!
Is this still a problem? xcatn05 seems to be installed.
Yes, this problem still exists on xcatn05, which is a CN.
After the installation:
Starting xcatpostinit1/tmp/mypostscript.post: line 1: quot: command not found
/tmp/mypostscript.post: line 1: getpostscript: command not found
/tmp/mypostscript.post: line 1: quot: command not found
/tmp/mypostscript.post: line 1: execute: command not found
/tmp/mypostscript.post: line 1: lt: command not found
/tmp/mypostscript.post: line 1: on: command not found
/tmp/mypostscript.post: line 1: GEN822: command not found
/tmp/mypostscript.post: line 1: gt: command not found
/tmp/mypostscript.post: line 1: line: command not found
/tmp/mypostscript.post: line 2: unexpected EOF while looking for matching `''
/tmp/mypostscript.post: line 5: syntax error: unexpected end of file
You can run
"nodeset xcatn05 install" and "rnetboot xcatn05" on hv32s5fp16 (9...68) to reproduce this problem.
The problem still exists on AIX too.
Run "nimnodeset -i 1042A71BCNimage hv32s38p21" and "rnetboot hv32s38p21" on the MN (hv32s38p01, 9...85). You can try to install the CN and see the result, but please do not modify anything, because this is my test environment.
Lissa, Fang Zheng,
Project mgmt is now tracking sev 1 (priority 9 & 8) and sev 2 (priority 7) bugs. Lissa, do you know what is causing this problem? Fang Zheng, is this still an important enough bug for it to be priority 7? Is it blocking your testing?
I believe this is the same as defect 3167038, for which we have known the cause as of yesterday, but I am waiting for a response from Jarrod. Since the change was made to fix a memory leak/performance problem, I don't want to just change it.
This is the same machine Linda and I were debugging on, so I am pretty confident this will fix the problem.
To prove it, we can fix the code temporarily in NodeRange.pm.
Just remove the unless around the new function at this line:
Try to match groups?
I believe if you get the latest snapshot of 2.6, you will not hit this problem.
This Tracker item was closed automatically by the system. It was
previously set to a Pending status, and the original submitter
did not respond within 28 days (the time period specified by
the administrator of this Tracker).