Re: [Mondo-devel] Mondoarchive failing w/segfault
Brought to you by:
bcornec
|
From: Rene F. <Ren...@ps...> - 2013-11-25 20:38:01
|
Mondo team:
In case this matters, the 2 systems where mondoarchive is failing with a segfault are clustered nodes. RHCS 5.7. Is Mondo adverse to high availability LVM (2 volume groups exclusively activated on only one node at a time). If you need more information about these nodes, please let me know.
Regards,
Rene
Rene Feitelson
Senior Technical Consultant
Premier Systems, Ltd.
267-218-3505
ren...@ps...<mailto:ren...@ps...>
This is a PRIVATE MESSAGE. If you are not the intended recipient, please delete without copying, and kindly advise me by email at the above address of the mistake in delivery. NOTE: Regardless of content, this email shall not operate to bind me to any order or other contract, unless pursuant to explicit written agreement or government initiative expressly permitting the use of email for such purposes.
_____________________________________________
From: Rene Feitelson
Sent: Monday, November 25, 2013 12:36
To: mon...@li...
Subject: Mondoarchive failing w/segfault
Mondo team:
I have the same Mondo components installed on 3 systems. Same hardware configurations: HP Proliant DL380G7 running RHEL 5.7. (The first system, where mondoarchive works is now patched up to RHEL 5.10.). mondoarchive works on one system, but fails with segfauls on the other 2. Here is what I have:
afio-2.5-1.rhel5
buffer-1.19-4.rhel5
mondo-3.0.4-1.rhel5
mindi-2.1.7-1.rhel5
mindi-busybox-1.18.5-3.rhel5
My mondoarchive scripts runs the following mondoarchive command on 1 of the 2 failing systems. The 2 systems write to 2 directories under the same NFS mount.
mondoarchive -O -E "/qad|/prog|/proddevl|/cqreports|/cqreportsdevl" -p klpjlinux2 -n nfs://kpjfps01.kolmarpj.osg.com:/devl_data_2 -N -G -9 -S /mondo/klpjlinux2/scratch -T /mondo/klpjlinux2/temp -d klpjlinux2 -s 4600m
mondoarchive -O -E "/qad|/prog|/proddevl|/cqreports|/cqreportsdevl" -p klpjlinux1 -n nfs://kpjfps01.kolmarpj.osg.com:/devl_data_2 -N -G -9 -S /mondo/klpjlinux1/scratch -T /mondo/klpjlinux1/temp -d klpjlinux1 -s 4600m
mondoarchive runs, but then hangs (never returns a prompt). The last line displayed is: "Checking sanity of your Linux distribution".
I don't see any errors in /var/log/mondoarchive.log or /var/log/mindi.log. See attached files from the klpjlinux2 system.
Here are the segfault messages from klpjlinux2:/var/log/messages:
[root@klpjlinux2 log]# grep mondo /var/log/messages
Nov 25 11:01:19 klpjlinux2 kernel: mondoarchive[2100]: segfault at 0000000000000000 rip 00000000004375f4 rsp 00007fff8c24d7a0 error 4
Nov 25 11:24:17 klpjlinux2 kernel: mondoarchive[24027]: segfault at 0000000000000000 rip 00000000004375f4 rsp
00007fff8073ab90 error 4
Nov 25 11:42:12 klpjlinux2 kernel: mondoarchive[3863]: segfault at 0000000000000000 rip 00000000004375f4 rsp 0
0007ffffa3f25a0 error 4
Here is the segfault message from the klpjlinux1 system:
[root@klpjlinux1 admin]# grep mondo /var/log/messages
Nov 25 12:15:32 klpjlinux1 kernel: mondoarchive[9155]: segfault at 0000000000000000 rip 00000000004375f4 rsp 00007fffc5a35240 error 4
Please advise.
<< File: mindi.log >> << File: mondoarchive.log >>
Regards,
Rene
Rene Feitelson
Senior Technical Consultant
Premier Systems, Ltd.
267-218-3505
ren...@ps...<mailto:ren...@ps...>
This is a PRIVATE MESSAGE. If you are not the intended recipient, please delete without copying, and kindly advise me by email at the above address of the mistake in delivery. NOTE: Regardless of content, this email shall not operate to bind me to any order or other contract, unless pursuant to explicit written agreement or government initiative expressly permitting the use of email for such purposes.
|