kernel virtual machine

Tracker: Bugs

Restarted Windows 2003 Server guests have disk corruption - ID: 2001452
Last Update: Comment added ( gerdwachs )

I have a number of Windows 2003 32-bit guests.

I use them to perform installation and configuration
tests of a large software product.

During these tests, the guests are restarted.

Randomly, the guests produce disk corruption messages
after a restart.

The following are two examples :

-------------------------------------------------------
Windows – Registry Hive Recovered

Registry hive (file): SOFTWARE was corrupted and it has
been recovered. Some data might have been lost.
-------------------------------------------------------
The system cannot log on due to the following error:
Unable to complete the requested operation because of
either a catastrophic media failure or a data structure
corruption on the disk.
-------------------------------------------------------

OS : Ubuntu 8.04 x86_64

Kernel : 2.6.24-18-server #1 SMP x86_64 GNU/Linux

KVM : kvm-70

CPU : Intel(R) Core(TM)2 Quad CPU @ 2.40GHz

flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat
pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm
constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx est tm2
ssse3 cx16 xt

Start Command : sudo /usr/local/kvm/bin/qemu-system-x86_64 -hda asit51ascs.img \
-m 1024 -std-vga -boot c -k sv -usb -usbdevice tablet -snapshot -vnc :51 \
-net nic,vlan=0,macaddr=00:16:3e:00:51:00 -net tap,vlan=0,script=/etc/qemu-ifup-br0 \
-net nic,vlan=1,macaddr=00:16:3e:00:51:01 -net tap,vlan=1,script=/etc/qemu-ifup-br1

no-kvm : Not feasible because of the loss of performance.
The tests take 7 hours to execute with kvm.



Submitted: gwachs ( gerdwachs ) - 2008-06-24 05:27
Priority: 7
Status: Open
Resolution: None
Assigned: Nobody/Anonymous
Category: intel
Group: None
Visibility: Public


Comments ( 15 )

Date: 2008-07-31 20:55
Sender: gerdwachs


I always start with a newly created image file and install
from a Windows 2003 Server ISO which is well tested on
physical machines and under VMware.
To give a little more background, I install the following
before installing the product under test (restarting when
appropriate): Windows Updates, Acrobat, Office 2003, WinZip,
Visual Basic 6, DirectX, SQL Server, Oracle Server.
The products we test require a DVD for space.
This may be a heavier use of Windows 2003 Server running
in a kvm environment.


Date: 2008-07-31 05:19
Sender: jlokier


Did you start with the Windows Server 2003 image which is available free
from Microsoft, and convert it to qcow2 using 'qemu-img convert'? That
conversion is known to be buggy.


Date: 2008-07-17 07:39
Sender: gerdwachs


I use qcow2.
The host filesystem is xfs.
I do not have access to the computers until Aug 10th,
so I cannot test the raw format until then.


Date: 2008-07-16 14:23
Sender: avik (Project Admin)


What disk format are you using?

Does switching to raw format help?
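
One way to test the raw format is to copy the existing qcow2 image to a raw file and boot the guest from the copy. A minimal sketch, assuming the guest is shut down and using the image name from the start command; the helper names raw_name and convert_to_raw are made up for illustration:

```shell
#!/bin/sh
# Sketch: copy a qcow2 image to raw so the guest can be retested on raw storage.
# Image names are placeholders; run only while the guest is shut down.

# Derive the raw file name from the source file name (foo.img -> foo.raw).
raw_name() {
    printf '%s.raw\n' "${1%.*}"
}

# Convert SRC (qcow2) to a raw image next to it, leaving SRC untouched.
# Prints the name of the new raw image on success.
convert_to_raw() {
    src=$1
    dst=$(raw_name "$src")
    qemu-img convert -f qcow2 -O raw "$src" "$dst" && printf '%s\n' "$dst"
}

# Example (not run here): convert_to_raw asit51ascs.img
```

The guest would then be started with -hda pointing at the .raw file in place of the qcow2 image. A raw copy needs enough free host disk space for the image's full virtual size.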


Date: 2008-07-10 05:10
Sender: gerdwachs


Regarding the daily builds:
from build 20080707 on, my existing guests would not start.
I have rebuilt one guest using 20080708. It seems OK, but I need
to run more guests for a longer time to be sure.
I will not have access to the environment from July 12 to Aug 10.



Date: 2008-06-30 11:27
Sender: gerdwachs


Sorry, I am not that advanced on the usage of git.

If you would care to send instructions, I will try.

I am currently using the latest snapshots.

It appears to be working well enough for my requirements, but I have still
been getting the odd corruption message.

P.S. I believe that kvm is an absolute winning concept.



Date: 2008-06-27 14:09
Sender: iggy_cav


For some reason, I thought a virtio patch would help you, when you
obviously aren't using that with windows guests. It could still be
something with the i/o thread though. Is there any way you can do a git
bisect to figure out where exactly it breaks? I know it's hard to do
something like that when it takes so long to trigger the issue. It may be
our only option though.

kvm-69 doesn't have the i/o thread (I think), so it should be safe to use
if you just need something that works.
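
A rough outline of such a bisect, assuming a git checkout of the kvm userspace tree in which kvm-69 names a known-good revision and kvm-70 a known-bad one. Both revision names and the helper name bisect_begin are assumptions, so substitute whatever the tree actually uses:

```shell
#!/bin/sh
# Sketch of a manual bisect between a known-good and a known-bad revision.
# GOOD and BAD are placeholders and can be overridden from the environment.
GOOD=${GOOD:-kvm-69}
BAD=${BAD:-kvm-70}

# Start the bisect; git then checks out a revision roughly halfway between.
bisect_begin() {
    git bisect start &&
    git bisect bad "$BAD" &&
    git bisect good "$GOOD"
}

# After each build-and-test cycle, report the outcome by hand:
#   git bisect good    # guests survived the restart tests
#   git bisect bad     # disk corruption appeared
# When git names the first bad commit, return to the original checkout with:
#   git bisect reset
```

Each step needs one full rebuild and test run, and the number of steps grows with the logarithm of the number of commits between the two revisions, so a long test cycle makes this slow but bounded.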


Date: 2008-06-27 10:44
Sender: gerdwachs


The patch did not fix the problem.
I had one XP guest hang with build 20080626.
Both hosts are currently running build 20080626
to assess disk corruption occurrences.


Date: 2008-06-27 08:25
Sender: gerdwachs


Regarding the patch. This seemed to fix the problem,
but will keep re-running for a week before being certain.

Regarding the computer freezing: I tried snapshot 20080626.
Whilst one guest SEEMED to hang for seconds/minutes, it kept running.

Thank you very much iggy_cav



Date: 2008-06-26 10:18
Sender: gerdwachs


I have applied the patch and begun testing.
I will update after testing.



Date: 2008-06-25 12:59
Sender: iggy_cav


Can you try to revert (patch -R) the virtio async feature? Someone else in
the irc channel that was having fs corruption had luck doing that.
http://people.redhat.com/~mtosatti/virtioblk-async.patch

Otherwise, just stick with the kvm-69 userspace until it's fixed.
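
For reference, a reverse apply could look like the sketch below. It assumes the patch file has been saved locally, that the command is run from the top of the source tree, and that -p1 is the right strip level for this patch; the helper name revert_patch is hypothetical:

```shell
#!/bin/sh
# Sketch: reverse-apply a patch, checking first that it reverts cleanly.
# The -p1 strip level and the working directory are assumptions.
revert_patch() {
    patch_file=$1
    # Dry run first so a failed revert does not leave the tree half-patched.
    patch -R -p1 --dry-run < "$patch_file" &&
    patch -R -p1 < "$patch_file"
}

# Example (not run here): revert_patch virtioblk-async.patch
```

The userspace has to be rebuilt and reinstalled after the revert before the guests are restarted.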


Date: 2008-06-25 07:26
Sender: gerdwachs


Windows 2003 guests also have random BSODs.

The problems in this bug report put a stop to running Windows 2003 Server
on kvm at this point in time.


Date: 2008-06-24 08:33
Sender: gerdwachs


The message : apic write: bad size=1 fee00030
does not occur when using the option : -no-kvm-irqchip
Will continue testing.
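
To compare runs with and without -no-kvm-irqchip, the console output could be saved to a file and the message counted. A small sketch; the log file name console.log and the helper name count_apic_warnings are assumptions, and it presumes the output was captured to a file by some means:

```shell
#!/bin/sh
# Count how often the APIC warning appears in a captured console log.
# Prints 0 (and returns non-zero, as grep does) when the message is absent.
count_apic_warnings() {
    grep -c -F 'apic write: bad size' "$1"
}

# Example (not run here): count_apic_warnings console.log
```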


Date: 2008-06-24 08:15
Sender: gerdwachs


The message : apic write: bad size=1 fee00030

only occurs when the guest is started using kvm.

i.e. it does not occur with the -no-kvm option.

When using the -no-acpi option, the guest does not start, with or without kvm.


Date: 2008-06-24 07:32
Sender: gerdwachs


Noted that I get the following in the linux console :

apic write: bad size=1 fee00030



Attached File

No Files Currently Attached

Changes ( 5 )

Field       Old Value          Date               By
status_id   Pending            2008-07-17 07:39   gerdwachs
close_date  2008-07-16 14:23   2008-07-17 07:39   gerdwachs
status_id   Open               2008-07-16 14:23   avik
close_date  -                  2008-07-16 14:23   avik
priority    5                  2008-06-25 07:26   gerdwachs