Learn how easy it is to sync an existing GitHub or Google Code repo to a SourceForge project! See Demo

Close

#521 qemu-kvm fails to migrate properly in 0.12.2

open
nobody
qemu (66)
5
2014-11-21
2010-02-08
Lance Albertson
No

I'm using Ganeti and encountered a problem [1] with migrating KVM virtual machines. I have confirmed that this is not a problem specific to Ganeti and is also happening to the Libvirt project [2].

When migrating from host A to host B, everything goes fine however the guest VM (running CentOS 5.4) clock changes to approximately 600 seconds in the future. When attempting to migrate back to host B, the VM locks up and stops responding. However, if you fix the clock prior to syncing back to host A, the migration happens however the clock on the VM literally stops so the guest is pretty unusable.

I tried the same scenario using Ubuntu 9.10 for the guest OS and has different issues. Upon migrating to host B, the clock on the guest VM was ahead by over 3 days into the future. Then when I migrate it back to host A the VM locks up hard.

I can confirm I have no problem using migration when using 0.11.1.

Here's the detail information about our setup:
Hardware: Intel(R) Xeon(R) CPU 5160 @ 3.00GHz
qemu-kvm: 0.12.2
kernel: 2.6.29-hardened
Linux OS: Gentoo Hardened

If you need more information, please let me know and I'll be happy to provide it.

Thanks-

[1] http://groups.google.com/group/ganeti/browse_thread/thread/9b2c556557e749cf
[2] http://thread.gmane.org/gmane.comp.emulators.libvirt/20771

Discussion

  • Brian Jackson
    Brian Jackson
    2010-02-08

    This has been mentioned a few times in irc and possibly on the mailing list. No one has really come up with a fix. Can you try without kvmclock?

     
  • Are you wanting me to try this on the guest or host system?

     
  • Brian Jackson
    Brian Jackson
    2010-02-08

    guest

    you can either use the no-kvmclock boot param or choose a different clock in:
    /sys/devices/system/clocksource/clocksource0/current_clocksource

    possible values are in:
    /sys/devices/system/clocksource/clocksource0/available_clocksource

     
  • It seems to work if I use something like acpi_pm for the clock source (on Debian). The clock also seemed to stay in time. I haven't tested it on the other OS's but it seems that CentOS doesn't even use kvmclock (which means sense considering the age of the kernel).

     
  • Any idea why kvmclock is causing this issue? Has there been any recent changes to that part of qemu-kvm?

    Thanks for the tips on CentOS, I'll give that a try and see if that works as a workaround for now.

     
  • ttt
    ttt
    2010-02-09

    Using acpi_pm both on hosts and guests (Debian 5.0 AMD64) seems to work. I now did 20-30 live migrations or so and it worked like a charm. Thanks!

     
  • Brian Jackson
    Brian Jackson
    2010-02-09

    Can someone try this?

    http://www.mail-archive.com/kvm@vger.kernel.org/msg28046.html

     
  • I'll give that a try sometime today

     
  • The patch you provided doesn't seem to apply cleanly on older kernels so I went ahead and used the latest stable 2.6.32.8 which seems to include the patch. Unfortunately I have the same problem as before with the time moving up by nearly three days and the VM locking up upon returning to the original host.

     
  • Has there been any progress made on this bug? I noticed that I had the same problem with 0.11.1 and just resorted to not using the kvm-clock.

     
  • Brian Jackson
    Brian Jackson
    2010-06-16

    There have been some fixes committed and there is another long set of patches being discussed on the list now. Unfortunately, the fixes require both host and guest updates. I don't think they will be in till 2.6.36 or so. If they are suitable for stable releases, they can probably go into 2.6.32 and the other maintained stable trees.