Thread: [kvm-devel] paravirt clock stil causing hangs in kvm-65

Brought to you by: avik, mtosatti

kvm-devel

[kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Nikola C. <ext...@li...> - 2008-04-07 11:53:40

Hi,

I also tried paravirt clock again in latest git with kvm-65 patch 
applied, and problem with cpu-lockups persists:

[10813.654806] BUG: soft lockup - CPU#0 stuck for 61s! [swapper:0]
[10813.655789] CPU 0:
[10813.656624] Modules linked in: virtio_pci virtio_ring virtio_blk virtio 
piix dm_snapshot dm_zero dm_mirror dm_mod ide_disk
  ide_core sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
[10813.658805] Pid: 0, comm: swapper Not tainted 2.6.25-rc7 #5
[10813.658805] RIP: 0010:[<ffffffff80222ab2>]  [<ffffffff80222ab2>] 
native_safe_halt+0x2/0x10
[10813.658805] RSP: 0018:ffffffff805adf50  EFLAGS: 00000296
[10813.658805] RAX: 000000019b08eeb0 RBX: ffffffff805f5000 RCX: 
000000019b08eeb0
[10813.658805] RDX: 0000000000000006 RSI: 00000000356832b0 RDI: 
ffffffff805adf38
[10813.658805] RBP: 0000000000000da8 R08: 0000000000000000 R09: 
0000000000000000
[10813.658805] R10: 0000000000000001 R11: 0000000000000002 R12: 
ffffffff802228ed
[10813.658805] R13: 00000000000132a0 R14: ffffffff80200bba R15: 
ffff81000100a280
[10813.658805] FS:  0000000000000000(0000) GS:ffffffff80576000(0000) 
knlGS:0000000000000000
[10813.658805] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
[10813.658805] CR2: 00007fac0f852000 CR3: 0000000000201000 CR4: 
00000000000006e0
[10813.658805] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
[10813.658805] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
0000000000000400
[10813.658805]
[10813.658805] Call Trace:
[10813.658805]  [<ffffffff8020a55b>] ? default_idle+0x3b/0x70
[10813.658805]  [<ffffffff8020a520>] ? default_idle+0x0/0x70
[10813.658805]  [<ffffffff8020a60e>] ? cpu_idle+0x7e/0xe0
[10813.658805]  [<ffffffff80211630>] ? pda_init+0x30/0xb0

Can I somehow help to track this one down??

Additionaly, I'd like to ask two questions:
1) why is kvm-65 patch changing release to -rc7? maybe this is 
unintended??

2) I'm getting lot's of following messages on host, what do they mean?
[16836.605669] Ignoring de-assert INIT to vcpu SOMENUMBER
[16836.605687] SIPI to vcpu SOMENUMBER vector 0xSOMENUMBER

BR
nik

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Marcelo T. <mto...@re...> - 2008-04-07 21:31:59

On Mon, Apr 07, 2008 at 01:53:36PM +0200, Nikola Ciprich wrote:
> Hi,
> 
> I also tried paravirt clock again in latest git with kvm-65 patch 
> applied, and problem with cpu-lockups persists:
> 
> [10813.654806] BUG: soft lockup - CPU#0 stuck for 61s! [swapper:0]
> [10813.655789] CPU 0:
> [10813.656624] Modules linked in: virtio_pci virtio_ring virtio_blk virtio 
> piix dm_snapshot dm_zero dm_mirror dm_mod ide_disk
>   ide_core sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> [10813.658805] Pid: 0, comm: swapper Not tainted 2.6.25-rc7 #5
> [10813.658805] RIP: 0010:[<ffffffff80222ab2>]  [<ffffffff80222ab2>] 
> native_safe_halt+0x2/0x10
> [10813.658805] RSP: 0018:ffffffff805adf50  EFLAGS: 00000296
> [10813.658805] RAX: 000000019b08eeb0 RBX: ffffffff805f5000 RCX: 
> 000000019b08eeb0
> [10813.658805] RDX: 0000000000000006 RSI: 00000000356832b0 RDI: 
> ffffffff805adf38
> [10813.658805] RBP: 0000000000000da8 R08: 0000000000000000 R09: 
> 0000000000000000
> [10813.658805] R10: 0000000000000001 R11: 0000000000000002 R12: 
> ffffffff802228ed
> [10813.658805] R13: 00000000000132a0 R14: ffffffff80200bba R15: 
> ffff81000100a280
> [10813.658805] FS:  0000000000000000(0000) GS:ffffffff80576000(0000) 
> knlGS:0000000000000000
> [10813.658805] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> [10813.658805] CR2: 00007fac0f852000 CR3: 0000000000201000 CR4: 
> 00000000000006e0
> [10813.658805] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
> 0000000000000000
> [10813.658805] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
> 0000000000000400
> [10813.658805]
> [10813.658805] Call Trace:
> [10813.658805]  [<ffffffff8020a55b>] ? default_idle+0x3b/0x70
> [10813.658805]  [<ffffffff8020a520>] ? default_idle+0x0/0x70
> [10813.658805]  [<ffffffff8020a60e>] ? cpu_idle+0x7e/0xe0
> [10813.658805]  [<ffffffff80211630>] ? pda_init+0x30/0xb0
> 
> Can I somehow help to track this one down??

Hi Nikola,

I just reproduced this on a UP guest. Were you seeing the exact same
stack trace in the guest with kvm-64 ?

> 
> Additionaly, I'd like to ask two questions:
> 1) why is kvm-65 patch changing release to -rc7? maybe this is 
> unintended??
> 
> 2) I'm getting lot's of following messages on host, what do they mean?
> [16836.605669] Ignoring de-assert INIT to vcpu SOMENUMBER
> [16836.605687] SIPI to vcpu SOMENUMBER vector 0xSOMENUMBER
> 
> BR
> nik
> 
> 
> -------------------------------------------------------------------------
> This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
> Register now and save $200. Hurry, offer ends at 11:59 p.m., 
> Monday, April 7! Use priority code J8TLD2. 
> http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
> _______________________________________________
> kvm-devel mailing list
> kvm...@li...
> https://lists.sourceforge.net/lists/listinfo/kvm-devel

Re: [kvm-devel] [ RfC / patch ] kvmclock fixes

From: Jeremy F. <je...@go...> - 2008-04-21 09:57:21

Gerd Hoffmann wrote:
>   * Host: make kvm pv clock really compatible with xen pv clock.
>   * Guest/xen: factor out some xen clock code into a separate
>                source file (pvclock.[ch]), so kvm can reuse it.
>   * Guest/kvm: make kvm clock compatible with xen clock by using
>                the common code bits.
>   

I guess saving on code duplication is good...

> +cycle_t pvclock_clocksource_read(struct kvm_vcpu_time_info *src)
> +{
> +	struct pvclock_shadow_time *shadow = &get_cpu_var(shadow_time);
> +	cycle_t ret;
> +
> +	pvclock_get_time_values(shadow, src);
> +	ret = shadow->system_timestamp + pvclock_get_nsec_offset(shadow);
>   

You need to put this in a loop in case the system clock parameters 
change between the pvclock_get_time_values() and pvclock_get_nsec_offset().

How does kvm deal with suspend/resume with respect to time?  Is the 
"system" timestamp guaranteed to remain monotonic?  For Xen, I think 
we'll need to maintain an offset between the initial system timestamp 
and whatever it is after resuming.

    J

Re: [kvm-devel] [ RfC / patch ] kvmclock fixes

From: Gerd H. <kr...@re...> - 2008-04-21 13:01:23

Attachments: kvmclock-5.diff

Jeremy Fitzhardinge wrote:
> Gerd Hoffmann wrote:
>> +cycle_t pvclock_clocksource_read(struct kvm_vcpu_time_info *src)
>> +{
>> +	struct pvclock_shadow_time *shadow = &get_cpu_var(shadow_time);
>> +	cycle_t ret;
>> +
>> +	pvclock_get_time_values(shadow, src);
>> +	ret = shadow->system_timestamp + pvclock_get_nsec_offset(shadow);
>>   
> 
> You need to put this in a loop in case the system clock parameters 
> change between the pvclock_get_time_values() and pvclock_get_nsec_offset().

Fixed, new patch attached.

> How does kvm deal with suspend/resume with respect to time?  Is the 
> "system" timestamp guaranteed to remain monotonic?  For Xen, I think 
> we'll need to maintain an offset between the initial system timestamp 
> and whatever it is after resuming.

Havn't looked at it yet.

cheers,
  Gerd

-- 
http://kraxel.fedorapeople.org/xenner/

Re: [kvm-devel] [ RfC / patch ] kvmclock fixes

From: Jeremy F. <je...@go...> - 2008-04-21 13:38:35

Gerd Hoffmann wrote:
> +cycle_t pvclock_clocksource_read(struct kvm_vcpu_time_info *src)
> +{
> +	struct pvclock_shadow_time *shadow;
> +	cycle_t ret;
> +	unsigned version;
> +
> +	shadow = &get_cpu_var(shadow_time);
> +	do {
> +		version = pvclock_get_time_values(shadow, src);
> +		barrier();
> +		ret = shadow->system_timestamp + pvclock_get_nsec_offset(shadow);
> +		barrier();
>   

Is barrier() strong enough?  Does kvm guarantee that the per-cpu time 
parameters are only ever updated by that cpu?  I'm pretty sure Xen does, 
so that's OK.

    J

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Marcelo T. <mto...@re...> - 2008-04-08 01:15:28

On Mon, Apr 07, 2008 at 06:34:57PM -0300, Marcelo Tosatti wrote:
> On Mon, Apr 07, 2008 at 01:53:36PM +0200, Nikola Ciprich wrote:
> > Hi,
> > 
> > I also tried paravirt clock again in latest git with kvm-65 patch 
> > applied, and problem with cpu-lockups persists:
> > 
> > [10813.654806] BUG: soft lockup - CPU#0 stuck for 61s! [swapper:0]
> > [10813.655789] CPU 0:
> > [10813.656624] Modules linked in: virtio_pci virtio_ring virtio_blk virtio 
> > piix dm_snapshot dm_zero dm_mirror dm_mod ide_disk
> >   ide_core sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> > [10813.658805] Pid: 0, comm: swapper Not tainted 2.6.25-rc7 #5
> > [10813.658805] RIP: 0010:[<ffffffff80222ab2>]  [<ffffffff80222ab2>] 
> > native_safe_halt+0x2/0x10
> > [10813.658805] RSP: 0018:ffffffff805adf50  EFLAGS: 00000296
> > [10813.658805] RAX: 000000019b08eeb0 RBX: ffffffff805f5000 RCX: 
> > 000000019b08eeb0
> > [10813.658805] RDX: 0000000000000006 RSI: 00000000356832b0 RDI: 
> > ffffffff805adf38
> > [10813.658805] RBP: 0000000000000da8 R08: 0000000000000000 R09: 
> > 0000000000000000
> > [10813.658805] R10: 0000000000000001 R11: 0000000000000002 R12: 
> > ffffffff802228ed
> > [10813.658805] R13: 00000000000132a0 R14: ffffffff80200bba R15: 
> > ffff81000100a280
> > [10813.658805] FS:  0000000000000000(0000) GS:ffffffff80576000(0000) 
> > knlGS:0000000000000000
> > [10813.658805] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> > [10813.658805] CR2: 00007fac0f852000 CR3: 0000000000201000 CR4: 
> > 00000000000006e0
> > [10813.658805] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
> > 0000000000000000
> > [10813.658805] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
> > 0000000000000400
> > [10813.658805]
> > [10813.658805] Call Trace:
> > [10813.658805]  [<ffffffff8020a55b>] ? default_idle+0x3b/0x70
> > [10813.658805]  [<ffffffff8020a520>] ? default_idle+0x0/0x70
> > [10813.658805]  [<ffffffff8020a60e>] ? cpu_idle+0x7e/0xe0
> > [10813.658805]  [<ffffffff80211630>] ? pda_init+0x30/0xb0
> > 
> > Can I somehow help to track this one down??
> 
> Hi Nikola,
> 
> I just reproduced this on a UP guest. Were you seeing the exact same
> stack trace in the guest with kvm-64 ?

I think the logic to wakeup tasks in HLT is racy. Nothing prevents
a timer event from being lost if it fires in between guest exit and
vcpu_block().

Please try the patch below.

> > 2) I'm getting lot's of following messages on host, what do they mean?
> > [16836.605669] Ignoring de-assert INIT to vcpu SOMENUMBER
> > [16836.605687] SIPI to vcpu SOMENUMBER vector 0xSOMENUMBER

This is the APIC SMP initialization protocol.


diff --git a/arch/x86/kvm/i8254.c b/arch/x86/kvm/i8254.c
index 06a241a..fdd8342 100644
--- a/arch/x86/kvm/i8254.c
+++ b/arch/x86/kvm/i8254.c
@@ -199,10 +199,8 @@ int __pit_timer_fn(struct kvm_kpit_state *ps)
 	struct kvm_kpit_timer *pt = &ps->pit_timer;
 
 	atomic_inc(&pt->pending);
-	if (vcpu0 && waitqueue_active(&vcpu0->wq)) {
-		vcpu0->arch.mp_state = VCPU_MP_STATE_RUNNABLE;
-		wake_up_interruptible(&vcpu0->wq);
-	}
+	if (vcpu0)
+		kvm_wakeup_vcpu(vcpu0, VCPU_MP_STATE_RUNNABLE);
 
 	pt->timer.expires = ktime_add_ns(pt->timer.expires, pt->period);
 	pt->scheduled = ktime_to_ns(pt->timer.expires);
@@ -210,6 +208,16 @@ int __pit_timer_fn(struct kvm_kpit_state *ps)
 	return (pt->period == 0 ? 0 : 1);
 }
 
+int pit_has_pending_event(struct kvm_vcpu *vcpu)
+{
+	struct kvm_pit *pit = vcpu->kvm->arch.vpit;
+
+	if (pit && vcpu->vcpu_id == 0)
+		return atomic_read(&pit->pit_state.pit_timer.pending);
+
+	return 0;
+}
+
 static enum hrtimer_restart pit_timer_fn(struct hrtimer *data)
 {
 	struct kvm_kpit_state *ps;
diff --git a/arch/x86/kvm/irq.c b/arch/x86/kvm/irq.c
index dbfe21c..18b8a0e 100644
--- a/arch/x86/kvm/irq.c
+++ b/arch/x86/kvm/irq.c
@@ -26,6 +26,21 @@
 #include "i8254.h"
 
 /*
+ * check if there are pending timer events
+ * to be processed.
+ */
+int kvm_cpu_has_pending_timer(struct kvm_vcpu *vcpu)
+{
+	int ret;
+
+	ret = pit_has_pending_event(vcpu);
+	ret |= apic_has_pending_event(vcpu);
+
+	return ret;
+}
+EXPORT_SYMBOL(kvm_cpu_has_pending_timer);
+
+/*
  * check if there is pending interrupt without
  * intack.
  */
diff --git a/arch/x86/kvm/irq.h b/arch/x86/kvm/irq.h
index fa5ed5d..ff0ddb5 100644
--- a/arch/x86/kvm/irq.h
+++ b/arch/x86/kvm/irq.h
@@ -85,4 +85,7 @@ void kvm_inject_pending_timer_irqs(struct kvm_vcpu *vcpu);
 void kvm_inject_apic_timer_irqs(struct kvm_vcpu *vcpu);
 void __kvm_migrate_apic_timer(struct kvm_vcpu *vcpu);
 
+int pit_has_pending_event(struct kvm_vcpu *vcpu);
+int apic_has_pending_event(struct kvm_vcpu *vcpu);
+
 #endif
diff --git a/arch/x86/kvm/lapic.c b/arch/x86/kvm/lapic.c
index 31280df..9973d8e 100644
--- a/arch/x86/kvm/lapic.c
+++ b/arch/x86/kvm/lapic.c
@@ -936,13 +936,10 @@ EXPORT_SYMBOL_GPL(kvm_lapic_enabled);
 static int __apic_timer_fn(struct kvm_lapic *apic)
 {
 	int result = 0;
-	wait_queue_head_t *q = &apic->vcpu->wq;
 
 	atomic_inc(&apic->timer.pending);
-	if (waitqueue_active(q)) {
-		apic->vcpu->arch.mp_state = VCPU_MP_STATE_RUNNABLE;
-		wake_up_interruptible(q);
-	}
+	kvm_wakeup_vcpu(apic->vcpu, VCPU_MP_STATE_RUNNABLE);
+
 	if (apic_lvtt_period(apic)) {
 		result = 1;
 		apic->timer.dev.expires = ktime_add_ns(
@@ -952,6 +949,16 @@ static int __apic_timer_fn(struct kvm_lapic *apic)
 	return result;
 }
 
+int apic_has_pending_event(struct kvm_vcpu *vcpu)
+{
+	struct kvm_lapic *lapic = vcpu->arch.apic;
+
+	if (lapic)
+		return atomic_read(&lapic->timer.pending);
+
+	return 0;
+}
+
 static int __inject_apic_timer_irq(struct kvm_lapic *apic)
 {
 	int vector;
diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
index cb57b6a..b11ea81 100644
--- a/arch/x86/kvm/x86.c
+++ b/arch/x86/kvm/x86.c
@@ -3925,10 +3944,7 @@ void kvm_vcpu_kick(struct kvm_vcpu *vcpu)
 {
 	int ipi_pcpu = vcpu->cpu;
 
-	if (waitqueue_active(&vcpu->wq)) {
-		wake_up_interruptible(&vcpu->wq);
-		++vcpu->stat.halt_wakeup;
-	}
+	kvm_wakeup_vcpu(vcpu, vcpu->arch.mp_state);
 	if (vcpu->guest_mode)
 		smp_call_function_single(ipi_pcpu, vcpu_kick_intr, vcpu, 0, 0);
 }
diff --git a/include/linux/kvm_host.h b/include/linux/kvm_host.h
index a2ceb51..a6cb75e 100644
--- a/include/linux/kvm_host.h
+++ b/include/linux/kvm_host.h
@@ -69,6 +69,7 @@ struct kvm_vcpu {
 	int fpu_active;
 	int guest_fpu_loaded;
 	wait_queue_head_t wq;
+	spinlock_t wq_lock;
 	int sigset_active;
 	sigset_t sigset;
 	struct kvm_vcpu_stat stat;
@@ -184,6 +185,7 @@ int kvm_is_visible_gfn(struct kvm *kvm, gfn_t gfn);
 void mark_page_dirty(struct kvm *kvm, gfn_t gfn);
 
 void kvm_vcpu_block(struct kvm_vcpu *vcpu);
+void kvm_wakeup_vcpu(struct kvm_vcpu *vcpu, int mpstate);
 void kvm_resched(struct kvm_vcpu *vcpu);
 void kvm_load_guest_fpu(struct kvm_vcpu *vcpu);
 void kvm_put_guest_fpu(struct kvm_vcpu *vcpu);
@@ -256,6 +262,7 @@ void kvm_arch_destroy_vm(struct kvm *kvm);
 
 int kvm_cpu_get_interrupt(struct kvm_vcpu *v);
 int kvm_cpu_has_interrupt(struct kvm_vcpu *v);
+int kvm_cpu_has_pending_timer(struct kvm_vcpu *vcpu);
 void kvm_vcpu_kick(struct kvm_vcpu *vcpu);
 
 static inline void kvm_guest_enter(void)
diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c
index 3396a5f..00d1a6c 100644
--- a/virt/kvm/kvm_main.c
+++ b/virt/kvm/kvm_main.c
@@ -152,6 +152,7 @@ int kvm_vcpu_init(struct kvm_vcpu *vcpu, struct kvm *kvm, unsigned id)
 	vcpu->kvm = kvm;
 	vcpu->vcpu_id = id;
 	init_waitqueue_head(&vcpu->wq);
+	spin_lock_init(&vcpu->wq_lock);
 
 	page = alloc_page(GFP_KERNEL | __GFP_ZERO);
 	if (!page) {
@@ -698,6 +699,23 @@ void mark_page_dirty(struct kvm *kvm, gfn_t gfn)
 	}
 }
 
+void kvm_wakeup_vcpu(struct kvm_vcpu *vcpu, int mpstate)
+{
+	wait_queue_head_t *q = &vcpu->wq;
+	unsigned long flags;
+
+	spin_lock_irqsave(&vcpu->wq_lock, flags);
+	if (waitqueue_active(q)) {
+#ifdef CONFIG_X86
+		vcpu->arch.mp_state = mpstate;
+#endif
+		wake_up_interruptible(q);
+		++vcpu->stat.halt_wakeup;
+	}
+	spin_unlock_irqrestore(&vcpu->wq_lock, flags);
+
+}
+
 /*
  * The vCPU has executed a HLT instruction with in-kernel mode enabled.
  */
@@ -705,19 +723,24 @@ void kvm_vcpu_block(struct kvm_vcpu *vcpu)
 {
 	DECLARE_WAITQUEUE(wait, current);
 
+	spin_lock_irq(&vcpu->wq_lock);
 	add_wait_queue(&vcpu->wq, &wait);
 
 	/*
 	 * We will block until either an interrupt or a signal wakes us up
 	 */
 	while (!kvm_cpu_has_interrupt(vcpu)
+	       && !kvm_cpu_has_pending_timer(vcpu)
 	       && !signal_pending(current)
 	       && !kvm_arch_vcpu_runnable(vcpu)) {
 		set_current_state(TASK_INTERRUPTIBLE);
+		spin_unlock_irq(&vcpu->wq_lock);
 		vcpu_put(vcpu);
 		schedule();
 		vcpu_load(vcpu);
+		spin_lock_irq(&vcpu->wq_lock);
 	}
+	spin_unlock_irq(&vcpu->wq_lock);
 
 	__set_current_state(TASK_RUNNING);
 	remove_wait_queue(&vcpu->wq, &wait);

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Nikola C. <ext...@li...> - 2008-04-08 06:00:06

On Mon, 7 Apr 2008, Marcelo Tosatti wrote:

> I think the logic to wakeup tasks in HLT is racy. Nothing prevents
> a timer event from being lost if it fires in between guest exit and
> vcpu_block().
>
> Please try the patch below.
>

Hi Marcelo, the problem persists even with patch.

[ 1905.899112]  [<ffffffff8020a55b>] ? default_idle+0x3b/0x70
[ 1905.899112]  [<ffffffff8020a520>] ? default_idle+0x0/0x70
[ 1905.899112]  [<ffffffff8020a60e>] ? cpu_idle+0x7e/0xe0
[ 1905.899112]  [<ffffffff80211630>] ? pda_init+0x30/0xb0

As I already said, if I enable ACPI, system does not boot at all (even 
with init=/bin/bash) and ends with mentioned trace immediately.

If I disable ACPI, it hangs on "Starting udev", with init=/bin/bash it at 
least boots and I can use shell. At least for some time.
Funny thing is, that If I then use poweroff, the system shuts down, ends with "System 
halted." message, but everything still seems to be working, shell, etc...

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Marcelo T. <mto...@re...> - 2008-04-19 15:26:51

On Mon, Apr 07, 2008 at 06:34:57PM -0300, Marcelo Tosatti wrote:
> On Mon, Apr 07, 2008 at 01:53:36PM +0200, Nikola Ciprich wrote:
> > Hi,
> > 
> > I also tried paravirt clock again in latest git with kvm-65 patch 
> > applied, and problem with cpu-lockups persists:
> > 
> > [10813.654806] BUG: soft lockup - CPU#0 stuck for 61s! [swapper:0]
> > [10813.655789] CPU 0:
> > [10813.656624] Modules linked in: virtio_pci virtio_ring virtio_blk virtio 
> > piix dm_snapshot dm_zero dm_mirror dm_mod ide_disk
> >   ide_core sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
> > [10813.658805] Pid: 0, comm: swapper Not tainted 2.6.25-rc7 #5
> > [10813.658805] RIP: 0010:[<ffffffff80222ab2>]  [<ffffffff80222ab2>] 
> > native_safe_halt+0x2/0x10
> > [10813.658805] RSP: 0018:ffffffff805adf50  EFLAGS: 00000296
> > [10813.658805] RAX: 000000019b08eeb0 RBX: ffffffff805f5000 RCX: 
> > 000000019b08eeb0
> > [10813.658805] RDX: 0000000000000006 RSI: 00000000356832b0 RDI: 
> > ffffffff805adf38
> > [10813.658805] RBP: 0000000000000da8 R08: 0000000000000000 R09: 
> > 0000000000000000
> > [10813.658805] R10: 0000000000000001 R11: 0000000000000002 R12: 
> > ffffffff802228ed
> > [10813.658805] R13: 00000000000132a0 R14: ffffffff80200bba R15: 
> > ffff81000100a280
> > [10813.658805] FS:  0000000000000000(0000) GS:ffffffff80576000(0000) 
> > knlGS:0000000000000000
> > [10813.658805] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> > [10813.658805] CR2: 00007fac0f852000 CR3: 0000000000201000 CR4: 
> > 00000000000006e0
> > [10813.658805] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
> > 0000000000000000
> > [10813.658805] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
> > 0000000000000400
> > [10813.658805]
> > [10813.658805] Call Trace:
> > [10813.658805]  [<ffffffff8020a55b>] ? default_idle+0x3b/0x70
> > [10813.658805]  [<ffffffff8020a520>] ? default_idle+0x0/0x70
> > [10813.658805]  [<ffffffff8020a60e>] ? cpu_idle+0x7e/0xe0
> > [10813.658805]  [<ffffffff80211630>] ? pda_init+0x30/0xb0
> > 
> > Can I somehow help to track this one down??
> 
> Hi Nikola,
> 
> I just reproduced this on a UP guest. Were you seeing the exact same
> stack trace in the guest with kvm-64 ?

I've been able to reproduce the problem. Symptoms are that when using
NOHZ vcpu0 LAPIC timer is ticking far less than the others (apparently vcpu0
is the only one ticking "correctly"):


nohz=on with kvmclock
[root@localhost ~]# cat /proc/timer_stats | grep apic
 13214,  8590 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 13214,  8589 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 13211,  8588 qemu-system-x86  apic_mmio_write (apic_timer_fn)
  389,  8587 qemu-system-x86  apic_mmio_write (apic_timer_fn)

nohz=off
 3253,  8672 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 2876,  8673 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 2543,  8674 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 2179,  8675 qemu-system-x86  apic_mmio_write (apic_timer_fn)

no-kvmclock
 1017,  8808 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 1577,  8809 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 1708,  8807 qemu-system-x86  apic_mmio_write (apic_timer_fn)
 1812,  8806 qemu-system-x86  apic_mmio_write (apic_timer_fn)


Glauber will start looking at this next week.

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Marcelo T. <mto...@re...> - 2008-04-19 16:26:20

On Sat, Apr 19, 2008 at 12:29:47PM -0300, Marcelo Tosatti wrote:
> > I just reproduced this on a UP guest. Were you seeing the exact same
> > stack trace in the guest with kvm-64 ?
> 
> I've been able to reproduce the problem. Symptoms are that when using
> NOHZ vcpu0 LAPIC timer is ticking far less than the others (apparently vcpu0
> is the only one ticking "correctly"):
> 
> 
> nohz=on with kvmclock
> [root@localhost ~]# cat /proc/timer_stats | grep apic
>  13214,  8590 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  13214,  8589 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  13211,  8588 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   389,  8587 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> 
> nohz=off
>  3253,  8672 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  2876,  8673 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  2543,  8674 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  2179,  8675 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> 
> no-kvmclock
>  1017,  8808 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  1577,  8809 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  1708,  8807 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>  1812,  8806 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> 
> 
> Glauber will start looking at this next week.

Glauber, that printk you suggested has just triggered, but in a
different condition. Guest was working fine (SMP 2-way), then suddenly:

[root@localhost bonnie++-1.03c]# ./bonnie++
You must use the "-u" switch when running as root.
usage: bonnie++ [-d scratch-dir] [-s size(Mb)[:chunk-size(b)]]
                [-n number-to-stat[:max-size[:min-size][:num-directories]]]
                [-m machine-name]
                [-r ram-size-iserial8250: too much work for irq4
n-Mb]
                [-x number-of-tests] [-u uid-to-use:gid-to-use] [-g gid-to-use]
                [-q] [-f] [-b] [-p processes | -y]

Version: 1.03c
[root@localhost bonnie++-1.03c]# ./bonnie++  dirty_portuguese_word: -361322

And there it hanged.

diff --git a/kernel/time/timekeeping.c b/kernel/time/timekeeping.c
index a3fa587..7785fcc 100644
--- a/kernel/time/timekeeping.c
+++ b/kernel/time/timekeeping.c
@@ -453,6 +453,8 @@ void update_wall_time(void)
 #else
        offset = clock->cycle_interval;
 #endif
+        if ((s64) offset < 0)
+                printk("...! %lld\n", (s64)offset);
        clock->xtime_nsec += (s64)xtime.tv_nsec << clock->shift;

        /* normally this loop will run just once, however in the

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Glauber C. <gl...@gm...> - 2008-04-19 16:22:22

On Sat, Apr 19, 2008 at 12:29 PM, Marcelo Tosatti <mto...@re...> wrote:
> On Mon, Apr 07, 2008 at 06:34:57PM -0300, Marcelo Tosatti wrote:
>
>
> > On Mon, Apr 07, 2008 at 01:53:36PM +0200, Nikola Ciprich wrote:
>  > > Hi,
>  > >
>  > > I also tried paravirt clock again in latest git with kvm-65 patch
>  > > applied, and problem with cpu-lockups persists:
>  > >
>  > > [10813.654806] BUG: soft lockup - CPU#0 stuck for 61s! [swapper:0]
>  > > [10813.655789] CPU 0:
>  > > [10813.656624] Modules linked in: virtio_pci virtio_ring virtio_blk virtio
>  > > piix dm_snapshot dm_zero dm_mirror dm_mod ide_disk
>  > >   ide_core sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd
>  > > [10813.658805] Pid: 0, comm: swapper Not tainted 2.6.25-rc7 #5
>  > > [10813.658805] RIP: 0010:[<ffffffff80222ab2>]  [<ffffffff80222ab2>]
>  > > native_safe_halt+0x2/0x10
>  > > [10813.658805] RSP: 0018:ffffffff805adf50  EFLAGS: 00000296
>  > > [10813.658805] RAX: 000000019b08eeb0 RBX: ffffffff805f5000 RCX:
>  > > 000000019b08eeb0
>  > > [10813.658805] RDX: 0000000000000006 RSI: 00000000356832b0 RDI:
>  > > ffffffff805adf38
>  > > [10813.658805] RBP: 0000000000000da8 R08: 0000000000000000 R09:
>  > > 0000000000000000
>  > > [10813.658805] R10: 0000000000000001 R11: 0000000000000002 R12:
>  > > ffffffff802228ed
>  > > [10813.658805] R13: 00000000000132a0 R14: ffffffff80200bba R15:
>  > > ffff81000100a280
>  > > [10813.658805] FS:  0000000000000000(0000) GS:ffffffff80576000(0000)
>  > > knlGS:0000000000000000
>  > > [10813.658805] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
>  > > [10813.658805] CR2: 00007fac0f852000 CR3: 0000000000201000 CR4:
>  > > 00000000000006e0
>  > > [10813.658805] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
>  > > 0000000000000000
>  > > [10813.658805] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
>  > > 0000000000000400
>  > > [10813.658805]
>  > > [10813.658805] Call Trace:
>  > > [10813.658805]  [<ffffffff8020a55b>] ? default_idle+0x3b/0x70
>  > > [10813.658805]  [<ffffffff8020a520>] ? default_idle+0x0/0x70
>  > > [10813.658805]  [<ffffffff8020a60e>] ? cpu_idle+0x7e/0xe0
>  > > [10813.658805]  [<ffffffff80211630>] ? pda_init+0x30/0xb0
>  > >
>  > > Can I somehow help to track this one down??
>  >
>  > Hi Nikola,
>  >
>  > I just reproduced this on a UP guest. Were you seeing the exact same
>  > stack trace in the guest with kvm-64 ?
>
>  I've been able to reproduce the problem. Symptoms are that when using
>  NOHZ vcpu0 LAPIC timer is ticking far less than the others (apparently vcpu0
>  is the only one ticking "correctly"):
>
>
>  nohz=on with kvmclock
>  [root@localhost ~]# cat /proc/timer_stats | grep apic
>   13214,  8590 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   13214,  8589 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   13211,  8588 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   389,  8587 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>
>  nohz=off
>   3253,  8672 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   2876,  8673 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   2543,  8674 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   2179,  8675 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>
>  no-kvmclock
>   1017,  8808 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   1577,  8809 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   1708,  8807 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>   1812,  8806 qemu-system-x86  apic_mmio_write (apic_timer_fn)
>
>
>  Glauber will start looking at this next week.
>
>
>

>From what me and marcelo discussed, I think there's a possibility that
it has marginally something to do with precision of clock calculation.
Gerd's patches address that issues. Can somebody test this with those
patches (both guest and host), while I'm off ?

-- 
Glauber Costa.
"Free as in Freedom"
http://glommer.net

"The less confident you are, the more serious you have to act."

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Marcelo T. <mto...@re...> - 2008-04-19 16:46:38

On Sat, Apr 19, 2008 at 01:22:28PM -0300, Glauber Costa wrote:
> >  I've been able to reproduce the problem. Symptoms are that when using
> >  NOHZ vcpu0 LAPIC timer is ticking far less than the others (apparently vcpu0
> >  is the only one ticking "correctly"):
> >
> >
> >  nohz=on with kvmclock
> >  [root@localhost ~]# cat /proc/timer_stats | grep apic
> >   13214,  8590 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   13214,  8589 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   13211,  8588 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   389,  8587 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >
> >  nohz=off
> >   3253,  8672 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   2876,  8673 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   2543,  8674 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   2179,  8675 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >
> >  no-kvmclock
> >   1017,  8808 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   1577,  8809 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   1708,  8807 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >   1812,  8806 qemu-system-x86  apic_mmio_write (apic_timer_fn)
> >
> >
> >  Glauber will start looking at this next week.
> >
> >
> >
> 
> >From what me and marcelo discussed, I think there's a possibility that
> it has marginally something to do with precision of clock calculation.
> Gerd's patches address that issues. Can somebody test this with those
> patches (both guest and host), while I'm off ?

Haven't seen Gerd's guest patches ?

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Gerd H. <kr...@re...> - 2008-04-21 07:15:31

Marcelo Tosatti wrote:
>> >From what me and marcelo discussed, I think there's a possibility that
>> it has marginally something to do with precision of clock calculation.
>> Gerd's patches address that issues. Can somebody test this with those
>> patches (both guest and host), while I'm off ?
> 
> Haven't seen Gerd's guest patches ? 

I'm still busy cooking them up.  I've mentioned them in a mail, but they
didn't ran over the list (yet).  Stay tuned ;)

cheers,
  Gerd

-- 
http://kraxel.fedorapeople.org/xenner/

[kvm-devel] [ RfC / patch ] kvmclock fixes

From: Gerd H. <kr...@re...> - 2008-04-21 08:34:07

Attachments: kvmclock-4.diff

Gerd Hoffmann wrote:
> Marcelo Tosatti wrote:

>> Haven't seen Gerd's guest patches ? 
> 
> I'm still busy cooking them up.  I've mentioned them in a mail, but they
> didn't ran over the list (yet).  Stay tuned ;)

It compiles, ship it!

This time as all-in one patch (both guest and host side).  Almost
untested and not (yet) splitted into pieces.

Changes:

  * Host: make kvm pv clock really compatible with xen pv clock.
  * Guest/xen: factor out some xen clock code into a separate
               source file (pvclock.[ch]), so kvm can reuse it.
  * Guest/kvm: make kvm clock compatible with xen clock by using
               the common code bits.

Tests, reviews and comments are welcome.

cheers,
  Gerd

-- 
http://kraxel.fedorapeople.org/xenner/

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: Glauber C. <gl...@gm...> - 2008-04-24 13:20:42

On Mon, Apr 21, 2008 at 4:15 AM, Gerd Hoffmann <kr...@re...> wrote:
> Marcelo Tosatti wrote:
>  >> >From what me and marcelo discussed, I think there's a possibility that
>  >> it has marginally something to do with precision of clock calculation.
>  >> Gerd's patches address that issues. Can somebody test this with those
>  >> patches (both guest and host), while I'm off ?
>  >
>  > Haven't seen Gerd's guest patches ?
>
>  I'm still busy cooking them up.  I've mentioned them in a mail, but they
>  didn't ran over the list (yet).  Stay tuned ;)
>
>  cheers,
>   Gerd
>

Just saw Gerd's patches flying around. Can anyone that is able to
reproduce this problem (a subgroup of human beings that does not
include me) test it with them applied?

If it still fails, please let me know asap

-- 
Glauber Costa.
"Free as in Freedom"
http://glommer.net

"The less confident you are, the more serious you have to act."

Re: [kvm-devel] paravirt clock stil causing hangs in kvm-65

From: <ext...@li...> - 2008-04-26 14:24:21

Hi Glauber, sorry for late reply.
well, I'm no longer able to reproduce the problem in the same way (with 
backtraces etc) as before, but anyways enabling paravirt_clock with or 
without Your patches on SMP guest still causes troubles:
on my phenom machine, the kernel hangs after printing
"PCI-GART - No AMD northbridge found."

on intel machine normal system boot seems to be terribly slow taking tens 
of seconds between steps and later hangs, using init=/bin/sh seems to work 
though.

I'm wondering how can I get the backtraces I was getting before, I have 
soft CPU lockup debugging enabled, what else could help?

cheers

n.

On Thu, 24 Apr 2008, Glauber Costa wrote:

> Just saw Gerd's patches flying around. Can anyone that is able to
> reproduce this problem (a subgroup of human beings that does not
> include me) test it with them applied?
>
> If it still fails, please let me know asap
>
> -- 
> Glauber Costa.
> "Free as in Freedom"
> http://glommer.net
>
> "The less confident you are, the more serious you have to act."
>
>