Work at SourceForge, help us to make it a better place! We have an immediate need for a Support Technician in our San Francisco or Denver office.

Close

#391 Hardware Unit Hangs with 82566DM-2 on 3.12 using 2.5.4-NAPI

open
dertman
None
in-kernel_driver
1
2014-07-23
2014-01-12
J. Kendzorra
No

Recently, the device started to hang quite frequently:

,--
Jan 12 13:57:52 nas kernel: [ 27.820279] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Jan 12 13:57:52 nas kernel: [ 27.820279] TDH <3e>
Jan 12 13:57:52 nas kernel: [ 27.820279] TDT <48>
Jan 12 13:57:52 nas kernel: [ 27.820279] next_to_use <48>
Jan 12 13:57:52 nas kernel: [ 27.820279] next_to_clean <3d>
Jan 12 13:57:52 nas kernel: [ 27.820279] buffer_info[next_to_clean]:
Jan 12 13:57:52 nas kernel: [ 27.820279] time_stamp <fffef520>
Jan 12 13:57:52 nas kernel: [ 27.820279] next_to_watch <3e>
Jan 12 13:57:52 nas kernel: [ 27.820279] jiffies <fffef633>
Jan 12 13:57:52 nas kernel: [ 27.820279] next_to_watch.status <0>
Jan 12 13:57:52 nas kernel: [ 27.820279] MAC Status <80283>
Jan 12 13:57:52 nas kernel: [ 27.820279] PHY Status <792d>
Jan 12 13:57:52 nas kernel: [ 27.820279] PHY 1000BASE-T Status <3800>
Jan 12 13:57:52 nas kernel: [ 27.820279] PHY Extended Status <3000>
Jan 12 13:57:52 nas kernel: [ 27.820279] PCI Status <10>
Jan 12 13:57:54 nas kernel: [ 29.820271] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Jan 12 13:57:54 nas kernel: [ 29.820271] TDH <3e>
Jan 12 13:57:54 nas kernel: [ 29.820271] TDT <48>
Jan 12 13:57:54 nas kernel: [ 29.820271] next_to_use <48>
Jan 12 13:57:54 nas kernel: [ 29.820271] next_to_clean <3d>
Jan 12 13:57:54 nas kernel: [ 29.820271] buffer_info[next_to_clean]:
Jan 12 13:57:54 nas kernel: [ 29.820271] time_stamp <fffef520>
Jan 12 13:57:54 nas kernel: [ 29.820271] next_to_watch <3e>
Jan 12 13:57:54 nas kernel: [ 29.820271] jiffies <fffef827>
Jan 12 13:57:54 nas kernel: [ 29.820271] next_to_watch.status <0>
Jan 12 13:57:54 nas kernel: [ 29.820271] MAC Status <80283>
Jan 12 13:57:54 nas kernel: [ 29.820271] PHY Status <792d>
Jan 12 13:57:54 nas kernel: [ 29.820271] PHY 1000BASE-T Status <3800>
Jan 12 13:57:54 nas kernel: [ 29.820271] PHY Extended Status <3000>
Jan 12 13:57:54 nas kernel: [ 29.820271] PCI Status <10>
Jan 12 13:57:56 nas kernel: [ 31.820304] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Jan 12 13:57:56 nas kernel: [ 31.820304] TDH <3e>
Jan 12 13:57:56 nas kernel: [ 31.820304] TDT <48>
Jan 12 13:57:56 nas kernel: [ 31.820304] next_to_use <48>
Jan 12 13:57:56 nas kernel: [ 31.820304] next_to_clean <3d>
Jan 12 13:57:56 nas kernel: [ 31.820304] buffer_info[next_to_clean]:
Jan 12 13:57:56 nas kernel: [ 31.820304] time_stamp <fffef520>
Jan 12 13:57:56 nas kernel: [ 31.820304] next_to_watch <3e>
Jan 12 13:57:56 nas kernel: [ 31.820304] jiffies <fffefa1b>
Jan 12 13:57:56 nas kernel: [ 31.820304] next_to_watch.status <0>
Jan 12 13:57:56 nas kernel: [ 31.820304] MAC Status <80283>
Jan 12 13:57:56 nas kernel: [ 31.820304] PHY Status <792d>
Jan 12 13:57:56 nas kernel: [ 31.820304] PHY 1000BASE-T Status <3800>
Jan 12 13:57:56 nas kernel: [ 31.820304] PHY Extended Status <3000>
Jan 12 13:57:56 nas kernel: [ 31.820304] PCI Status <10>
Jan 12 13:57:58 nas kernel: [ 33.820244] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Jan 12 13:57:58 nas kernel: [ 33.820244] TDH <3e>
Jan 12 13:57:58 nas kernel: [ 33.820244] TDT <48>
Jan 12 13:57:58 nas kernel: [ 33.820244] next_to_use <48>
Jan 12 13:57:58 nas kernel: [ 33.820244] next_to_clean <3d>
Jan 12 13:57:58 nas kernel: [ 33.820244] buffer_info[next_to_clean]:
Jan 12 13:57:58 nas kernel: [ 33.820244] time_stamp <fffef520>
Jan 12 13:57:56 nas kernel: [ 31.820304] next_to_watch <3e>
Jan 12 13:57:56 nas kernel: [ 31.820304] jiffies <fffefa1b>
Jan 12 13:57:56 nas kernel: [ 31.820304] next_to_watch.status <0>
Jan 12 13:57:56 nas kernel: [ 31.820304] MAC Status <80283>
Jan 12 13:57:56 nas kernel: [ 31.820304] PHY Status <792d>
Jan 12 13:57:56 nas kernel: [ 31.820304] PHY 1000BASE-T Status <3800>
Jan 12 13:57:56 nas kernel: [ 31.820304] PHY Extended Status <3000>
Jan 12 13:57:56 nas kernel: [ 31.820304] PCI Status <10>
Jan 12 13:57:58 nas kernel: [ 33.820244] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Jan 12 13:57:58 nas kernel: [ 33.820244] TDH <3e>
Jan 12 13:57:58 nas kernel: [ 33.820244] TDT <48>
Jan 12 13:57:58 nas kernel: [ 33.820244] next_to_use <48>
Jan 12 13:57:58 nas kernel: [ 33.820244] next_to_clean <3d>
Jan 12 13:57:58 nas kernel: [ 33.820244] buffer_info[next_to_clean]:
Jan 12 13:57:58 nas kernel: [ 33.820244] time_stamp <fffef520>
Jan 12 13:57:58 nas kernel: [ 33.820244] next_to_watch <3e>
Jan 12 13:57:58 nas kernel: [ 33.820244] jiffies <fffefc0f>
Jan 12 13:57:58 nas kernel: [ 33.820244] next_to_watch.status <0>
Jan 12 13:57:58 nas kernel: [ 33.820244] MAC Status <80283>
Jan 12 13:57:58 nas kernel: [ 33.820244] PHY Status <792d>
Jan 12 13:57:58 nas kernel: [ 33.820244] PHY 1000BASE-T Status <3800>
Jan 12 13:57:58 nas kernel: [ 33.820244] PHY Extended Status <3000>
Jan 12 13:57:58 nas kernel: [ 33.820244] PCI Status <10>
Jan 12 13:58:00 nas kernel: [ 35.820232] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Jan 12 13:58:00 nas kernel: [ 35.820232] TDH <3e>
Jan 12 13:58:00 nas kernel: [ 35.820232] TDT <48>
Jan 12 13:58:00 nas kernel: [ 35.820232] next_to_use <48>
Jan 12 13:58:00 nas kernel: [ 35.820232] next_to_clean <3d>
Jan 12 13:58:00 nas kernel: [ 35.820232] buffer_info[next_to_clean]:
Jan 12 13:58:00 nas kernel: [ 35.820232] time_stamp <fffef520>
Jan 12 13:58:00 nas kernel: [ 35.820232] next_to_watch <3e>
Jan 12 13:58:00 nas kernel: [ 35.820232] jiffies <fffefe03>
Jan 12 13:58:00 nas kernel: [ 35.820232] next_to_watch.status <0>
Jan 12 13:58:00 nas kernel: [ 35.820232] MAC Status <80283>
Jan 12 13:58:00 nas kernel: [ 35.820232] PHY Status <792d>
Jan 12 13:58:00 nas kernel: [ 35.820232] PHY 1000BASE-T Status <3800>
Jan 12 13:58:00 nas kernel: [ 35.820232] PHY Extended Status <3000>
Jan 12 13:58:00 nas kernel: [ 35.820232] PCI Status <10>
Jan 12 13:58:00 nas kernel: [ 35.831037] ------------[ cut here ]------------
Jan 12 13:58:00 nas kernel: [ 35.831047] WARNING: CPU: 0 PID: 4 at /home/apw/COD/linux/net/sched/sch_generic.c:264 dev_watchdog+0x267/0x270()
Jan 12 13:58:00 nas kernel: [ 35.831049] NETDEV WATCHDOG: eth0 (e1000e): transmit queue 0 timed out
Jan 12 13:58:00 nas kernel: [ 35.831051] Modules linked in: nfsd auth_rpcgss nfs_acl nfs lockd sunrpc fscache snd_hda_codec_analog coretemp kvm_intel kvm snd_hda_intel ppdev snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device snd_timer dcdbas microcode snd psmouse soundcore serio_raw lpc_ich parport_pc mac_hid lp parport ext2 xts gf128mul dm_crypt raid10 raid456 async_memcpy async_raid6_recov async_pq async_xor async_tx xor raid6_pq raid1 raid0 multipath linear hid_generic usbhid hid i915 ahci libahci video i2c_algo_bit drm_kms_helper drm e1000e(OF)
Jan 12 13:58:00 nas kernel: [ 35.831109] CPU: 0 PID: 4 Comm: kworker/0:0 Tainted: GF O 3.12.0-031200-generic #201311031935
Jan 12 13:58:00 nas kernel: [ 35.831111] Hardware name: Dell Inc. OptiPlex 755 /0GM819, BIOS A19 05/31/2011
Jan 12 13:58:00 nas kernel: [ 35.831127] Workqueue: events e1000_print_hw_hang [e1000e]
Jan 12 13:58:00 nas kernel: [ 35.831129] 0000000000000108 ffff880127c03cf0 ffffffff81739f5f 0000000000005f82
Jan 12 13:58:00 nas kernel: [ 35.831133] ffff880127c03d40 ffff880127c03d30 ffffffff810675fc ffff880121e74710
Jan 12 13:58:00 nas kernel: [ 35.831136] ffff88011ce6c000 ffff88011ce6c320 ffff880036709c00 0000000000000001
Jan 12 13:58:00 nas kernel: [ 35.831140] Call Trace:
Jan 12 13:58:00 nas kernel: [ 35.831142] <IRQ> [<ffffffff81739f5f>] dump_stack+0x46/0x58
Jan 12 13:58:00 nas kernel: [ 35.831151] [<ffffffff810675fc>] warn_slowpath_common+0x8c/0xc0
Jan 12 13:58:00 nas kernel: [ 35.831154] [<ffffffff810676e6>] warn_slowpath_fmt+0x46/0x50
Jan 12 13:58:00 nas kernel: [ 35.831159] [<ffffffff8109fa45>] ? sched_clock_local+0x25/0x90
Jan 12 13:58:00 nas kernel: [ 35.831163] [<ffffffff816553e7>] dev_watchdog+0x267/0x270
Jan 12 13:58:00 nas kernel: [ 35.831166] [<ffffffff81655180>] ? pfifo_fast_dequeue+0xe0/0xe0
Jan 12 13:58:00 nas kernel: [ 35.831170] [<ffffffff81074dd6>] call_timer_fn+0x46/0x160
Jan 12 13:58:00 nas kernel: [ 35.831173] [<ffffffff8106cd34>] ? irq_exit+0x84/0xe0
Jan 12 13:58:00 nas kernel: [ 35.831177] [<ffffffff81075f90>] run_timer_softirq+0x280/0x300
Jan 12 13:58:00 nas kernel: [ 35.831181] [<ffffffff8108f269>] ? enqueue_hrtimer+0x39/0xc0
Jan 12 13:58:00 nas kernel: [ 35.831184] [<ffffffff81655180>] ? pfifo_fast_dequeue+0xe0/0xe0
Jan 12 13:58:00 nas kernel: [ 35.831187] [<ffffffff8106c9d0>] do_softirq+0xe0/0x2d0
Jan 12 13:58:00 nas kernel: [ 35.831192]
[<ffffffff817511dc>] call_softirq+0x1c/0x30
Jan 12 13:58:00 nas kernel: [ 35.831196]
[<ffffffff81016f75>] do_softirq+0x65/0xa0
Jan 12 13:58:00 nas kernel: [ 35.831198]
[<ffffffff8106cd6e>] irq_exit+0xbe/0xe0
Jan 12 13:58:00 nas kernel: [ 35.831202]
[<ffffffff81751baa>] smp_apic_timer_interrupt+0x4a/0x60
Jan 12 13:58:00 nas kernel: [ 35.831205]
[<ffffffff8175051d>] apic_timer_interrupt+0x6d/0x80
Jan 12 13:58:00 nas kernel: [ 35.831206] <EOI>
[<ffffffff8113c739>] ? irq_work_queue+0x69/0xb0
Jan 12 13:58:00 nas kernel: [ 35.831214]
[<ffffffff810bcde4>] ? vprintk_emit+0x1c4/0x520
Jan 12 13:58:00 nas kernel: [ 35.831219]
[<ffffffff8149fc69>] dev_vprintk_emit+0x69/0x90
Jan 12 13:58:00 nas kernel: [ 35.831222]
[<ffffffff8149fcc9>] dev_printk_emit+0x39/0x40
Jan 12 13:58:00 nas kernel: [ 35.831227]
[<ffffffff81631d59>]
netdev_printk+0x89/0xf0
Jan 12 13:58:00 nas kernel: [ 35.831230] [<ffffffff8161465b>] ? pci_conf1_read+0xcb/0x120
Jan 12 13:58:00 nas kernel: [ 35.831233] [<ffffffff81632163>] netdev_err+0x53/0x60
Jan 12 13:58:00 nas kernel: [ 35.831241] [<ffffffffa00056d9>] e1000_print_hw_hang+0x249/0x3a0 [e1000e]
Jan 12 13:58:00 nas kernel: [ 35.831244] [<ffffffff81083d1f>] process_one_work+0x17f/0x4d0
Jan 12 13:58:00 nas kernel: [ 35.831247] [<ffffffff81084f7b>] worker_thread+0x11b/0x3d0
Jan 12 13:58:00 nas kernel: [ 35.831250] [<ffffffff81084e60>] ? manage_workers.isra.20+0x1b0/0x1b0
Jan 12 13:58:00 nas kernel: [ 35.831254] [<ffffffff8108c140>] kthread+0xc0/0xd0
Jan 12 13:58:00 nas kernel: [ 35.831257] [<ffffffff8108c080>] ? flush_kthread_worker+0xb0/0xb0
Jan 12 13:58:00 nas kernel: [ 35.831260] [<ffffffff8174f6fc>] ret_from_fork+0x7c/0xb0
Jan 12 13:58:00 nas kernel: [ 35.831263] [<ffffffff8108c080>] ? flush_kthread_worker+0xb0/0xb0
Jan 12 13:58:00 nas kernel: [ 35.831265] ---[ end trace aa4ce287363e671d ]---
Jan 12 13:58:00 nas kernel: [ 35.831275] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
Jan 12 13:58:03 nas kernel: [ 39.276866] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
`--

uname -a:
Linux nas 3.12.0-031200-generic #201311031935 SMP Mon Nov 4 00:36:54 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

modinfo e1000e:
filename: /lib/modules/3.12.0-031200-generic/kernel/drivers/net/ethernet/intel/e1000e/e1000e.ko
version: 2.5.4-NAPI
license: GPL
description: Intel(R) PRO/1000 Network Driver
author: Intel Corporation, linux.nics@intel.com
srcversion: 14FC0D45EE1DAA1B5E0DBBA
(...)

lspci:
00:19.0 Ethernet controller: Intel Corporation 82566DM-2 Gigabit Network Connection (rev 02)

I am now trying to apply a workaround found here:
ethtool -K eth0 gso off gro off tso off
Any help would be much appreciated!

Discussion

  • Todd Fujinaka
    Todd Fujinaka
    2014-01-15

    • assigned_to: dertman
     
  • Eggie
    Eggie
    2014-01-27

    I saw this invariably the same backtrace but under older Linux 2.6.32-431.3.1.el6.x86_64 (CentOS 6.5).

    driver: igb
    version: 5.0.5-k
    firmware-version: 1.61, 0x8000090e
    bus-info: 0000:09:00.0
    supports-statistics: yes
    supports-test: yes
    supports-eeprom-access: yes
    supports-register-dump: yes
    supports-priv-flags: no
    igb: Intel(R) Gigabit Ethernet Network Driver - version 5.0.5-k
    igb: Copyright (c) 2007-2013 Intel Corporation.
    alloc irq_desc for 35 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: PCI INT A -> GSI 35 (level, low) -> IRQ 35
    igb 0000:09:00.0: setting latency timer to 64
    alloc irq_desc for 180 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 180 for MSI/MSI-X
    alloc irq_desc for 181 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 181 for MSI/MSI-X
    alloc irq_desc for 182 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 182 for MSI/MSI-X
    alloc irq_desc for 183 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 183 for MSI/MSI-X
    alloc irq_desc for 184 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 184 for MSI/MSI-X
    alloc irq_desc for 185 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 185 for MSI/MSI-X
    alloc irq_desc for 186 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 186 for MSI/MSI-X
    alloc irq_desc for 187 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 187 for MSI/MSI-X
    alloc irq_desc for 188 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.0: irq 188 for MSI/MSI-X
    igb 0000:09:00.0: irq 180 for MSI/MSI-X
    igb 0000:09:00.0: irq 181 for MSI/MSI-X
    igb 0000:09:00.0: irq 182 for MSI/MSI-X
    igb 0000:09:00.0: irq 183 for MSI/MSI-X
    igb 0000:09:00.0: irq 184 for MSI/MSI-X
    igb 0000:09:00.0: irq 185 for MSI/MSI-X
    igb 0000:09:00.0: irq 186 for MSI/MSI-X
    igb 0000:09:00.0: irq 187 for MSI/MSI-X
    igb 0000:09:00.0: irq 188 for MSI/MSI-X
    igb 0000:09:00.0: added PHC on eth0
    igb 0000:09:00.0: Intel(R) Gigabit Ethernet Network Connection
    igb 0000:09:00.0: eth0: (PCIe:5.0Gb/s:Width x4) 00:25:90:8b:14:f2
    igb 0000:09:00.0: eth0: PBA No: 106100-000
    igb 0000:09:00.0: Using MSI-X interrupts. 8 rx queue(s), 8 tx queue(s)
    igb 0000:09:00.1: PCI INT B -> GSI 36 (level, low) -> IRQ 36
    igb 0000:09:00.1: setting latency timer to 64
    alloc irq_desc for 189 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 189 for MSI/MSI-X
    alloc irq_desc for 190 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 190 for MSI/MSI-X
    alloc irq_desc for 191 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 191 for MSI/MSI-X
    alloc irq_desc for 192 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 192 for MSI/MSI-X
    alloc irq_desc for 193 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 193 for MSI/MSI-X
    alloc irq_desc for 194 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 194 for MSI/MSI-X
    alloc irq_desc for 195 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 195 for MSI/MSI-X
    alloc irq_desc for 196 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 196 for MSI/MSI-X
    alloc irq_desc for 197 on node 0
    alloc kstat_irqs on node 0
    alloc irq_2_iommu on node 0
    igb 0000:09:00.1: irq 197 for MSI/MSI-X
    igb 0000:09:00.1: irq 189 for MSI/MSI-X
    igb 0000:09:00.1: irq 190 for MSI/MSI-X
    igb 0000:09:00.1: irq 191 for MSI/MSI-X
    igb 0000:09:00.1: irq 192 for MSI/MSI-X
    igb 0000:09:00.1: irq 193 for MSI/MSI-X
    igb 0000:09:00.1: irq 194 for MSI/MSI-X
    igb 0000:09:00.1: irq 195 for MSI/MSI-X
    igb 0000:09:00.1: irq 196 for MSI/MSI-X
    igb 0000:09:00.1: irq 197 for MSI/MSI-X
    igb 0000:09:00.1: added PHC on eth1
    igb 0000:09:00.1: Intel(R) Gigabit Ethernet Network Connection
    igb 0000:09:00.1: eth1: (PCIe:5.0Gb/s:Width x4) 00:25:90:8b:14:f3
    igb 0000:09:00.1: eth1: PBA No: 106100-000
    igb 0000:09:00.1: Using MSI-X interrupts. 8 rx queue(s), 8 tx queue(s)
    igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
    ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
    WARNING: at net/sched/sch_generic.c:261 dev_watchdog+0x26b/0x280() (Not tainted)
    Hardware name: X9DRW-7TPF+
    NETDEV WATCHDOG: eth0 (igb): transmit queue 5 timed out
    Modules linked in: tpm_infineon nt3gd(U) cpufreq_ondemand acpi_cpufreq freq_table mperf ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 microcode acpi_pad ses enclosure iTCO_wdt iTCO_vendor_support igb i2c_algo_bit ixgbe dca ptp pps_core mdio serio_raw i2c_i801 i2c_core sg lpc_ich mfd_core shpchp ext4 jbd2 mbcache aesni_intel ablk_helper cryptd lrw glue_helper aes_x86_64 aes_generic xts gf128mul dm_crypt sd_mod crc_t10dif aacraid sr_mod cdrom isci libsas scsi_transport_sas ahci wmi dm_mirror dm_region_hash dm_log dm_mod [last unloaded: nt3gd]
    Pid: 4, comm: ksoftirqd/0 Not tainted 2.6.32-431.3.1.el6.x86_64 #1
    Call Trace:
    <IRQ> [<ffffffff81071e27>] ? warn_slowpath_common+0x87/0xc0
    [<ffffffff81071f16>] ? warn_slowpath_fmt+0x46/0x50
    [<ffffffff8147b75b>] ? dev_watchdog+0x26b/0x280
    [<ffffffff810ec287>] ? cpu_quiet_msk+0x77/0x130
    [<ffffffff810ecb5a>] ? rcu_process_callbacks+0x25a/0x350
    [<ffffffff8147b4f0>] ? dev_watchdog+0x0/0x280
    [<ffffffff81084b07>] ? run_timer_softirq+0x197/0x340
    [<ffffffff8107a8e1>] ?
    do_softirq+0xc1/0x1e0
    [<ffffffff8100c30c>] ? call_softirq+0x1c/0x30
    [<ffffffff8100c30c>] ? call_softirq+0x1c/0x30
    <EOI> [<ffffffff8100fa75>] ? do_softirq+0x65/0xa0
    [<ffffffff8107a4a0>] ? ksoftirqd+0x80/0x110
    [<ffffffff8107a420>] ? ksoftirqd+0x0/0x110
    [<ffffffff8109af06>] ? kthread+0x96/0xa0
    [<ffffffff8100c20a>] ? child_rip+0xa/0x20
    [<ffffffff8109ae70>] ? kthread+0x0/0xa0
    [<ffffffff8100c200>] ? child_rip+0x0/0x20
    ---[ end trace eb47724bdd96b5d1 ]---
    igb 0000:09:00.0: eth0: Reset adapter
    igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
    igb 0000:09:00.0: eth0: Reset adapter
    igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
    Bridge firewalling registered
    device eth0 entered promiscuous mode
    device eth0 left promiscuous mode

     
    • Todd Fujinaka
      Todd Fujinaka
      2014-01-27

      This has nothing to do with the parent issue. Please file a separate bug.

       
  • J. Kendzorra
    J. Kendzorra
    2014-01-27

    This seems to be a real hardware issue; downgraded to an older kernel due to various other reasons, with the same result:

    ,--
    Jan 27 17:36:22 nas kernel: [78124.808242] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
    Jan 27 17:36:22 nas kernel: [78124.808242] TDH <ee>
    Jan 27 17:36:22 nas kernel: [78124.808242] TDT <33>
    Jan 27 17:36:22 nas kernel: [78124.808242] next_to_use <33>
    Jan 27 17:36:22 nas kernel: [78124.808242] next_to_clean <ed>
    Jan 27 17:36:22 nas kernel: [78124.808242] buffer_info[next_to_clean]:
    Jan 27 17:36:22 nas kernel: [78124.808242] time_stamp <10128ddfa>
    Jan 27 17:36:22 nas kernel: [78124.808242] next_to_watch <ee>
    Jan 27 17:36:22 nas kernel: [78124.808242] jiffies <10128e0ca>
    Jan 27 17:36:22 nas kernel: [78124.808242] next_to_watch.status <0>
    Jan 27 17:36:22 nas kernel: [78124.808242] MAC Status <80283>
    Jan 27 17:36:22 nas kernel: [78124.808242] PHY Status <792d>
    Jan 27 17:36:22 nas kernel: [78124.808242] PHY 1000BASE-T Status <3800>
    Jan 27 17:36:22 nas kernel: [78124.808242] PHY Extended Status <3000>
    Jan 27 17:36:22 nas kernel: [78124.808242] PCI Status <10>
    Jan 27 17:36:24 nas kernel: [78126.808231] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
    Jan 27 17:36:24 nas kernel: [78126.808231] TDH <ee>
    Jan 27 17:36:24 nas kernel: [78126.808231] TDT <33>
    Jan 27 17:36:24 nas kernel: [78126.808231] next_to_use <33>
    Jan 27 17:36:24 nas kernel: [78126.808231] next_to_clean <ed>
    Jan 27 17:36:24 nas kernel: [78126.808231] buffer_info[next_to_clean]:
    Jan 27 17:36:24 nas kernel: [78126.808231] time_stamp <10128ddfa>
    Jan 27 17:36:24 nas kernel: [78126.808231] next_to_watch <ee>
    Jan 27 17:36:24 nas kernel: [78126.808231] jiffies <10128e2be>
    Jan 27 17:36:24 nas kernel: [78126.808231] next_to_watch.status <0>
    Jan 27 17:36:24 nas kernel: [78126.808231] MAC Status <80283>
    Jan 27 17:36:24 nas kernel: [78126.808231] PHY Status <792d>
    Jan 27 17:36:24 nas kernel: [78126.808231] PHY 1000BASE-T Status <3800>
    Jan 27 17:36:24 nas kernel: [78126.808231] PHY Extended Status <3000>
    Jan 27 17:36:24 nas kernel: [78126.808231] PCI Status <10>
    Jan 27 17:36:26 nas kernel: [78128.808271] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
    Jan 27 17:36:26 nas kernel: [78128.808271] TDH <ee>
    Jan 27 17:36:26 nas kernel: [78128.808271] TDT <33>
    Jan 27 17:36:26 nas kernel: [78128.808271] next_to_use <33>
    Jan 27 17:36:26 nas kernel: [78128.808271] next_to_clean <ed>
    Jan 27 17:36:26 nas kernel: [78128.808271] buffer_info[next_to_clean]:
    Jan 27 17:36:26 nas kernel: [78128.808271] time_stamp <10128ddfa>
    Jan 27 17:36:26 nas kernel: [78128.808271] next_to_watch <ee>
    Jan 27 17:36:26 nas kernel: [78128.808271] jiffies <10128e4b2>
    Jan 27 17:36:26 nas kernel: [78128.808271] next_to_watch.status <0>
    Jan 27 17:36:26 nas kernel: [78128.808271] MAC Status <80283>
    Jan 27 17:36:26 nas kernel: [78128.808271] PHY Status <792d>
    Jan 27 17:36:26 nas kernel: [78128.808271] PHY 1000BASE-T Status <3800>
    Jan 27 17:36:26 nas kernel: [78128.808271] PHY Extended Status <3000>
    Jan 27 17:36:26 nas kernel: [78128.808271] PCI Status <10>
    Jan 27 17:36:28 nas kernel: [78130.808263] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
    Jan 27 17:36:28 nas kernel: [78130.808263] TDH <ee>
    Jan 27 17:36:28 nas kernel: [78130.808263] TDT <33>
    Jan 27 17:36:28 nas kernel: [78130.808263] next_to_use <33>
    Jan 27 17:36:28 nas kernel: [78130.808263] next_to_clean <ed>
    Jan 27 17:36:28 nas kernel: [78130.808263] buffer_info[next_to_clean]:
    Jan 27 17:36:28 nas kernel: [78130.808263] time_stamp <10128ddfa>
    Jan 27 17:36:28 nas kernel: [78130.808263] next_to_watch <ee>
    Jan 27 17:36:28 nas kernel: [78130.808263] jiffies <10128e6a6>
    Jan 27 17:36:28 nas kernel: [78130.808263] next_to_watch.status <0>
    Jan 27 17:36:28 nas kernel: [78130.808263] MAC Status <80283>
    Jan 27 17:36:28 nas kernel: [78130.808263] PHY Status <792d>
    Jan 27 17:36:28 nas kernel: [78130.808263] PHY 1000BASE-T Status <3800>
    Jan 27 17:36:28 nas kernel: [78130.808263] PHY Extended Status <3000>
    Jan 27 17:36:28 nas kernel: [78130.808263] PCI Status <10>
    Jan 27 17:36:30 nas kernel: [78132.808254] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
    Jan 27 17:36:30 nas kernel: [78132.808254] TDH <ee>
    Jan 27 17:36:30 nas kernel: [78132.808254] TDT <33>
    Jan 27 17:36:30 nas kernel: [78132.808254] next_to_use <33>
    Jan 27 17:36:30 nas kernel: [78132.808254] next_to_clean <ed>
    Jan 27 17:36:30 nas kernel: [78132.808254] buffer_info[next_to_clean]:
    Jan 27 17:36:30 nas kernel: [78132.808254] time_stamp <10128ddfa>
    Jan 27 17:36:30 nas kernel: [78132.808254] next_to_watch <ee>
    Jan 27 17:36:30 nas kernel: [78132.808254] jiffies <10128e89a>
    Jan 27 17:36:30 nas kernel: [78132.808254] next_to_watch.status <0>
    Jan 27 17:36:30 nas kernel: [78132.808254] MAC Status <80283>
    Jan 27 17:36:30 nas kernel: [78132.808254] PHY Status <792d>
    Jan 27 17:36:30 nas kernel: [78132.808254] PHY 1000BASE-T Status <3800>
    Jan 27 17:36:30 nas kernel: [78132.808254] PHY Extended Status <3000>
    Jan 27 17:36:30 nas kernel: [78132.808254] PCI Status <10>
    Jan 27 17:36:30 nas kernel: [78132.818759] ------------[ cut here ]------------
    Jan 27 17:36:30 nas kernel: [78132.818768] WARNING: at /build/buildd/linux-lts-raring-3.8.0/net/sched/sch_generic.c:254 dev_watchdog+0x262/0x270()
    Jan 27 17:36:30 nas kernel: [78132.818770] Hardware name: OptiPlex 755
    Jan 27 17:36:30 nas kernel: [78132.818772] NETDEV WATCHDOG: eth0 (e1000e): transmit queue 0 timed out
    Jan 27 17:36:30 nas kernel: [78132.818774] Modules linked in: ip6table_filter(F) ip6_tables(F) ebtable_nat(F) ebtables(F) ipt_MASQUERADE(F) iptable_nat(F) nf_nat_ipv4(F) nf_nat(F) nf_conntrack_ipv4(F) nf_defrag_ipv4(F) xt_state(F) nf_conntrack(F) ipt_REJECT(F) xt_CHECKSUM(F) iptable_mangle(F) xt_tcpudp(F) iptable_filter(F) ip_tables(F) x_tables(F) bridge(F) stp(F) llc(F) nfsd(F) nfs_acl(F) auth_rpcgss(F) nfs(F) fscache(F) lockd(F) sunrpc(F) rc_tt_1500(OF) stb6100(OF) lnbp22(OF) stb0899(OF) snd_hda_codec_analog(F) dvb_usb_pctv452e(OF) dvb_usb(OF) snd_hda_intel(F) snd_hda_codec(F) dvb_core(OF) coretemp(F) kvm_intel(F) kvm(F) rc_core(OF) snd_hwdep(F) snd_pcm(F) psmouse(F) ppdev(F) snd_timer(F) parport_pc(F) serio_raw(F) snd(F) ttpci_eeprom(OF) lp(F) soundcore(F) parport(F) mac_hid(F) dcdbas(F) snd_page_alloc(F) lpc_ich(F) microcode(F) dm_crypt(F) raid10(F) raid456(F) async_pq(F) async_xor(F) xor(F) async_memcpy(F) async_raid6_recov(F) raid6_pq(F) async_tx(F) raid0(F) multipath(F) linear(F) hid_generic(F) usbhid(
    Jan 27 17:36:30 nas kernel: F) hid(F) raid1(F) e1000e(F) i915(F) drm_kms_helper(F) ahci(F) libahci(F) drm(F) i2c_algo_bit(F) video(F) [last unloaded: mei]
    Jan 27 17:36:30 nas kernel: [78132.818848] Pid: 28392, comm: kworker/1:0 Tainted: GF O 3.8.0-29-generic #42~precise1-Ubuntu
    Jan 27 17:36:30 nas kernel: [78132.818850] Call Trace:
    Jan 27 17:36:30 nas kernel: [78132.818852] <IRQ> [<ffffffff81059b0f>] warn_slowpath_common+0x7f/0xc0
    Jan 27 17:36:30 nas kernel: [78132.818861] [<ffffffff81059c06>] warn_slowpath_fmt+0x46/0x50
    Jan 27 17:36:30 nas kernel: [78132.818867] [<ffffffff81602062>] dev_watchdog+0x262/0x270
    Jan 27 17:36:30 nas kernel: [78132.818870] [<ffffffff81601e00>] ? pfifo_fast_dequeue+0xe0/0xe0
    Jan 27 17:36:30 nas kernel: [78132.818874] [<ffffffff8106995b>] call_timer_fn+0x3b/0x150
    Jan 27 17:36:30 nas kernel: [78132.818879] [<ffffffff815220c6>] ? ehci_enable_event+0x46/0x80
    Jan 27 17:36:30 nas kernel: [78132.818882] [<ffffffff8106b427>] run_timer_softirq+0x267/0x2c0
    Jan 27 17:36:30 nas kernel: [78132.818885] [<ffffffff81522dab>] ? ehci_hrtimer_func+0xbb/0xd0
    Jan 27 17:36:30 nas kernel: [78132.818888] [<ffffffff81601e00>] ? pfifo_fast_dequeue+0xe0/0xe0
    Jan 27 17:36:30 nas kernel: [78132.818892] [<ffffffff810ae974>] ? ktime_get+0x54/0xe0
    Jan 27 17:36:30 nas kernel: [78132.818895] [<ffffffff81062620>] do_softirq+0xc0/0x240
    Jan 27 17:36:30 nas kernel: [78132.818898]
    [<ffffffff810b6504>] ? tick_program_event+0x24/0x30
    Jan 27 17:36:30 nas kernel: [78132.818903]
    [<ffffffff816fdd5c>] call_softirq+0x1c/0x30
    Jan 27 17:36:30 nas kernel: [78132.818909]
    [<ffffffff81016775>] do_softirq+0x65/0xa0
    Jan 27 17:36:30 nas kernel: [78132.818912]
    [<ffffffff810628fe>] irq_exit+0x8e/0xb0
    Jan 27 17:36:30 nas kernel: [78132.818915]
    [<ffffffff816fe6de>] smp_apic_timer_interrupt+0x6e/0x99
    Jan 27 17:36:30 nas kernel: [78132.818918]
    [<ffffffff816fd61d>] apic_timer_interrupt+0x6d/0x80
    Jan 27 17:36:30 nas kernel: [78132.818919] <EOI>
    [<ffffffff81085872>] ? up+0x32/0x50
    Jan 27 17:36:30 nas kernel: [78132.818926]
    [<ffffffff8105b680>] ? vprintk_emit+0x170/0x490
    Jan 27 17:36:30 nas kernel: [78132.818930]
    [<ffffffff81461749>] dev_vprintk_emit+0x69/0x90
    Jan 27 17:36:30 nas kernel: [78132.818934]
    [<ffffffff814617a9>] dev_printk_emit+0x39/0x40
    Jan 27 17:36:30 nas kernel: [78132.818939]
    [<ffffffff815e0fc9>]
    netdev_printk+0x89/0xf0
    Jan 27 17:36:30 nas kernel: [78132.818943] [<ffffffff815c424b>] ? pci_conf1_read+0xcb/0x120
    Jan 27 17:36:30 nas kernel: [78132.818946] [<ffffffff815e1293>] netdev_err+0x53/0x60
    Jan 27 17:36:30 nas kernel: [78132.818959] [<ffffffffa008fe9d>] e1000_print_hw_hang+0x19d/0x340 [e1000e]
    Jan 27 17:36:30 nas kernel: [78132.818962] [<ffffffff81078ce1>] process_one_work+0x141/0x490
    Jan 27 17:36:30 nas kernel: [78132.818965] [<ffffffff81079ca8>] worker_thread+0x168/0x400
    Jan 27 17:36:30 nas kernel: [78132.818968] [<ffffffff81079b40>] ? manage_workers+0x120/0x120
    Jan 27 17:36:30 nas kernel: [78132.818972] [<ffffffff8107f1b0>] kthread+0xc0/0xd0
    Jan 27 17:36:30 nas kernel: [78132.818975] [<ffffffff8107f0f0>] ? flush_kthread_worker+0xb0/0xb0
    Jan 27 17:36:30 nas kernel: [78132.818978] [<ffffffff816fc82c>] ret_from_fork+0x7c/0xb0
    Jan 27 17:36:30 nas kernel: [78132.818981] [<ffffffff8107f0f0>] ? flush_kthread_worker+0xb0/0xb0
    Jan 27 17:36:30 nas kernel: [78132.818983] ---[ end trace ffd075d38d863f00 ]---
    `--

    ,-- # modinfo e1000e
    filename: /lib/modules/3.8.0-29-generic/kernel/drivers/net/ethernet/intel/e1000e/e1000e.ko
    version: 2.1.4-k
    `--