Date: Sat, 20 Oct 2007 07:18:56 +0200
From: Goran Lowkrantz <goran.lowkrantz@ismobile.com>
To: freebsd-stable@freebsd.org
Subject: Re: em 6.6.6 - watchdog timeout
Message-ID: <E3BB2F9BBF74385C6F8A3B13@[10.255.253.2]>
In-Reply-To: <DF00B541F0F46F5A6BE2414E@viglaf>
References: <DF00B541F0F46F5A6BE2414E@viglaf>

<goran.lowkrantz@ismobile.com> wrote:

> Hi,
>
> After the update of em to 6.6.6 last, I experience watchdog timeouts on a
> server running 6-STABLE.
>
> I have two identical servers with Intel D915GAV boards. Both have Intel
> PRO/1000 PCI-Express network cards.
>
> Server balder:
> em0: <Intel(R) PRO/1000 Network Connection Version - 6.6.6> port 0xac00-0xac1f mem 0xff600000-0xff61ffff,0xff620000-0xff63ffff irq 16 at device 0.0 on pci5
> em0: Ethernet address: 00:1b:21:00:48:c4
> em0: [FAST]
>
> # vmstat -i
> interrupt                  total    rate
> irq1: atkbd0                   3       0
> irq4: sio0                     2       0
> irq6: fdc0                    12       0
> irq14: ata0                   68       0
> irq16: em0 uhci3       219828879     450
> irq19: uhci1++           4287947       8
> irq22: ahc0            232717293     476
> irq23: uhci0 ehci0             1       0
> cpu0: timer            976552804    2000
> Total                 1433387009    2935
>
> # netstat -i
> Name  Mtu   Network       Address                Ipkts Ierrs      Opkts Oerrs Coll
> em0   1500  <Link#1>      00:1b:21:00:48:c4  209880531   773  206555522    84    0
> em0   1500  10.255.253/24 balder             215210996     -  212337968     -    -
> plip0 1500  <Link#2>                                 0     0          0     0    0
> lo0   16384 <Link#3>                          12040055     0   12055326     0    0
> lo0   16384 fe80:3::1     fe80:3::1                  0     -          0     -    -
> lo0   16384 localhost     ::1                        6     -          6     -    -
> lo0   16384 your-net      localhost            6249979     -    6249980     -    -
>
> 00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory Controller Hub (rev 04)
> 00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express Root Port (rev 04)
> 00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL Integrated Graphics Controller (rev 04)
> 00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 1 (rev 03)
> 00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 2 (rev 03)
> 00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 3 (rev 03)
> 00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 4 (rev 03)
> 00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #1 (rev 03)
> 00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #2 (rev 03)
> 00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #3 (rev 03)
> 00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #4 (rev 03)
> 00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB2 EHCI Controller (rev 03)
> 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3)
> 00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC Interface Bridge (rev 03)
> 00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) IDE Controller (rev 03)
> 00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA Controller (rev 03)
> 00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus Controller (rev 03)
> 05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet Controller (Copper) (rev 06)
> 06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev 01)
>
>
> Server midgard:
> em0: <Intel(R) PRO/1000 Network Connection Version - 6.2.9> port 0xac00-0xac1f mem 0xff500000-0xff51ffff,0xff520000-0xff53ffff irq 16 at device 0.0 on pci5
> em0: Ethernet address: 00:15:17:0e:05:f7
>
> admglz@midgard> vmstat -i
> interrupt                  total    rate
> irq1: atkbd0                  11       0
> irq4: sio0               2142746       0
> irq6: fdc0                    14       0
> irq14: ata0                  252       0
> irq16: em0+            666640101     164
> irq19: atapci1+          7932757       1
> irq22: ahc0             87074425      21
> cpu0: timer           3807810138     937
> Total                 4571600444    1125
>
> admglz@midgard> netstat -i
> Name  Mtu   Network       Address                Ipkts Ierrs      Opkts Oerrs Coll
> em0   1500  <Link#1>      00:15:17:0e:05:f7  343771280     0  474609731     0    0
> em0   1500  10.255.253/24 midgard            347467842     -  478700485     -    -
> plip0 1500  <Link#2>                                 0     0          0     0    0
> lo0   16384 <Link#3>                          16821054     0   16947668     0    0
> lo0   16384 fe80:3::1     fe80:3::1                  0     -          0     -    -
> lo0   16384 localhost     ::1                     2610     -       2610     -    -
> lo0   16384 your-net      localhost           12616879     -   12616879     -    -
> lo0   16384 10.255.253.12 appsrv1                    0     -          0     -    -
> lo0   16384 10.255.253.10 ca.glz.hidden-pow          0     -          0     -    -
> lo0   16384 10.255.253.11 test                       0     -          0     -    -
> lo0   16384 10.255.253.13 secure                     0     -          0     -    -
> lo0   16384 10.255.253.18 rscds.hidden-powe          7     -          0     -    -
>
> midgard# lspci
> 00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory Controller Hub (rev 04)
> 00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express Root Port (rev 04)
> 00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL Integrated Graphics Controller (rev 04)
> 00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 1 (rev 03)
> 00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 2 (rev 03)
> 00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 3 (rev 03)
> 00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 4 (rev 03)
> 00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #1 (rev 03)
> 00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #2 (rev 03)
> 00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #3 (rev 03)
> 00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #4 (rev 03)
> 00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB2 EHCI Controller (rev 03)
> 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3)
> 00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC Interface Bridge (rev 03)
> 00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) IDE Controller (rev 03)
> 00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA Controller (rev 03)
> 00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus Controller (rev 03)
> 01:00.0 SCSI storage controller: Triones Technologies, Inc. Unknown device 2310 (rev 02)
> 05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet Controller (Copper) (rev 06)
> 06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev 01)
> 06:02.0 FireWire (IEEE 1394): VIA Technologies, Inc. IEEE 1394 Host Controller (rev 46)
>
>
> When running netstat between servers balder and midgard, server balder
> get watchdog timeouts and resets the connection for a few seconds.
> Oct 19 13:12:47 balder kernel: em0: watchdog timeout -- resetting
> Oct 19 13:12:47 balder kernel: em0: link state changed to DOWN
> Oct 19 13:12:51 balder kernel: em0: link state changed to UP
>
> I have switched the cable between the two servers but get exactly the
> same problem. The switch is a Netgear GS108T with the latest firmware.
>
> The resp. dmesg.boot are attached.
>
> Please let me know if there is any other information I can supply to
> clear this.
>
> Best regards,
> Göran L
>

I have managed to get my performance back in two ways:
- Switching to polling.
- Build a kernel without USB.

So it's the interrupt sharing between the network card and a USB hub that's
the problem.

/glz
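
The shared interrupt line mentioned above can be spotted in the `vmstat -i` output itself. The following is a minimal Python sketch, not part of the original message, that flags IRQ lines naming more than one device. The function name `shared_irqs` is hypothetical, the sample is a subset of the balder output quoted above, and treating a trailing `+` as "further handlers attached" is an assumption about how FreeBSD abbreviates interrupt names.

```python
import re
from typing import Dict, List

# Subset of the `vmstat -i` output quoted for server balder.
SAMPLE = """\
interrupt                  total    rate
irq1: atkbd0                   3       0
irq16: em0 uhci3       219828879     450
irq19: uhci1++           4287947       8
irq22: ahc0            232717293     476
cpu0: timer            976552804    2000
Total                 1433387009    2935
"""

# irqN: <device names> <total> <rate>
IRQ_LINE = re.compile(r"\s*(irq\d+):\s+(.*?)\s+(\d+)\s+(\d+)\s*$")


def shared_irqs(vmstat_output: str) -> Dict[str, List[str]]:
    """Return {irq: [device names]} for IRQ lines that look shared.

    A line counts as shared when it lists several device names, or when
    a name ends in '+' (assumed to mean more handlers than the name shows).
    """
    shared: Dict[str, List[str]] = {}
    for line in vmstat_output.splitlines():
        match = IRQ_LINE.match(line)
        if match is None:
            continue  # header, cpu timer, or Total line
        irq, names = match.group(1), match.group(2).split()
        if len(names) > 1 or any(name.endswith("+") for name in names):
            shared[irq] = names
    return shared


if __name__ == "__main__":
    for irq, devices in shared_irqs(SAMPLE).items():
        print(irq, " ".join(devices))
```

On the sample, the sketch reports irq16, where em0 shares the line with uhci3, which matches the diagnosis above.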