From owner-freebsd-stable@FreeBSD.ORG Sat Oct 20 05:19:01 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3C9A116A418 for ; Sat, 20 Oct 2007 05:19:01 +0000 (UTC) (envelope-from goran.lowkrantz@ismobile.com) Received: from mail.ismobile.com (mail.ismobile.com [62.119.44.68]) by mx1.freebsd.org (Postfix) with ESMTP id 8240613C45D for ; Sat, 20 Oct 2007 05:18:59 +0000 (UTC) (envelope-from goran.lowkrantz@ismobile.com) Received: from mail.ismobile.com (localhost [127.0.0.1]) by mail.ismobile.com (Postfix) with ESMTP id E378A33C02 for ; Sat, 20 Oct 2007 07:18:57 +0200 (CEST) DKIM-Signature: v=0.5; a=rsa-sha1; c=relaxed; d=ismobile.com; h=received:date:from:to:subject:message-id:in-reply-to:references:x-mailer:mime-version:content-type:content-transfer-encoding:content-disposition; q=dns/txt; s=selector1; bh=9u4Z+nxDpAA9hehwSyuZVk7e0sI=; b=jMa2UC2ZYhDF6zFDpwsgBB6K92eNeBnmtcOYwdaiRy7adwnk02hjUxvckgAfMxfXD4jSm1CqUAAwP0UodDVa6JMnQXXnEvz5uxnY8O6dzNINe6eXt9UZsL5Nye2irg7RWO0si8sPVbOZetiXaTR6DV+eM4IXgnewF1/GDuDAfds= Received: from [10.255.253.2] (modgunn.iii-norr.com [213.242.135.174]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.ismobile.com (Postfix) with ESMTP id C197E33C01 for ; Sat, 20 Oct 2007 07:18:57 +0200 (CEST) Date: Sat, 20 Oct 2007 07:18:56 +0200 From: Goran Lowkrantz To: freebsd-stable@freebsd.org Message-ID: In-Reply-To: References: X-Mailer: Mulberry/4.0.8 (Win32) MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Subject: Re: em 6.6.6 - watchdog timeout X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 20 Oct 2007 05:19:01 -0000 wrote: > Hi, > > After the update of em to 6.6.6 last, I experience watchdog timeouts on a > server running 6-STABLE. > > I have two identical servers with Intel D915GAV boards. Both have Intel > PRO/1000 PCI-Express network cards. > > Server balder: > em0: port > 0xac00-0xac1f mem 0xff600000-0xff61ffff,0xff620000-0xff63ffff irq 16 at > device 0.0 on pci5 > em0: Ethernet address: 00:1b:21:00:48:c4 > em0: [FAST] > ># vmstat -i > interrupt total rate > irq1: atkbd0 3 0 > irq4: sio0 2 0 > irq6: fdc0 12 0 > irq14: ata0 68 0 > irq16: em0 uhci3 219828879 450 > irq19: uhci1++ 4287947 8 > irq22: ahc0 232717293 476 > irq23: uhci0 ehci0 1 0 > cpu0: timer 976552804 2000 > Total 1433387009 2935 > ># netstat -i > Name Mtu Network Address Ipkts Ierrs Opkts Oerrs > Coll > em0 1500 00:1b:21:00:48:c4 209880531 773 206555522 > 84 0 > em0 1500 10.255.253/24 balder 215210996 - 212337968 > - - > plip0 1500 0 0 0 0 > 0 > lo0 16384 12040055 0 12055326 0 > 0 > lo0 16384 fe80:3::1 fe80:3::1 0 - 0 - > - > lo0 16384 localhost ::1 6 - 6 - > - > lo0 16384 your-net localhost 6249979 - 6249980 - > - > > 00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory > Controller Hub (rev 04) > 00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express > Root Port (rev 04) > 00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL > Integrated Graphics Controller (rev 04) > 00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 1 (rev 03) > 00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 2 (rev 03) > 00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 3 (rev 03) > 00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 4 (rev 03) > 00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #1 (rev 03) > 00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #2 (rev 03) > 00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #3 (rev 03) > 00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #4 (rev 03) > 00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB2 EHCI Controller (rev 03) > 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3) > 00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC > Interface Bridge (rev 03) > 00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) IDE Controller (rev 03) > 00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA > Controller (rev 03) > 00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > SMBus Controller (rev 03) > 05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet > Controller (Copper) (rev 06) > 06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev > 01) > > > Server midgard: > em0: port > 0xac00-0xac1f mem 0xff500000-0xff51ffff,0xff520000-0xff53ffff irq 16 at > device 0.0 on pci5 > em0: Ethernet address: 00:15:17:0e:05:f7 > admglz@midgard> vmstat -i > interrupt total rate > irq1: atkbd0 11 0 > irq4: sio0 2142746 0 > irq6: fdc0 14 0 > irq14: ata0 252 0 > irq16: em0+ 666640101 164 > irq19: atapci1+ 7932757 1 > irq22: ahc0 87074425 21 > cpu0: timer 3807810138 937 > Total 4571600444 1125 > > admglz@midgard> netstat -i > Name Mtu Network Address Ipkts Ierrs Opkts Oerrs > Coll > em0 1500 00:15:17:0e:05:f7 343771280 0 474609731 > 0 0 > em0 1500 10.255.253/24 midgard 347467842 - 478700485 > - - > plip0 1500 0 0 0 0 > 0 > lo0 16384 16821054 0 16947668 0 > 0 > lo0 16384 fe80:3::1 fe80:3::1 0 - 0 - > - > lo0 16384 localhost ::1 2610 - 2610 - > - > lo0 16384 your-net localhost 12616879 - 12616879 - > - > lo0 16384 10.255.253.12 appsrv1 0 - 0 - > - > lo0 16384 10.255.253.10 ca.glz.hidden-pow 0 - 0 - > - > lo0 16384 10.255.253.11 test 0 - 0 - > - > lo0 16384 10.255.253.13 secure 0 - 0 - > - > lo0 16384 10.255.253.18 rscds.hidden-powe 7 - 0 - > - > > midgard# lspci > 00:00.0 Host bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL Memory > Controller Hub (rev 04) > 00:01.0 PCI bridge: Intel Corporation 82915G/P/GV/GL/PL/910GL PCI Express > Root Port (rev 04) > 00:02.0 VGA compatible controller: Intel Corporation 82915G/GV/910GL > Integrated Graphics Controller (rev 04) > 00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 1 (rev 03) > 00:1c.1 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 2 (rev 03) > 00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 3 (rev 03) > 00:1c.3 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > PCI Express Port 4 (rev 03) > 00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #1 (rev 03) > 00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #2 (rev 03) > 00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #3 (rev 03) > 00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB UHCI #4 (rev 03) > 00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) USB2 EHCI Controller (rev 03) > 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d3) > 00:1f.0 ISA bridge: Intel Corporation 82801FB/FR (ICH6/ICH6R) LPC > Interface Bridge (rev 03) > 00:1f.1 IDE interface: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 > Family) IDE Controller (rev 03) > 00:1f.2 IDE interface: Intel Corporation 82801FB/FW (ICH6/ICH6W) SATA > Controller (rev 03) > 00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) > SMBus Controller (rev 03) > 01:00.0 SCSI storage controller: Triones Technologies, Inc. Unknown > device 2310 (rev 02) > 05:00.0 Ethernet controller: Intel Corporation 82572EI Gigabit Ethernet > Controller (Copper) (rev 06) > 06:01.0 SCSI storage controller: Adaptec AHA-2940U/UW/D / AIC-7881U (rev > 01) > 06:02.0 FireWire (IEEE 1394): VIA Technologies, Inc. IEEE 1394 Host > Controller (rev 46) > > > When running netstat between servers balder and midgard, server balder > get watchdog timeouts and resets the connection for a few seconds. > Oct 19 13:12:47 balder kernel: em0: watchdog timeout -- resetting > Oct 19 13:12:47 balder kernel: em0: link state changed to DOWN > Oct 19 13:12:51 balder kernel: em0: link state changed to UP > > I have switched the cable between the two servers but get exactly the > same problem. The switch is a Netgear GS108T with the latest firmware. > > The resp. dmesg.boot are attached. > > Please let me know if there is any other information I can supply to > clear this. > > Best regards, > G=F6ran L > I have managed to get my performance back in two ways: - Switching to polling. - Build a kernel without USB. So it's the interrupt sharing between the network card and a USB hub that's = the problem. /glz