From: Andrew Snow
To: freebsd-net@freebsd.org
Date: Sat, 23 Jun 2007 12:58:39 +1000
Message-ID: <467C8C5F.8050206@modulus.org>
Subject: watchdog timeout problem with freebsd 6.2-stable and v6.4.1 if_em driver

Hi,

I have a problem with Pro/1000 cards in FreeBSD, as follows:

System: Supermicro 1RU server
CPU: Intel(R) Core(TM)2 CPU 4400 @ 2.00GHz
OS: FreeBSD 6.2-STABLE (Tue May 29 03:19:28 EST 2007) amd64 (64 bit mode, SMP kernel)
Driver: 6.4.1 kernel module (downloaded from Intel's website)

em0: flags=8843 mtu 1500

em0: port 0x5000-0x501f mem 0xe0300000-0xe031ffff irq 16 at device 0.0 on pci13
em0: Ethernet address: 00:30:48:8b:54:30
em1: port 0x6000-0x601f mem 0xe0400000-0xe041ffff irq 17 at device 0.0 on pci14
em1: Ethernet address: 00:30:48:8b:54:31

em0@pci13:0:0: class=0x020000 card=0x108c15d9 chip=0x108c8086 rev=0x03 hdr=0x00
    vendor   = 'Intel Corporation'
    device   = 'PRO/1000 PM'
    class    = network
    subclass = ethernet
em1@pci14:0:0: class=0x020000 card=0x109a15d9 chip=0x109a8086 rev=0x00 hdr=0x00
    vendor   = 'Intel Corporation'
    class    = network
    subclass = ethernet

em1 is connected directly to another Pro/1000 network card and works without any problems whatsoever.

em0 is connected to a Netgear Gigabit SmartSwitch, and all our cables are Belkin CAT6 cables.

We have TWO identical machines, and both suffer from the "watchdog timeout" problem, as follows:

After medium-heavy traffic, the NIC locks up completely and no traffic passes for a long time, perhaps longer than half an hour. Then it recovers and prints this to syslog:

em0: watchdog timeout -- resetting
em0: link state changed to DOWN
em0: link state changed to UP

The problem was much worse with the em driver supplied with FreeBSD 6.2, where any large packet would trigger it; upgrading to the 6.4.1 driver helped a lot.

The problem still happens with SMP completely disabled.

The only workaround that has been completely successful for me so far is to disable gigabit and use 100baseTX instead.

The Netgear switch is configured for VLANs, but outbound VLAN tagging is not enabled for this switch port, and the em0 interface is not configured for VLANs.

Any ideas?

Thanks,

- Andrew
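
For reference, a minimal sketch of how the 100baseTX workaround mentioned above could be made persistent in /etc/rc.conf. The address and netmask are hypothetical placeholders, and the full-duplex option is an assumption rather than the setting actually in use here; when speed and duplex are forced on the NIC, the switch port generally needs to be forced to match.

```shell
# /etc/rc.conf fragment (illustrative sketch, not the configuration from this report)
# 192.0.2.10 and the netmask are placeholder values; substitute the real ones.
# "media 100baseTX" pins em0 to 100 Mbit/s instead of negotiating gigabit.
# "mediaopt full-duplex" is an assumption; the switch port must use the same setting.
ifconfig_em0="inet 192.0.2.10 netmask 255.255.255.0 media 100baseTX mediaopt full-duplex"
```

The same media options can be tried on a running system by passing them to ifconfig for em0 before committing them to rc.conf.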