Message-ID: <541319F5.1020502@sentex.net>
Date: Fri, 12 Sep 2014 12:06:13 -0400
From: Mike Tancsa <mike@sentex.net>
Organization: Sentex Communications
To: Jack Vogel
Subject: Re: svn commit: r267935 - head/sys/dev/e1000 (with work around?)
References: <201406262133.s5QLXXP8029811@svn.freebsd.org>
 <20140804212220.GC48614@rancor.immure.com>
 <20140805130144.GF40246@rancor.immure.com>
 <53E51D62.9000507@sentex.net> <53E52762.7040300@sentex.net>
 <53E536AC.9060304@sentex.net> <53E572B6.1090908@sentex.net>
 <5412FEAB.1050707@sentex.net>
In-Reply-To: <5412FEAB.1050707@sentex.net>
Cc: "stable@freebsd.org"
List-Id: Production branch of FreeBSD source code

On 9/12/2014 10:09 AM, Mike Tancsa wrote:
>
> FYI, I just ran into this bug on another box, with an onboard em nic, so
> I dont think its a one off hardware issue. AMD64, FreeBSD 10.0-STABLE
> #4 r270560:
> This is on an Intel MB S1200BTL ( S1200BT.86B.02.00.0035.030220120927)
>
> Unfortunately, this is also a production box so its difficult to test. I
> am going to see if I can find a similar MB to test against.

I found another board I can test with.
It takes a bit of random traffic to wedge, but I can lock up the NIC to
the point where I have to down and up it. When the NIC is wedged, running

sysctl -w em.1.debug=1

shows

Sep 12 11:05:05 backup3 kernel: Interface is RUNNING and ACTIVE
Sep 12 11:05:05 backup3 kernel: em1: hw tdh = 414, hw tdt = 980
Sep 12 11:05:05 backup3 kernel: em1: hw rdh = 768, hw rdt = 767
Sep 12 11:05:05 backup3 kernel: em1: Tx Queue Status = 1
Sep 12 11:05:05 backup3 kernel: em1: TX descriptors avail = 449
Sep 12 11:05:05 backup3 kernel: em1: Tx Descriptors avail failure = 3
Sep 12 11:05:05 backup3 kernel: em1: RX discarded packets = 0
Sep 12 11:05:05 backup3 kernel: em1: RX Next to Check = 768
Sep 12 11:05:05 backup3 kernel: em1: RX Next to Refresh = 767

em1: flags=8843 metric 0 mtu 1500
        options=4219b
        ether 00:15:17:ed:68:a4
        inet 1.1.1.2 netmask 0xffffff00 broadcast 1.1.1.255
        nd6 options=29
        media: Ethernet autoselect (1000baseT )
        status: active

The network traffic involves sending a lot of data via NFS. I found
that if I disable TSO on the NIC, it seems to fix the problem, or at
least makes it hard to reproduce. With TSO enabled, it took perhaps
30-120 seconds for the problem to manifest. With TSO disabled, I have
not run into the problem in the past 45 minutes on either my test or
production box.

        ---Mike

-- 
-------------------
Mike Tancsa, tel +1 519 651 3400
Sentex Communications, mike@sentex.net
Providing Internet services since 1994 www.sentex.net
Cambridge, Ontario Canada   http://www.tancsa.com/
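
For anyone comparing debug dumps from a wedged em interface, here is a rough sketch of how the head/tail values in the output above could be read. It assumes the usual e1000 convention that the hardware advances tdh as it consumes descriptors and the driver advances tdt as it queues them, and it assumes a TX ring of 1024 descriptors (the ring size is not shown in the dump, so treat that number as a guess). The function names are made up for illustration and are not part of any FreeBSD tool.

```python
import re

# Lines copied from the debug output in this message.
DEBUG_LOG = """\
em1: hw tdh = 414, hw tdt = 980
em1: hw rdh = 768, hw rdt = 767
em1: TX descriptors avail = 449
"""

# Assumption: ring size is not reported in the dump; 1024 is a guess.
ASSUMED_TX_RING_SIZE = 1024


def parse_ring_indices(log_text):
    """Extract the hw tdh/tdt/rdh/rdt values from debug output as ints."""
    pairs = re.findall(r"hw (tdh|tdt|rdh|rdt) = (\d+)", log_text)
    return {name: int(value) for name, value in pairs}


def pending_tx(tdh, tdt, ring_size):
    """Descriptors queued by the driver that the hardware has not consumed.

    The modulo handles the case where the tail has wrapped past the end
    of the ring while the head has not.
    """
    return (tdt - tdh) % ring_size


def tx_stalled(before, after):
    """Heuristic: head did not move between two dumps while work is pending."""
    return before["tdh"] == after["tdh"] and after["tdt"] != after["tdh"]


if __name__ == "__main__":
    idx = parse_ring_indices(DEBUG_LOG)
    print("pending TX descriptors:",
          pending_tx(idx["tdh"], idx["tdt"], ASSUMED_TX_RING_SIZE))
```

Taking two dumps a few seconds apart and passing both to tx_stalled() would show whether tdh is frozen while descriptors are still queued, which is what a wedged transmit side should look like under these assumptions.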