From owner-freebsd-net@FreeBSD.ORG Wed Apr 18 20:45:58 2012 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 84332106566C for ; Wed, 18 Apr 2012 20:45:58 +0000 (UTC) (envelope-from freebsd-net@m.gmane.org) Received: from plane.gmane.org (plane.gmane.org [80.91.229.3]) by mx1.freebsd.org (Postfix) with ESMTP id 3F6D08FC0A for ; Wed, 18 Apr 2012 20:45:58 +0000 (UTC) Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1SKblB-0001zN-E2 for freebsd-net@freebsd.org; Wed, 18 Apr 2012 22:45:53 +0200 Received: from www01.lwilke.de ([78.47.159.91]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 18 Apr 2012 22:45:53 +0200 Received: from lw by www01.lwilke.de with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 18 Apr 2012 22:45:53 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-net@freebsd.org From: Lars Wilke Date: Wed, 18 Apr 2012 20:45:28 +0000 Lines: 36 Message-ID: <8ol369-4nf.ln1@lwilke.de> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: www01.lwilke.de User-Agent: slrn/0.9.9p1 (Linux) Subject: Re: Watchdog timeout em driver 8.2-R X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 18 Apr 2012 20:45:58 -0000 Hi Jack, thanks for your response. * Jack Vogel wrote: > On Wed, Apr 18, 2012 at 7:01 AM, Lars Wilke wrote: > > Apr 13 08:53:07 san02 kernel: em1: Watchdog timeout -- resetting > > Apr 13 08:53:07 san02 kernel: em1: Queue(0) tdh = 232, hw tdt = 190 > > Apr 13 08:53:07 san02 kernel: em1: TX(0) desc avail = 31,Next TX to > > Clean = 221 > > Apr 13 08:53:07 san02 kernel: em1: Link is Down > > Apr 13 08:53:07 san02 kernel: em1: link state changed to DOWN > > > > Sometimes nothing for days, sometimes under high Network load (NFSv3), > > sometimes > > multiple times a day. I see this message/behaviour on always the same two > > of the > > four interfaces (em1 and em3). > > > > Then the NIC does not have the ACTIVE flag anymore, an ifconfig em1 up > > solves the issue. But why does it loose the ACTIVE state and why does the > > NIC reset itself in the first place? > > > > Because a watchdog reset is just that, a reset, so it causes the hardware to > reinitialize. It should come back up, I do not know why it did not, maybe > the renegotiation with the switch fails for some reason? Hm, my main problem is that it did a reset in the first place. > One thought is to get the latest em driver and see if the behavior changes, > if that driver is the distributed 8.2 its pretty old. ok then i guess i will upgrade to 8.3-R, is the driver there reasonably new? --lars