From owner-freebsd-net@FreeBSD.ORG Wed Aug 17 13:48:55 2011 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 87CD2106564A for ; Wed, 17 Aug 2011 13:48:55 +0000 (UTC) (envelope-from mike@sentex.net) Received: from smarthost1.sentex.ca (smarthost1-6.sentex.ca [IPv6:2607:f3e0:0:1::12]) by mx1.freebsd.org (Postfix) with ESMTP id 4BBB58FC08 for ; Wed, 17 Aug 2011 13:48:55 +0000 (UTC) Received: from [IPv6:2607:f3e0:0:4:f025:8813:7603:7e4a] (saphire3.sentex.ca [IPv6:2607:f3e0:0:4:f025:8813:7603:7e4a]) by smarthost1.sentex.ca (8.14.4/8.14.4) with ESMTP id p7HDmrNT072508; Wed, 17 Aug 2011 09:48:53 -0400 (EDT) (envelope-from mike@sentex.net) Message-ID: <4E4BC6CB.40305@sentex.net> Date: Wed, 17 Aug 2011 09:48:59 -0400 From: Mike Tancsa Organization: Sentex Communications User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.2.13) Gecko/20101207 Thunderbird/3.1.7 MIME-Version: 1.0 To: Sami Halabi References: In-Reply-To: X-Enigmail-Version: 1.1.1 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.71 on IPv6:2607:f3e0:0:1::12 Cc: freebsd-net@freebsd.org Subject: Re: strange problem FreeBSD-8.1-Release X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 17 Aug 2011 13:48:55 -0000 On 8/17/2011 7:46 AM, Sami Halabi wrote: > Hi, > I have a FBSD router base on version 8.1-RELEASE-p4. > Today at 13:00 approx i had a sudden fall down of the traffic on the graphs > on all ports. > the strange thing is that no connection was lost, but the traffic like went > down. > on the logs i had these lines only: > Aug 17 12:59:45 bgpServer kernel: em2: Watchdog timeout -- resetting > Aug 17 12:59:45 bgpServer kernel: em2: link state changed to DOWN > Aug 17 12:59:48 bgpServer kernel: em2: link state changed to UP > > anyone ever faced this problem? any ideas how i can track down what happened > there? Until I saw the em errors, I was thinking you are just running into the 32bit counter limitations of snmp. e.g. an example graph at http://www.tancsa.com/overflow.png more info at http://www.cisco.com/en/US/tech/tk648/tk362/technologies_q_and_a_item09186a00800b69ac.shtml However, the em2 suggests a possible driver error. There have been a number of bug fixes to the em driver since 8.1-R. If possible, going to 8.2 or even RELENG_8 might help. also, what does sysctl -a dev.em show ? ---Mike -- ------------------- Mike Tancsa, tel +1 519 651 3400 Sentex Communications, mike@sentex.net Providing Internet services since 1994 www.sentex.net Cambridge, Ontario Canada http://www.tancsa.com/