From owner-freebsd-net@FreeBSD.ORG Tue Aug 16 18:15:30 2011
Message-ID: <4E4AB3BE.4090603@sentex.net>
Date: Tue, 16 Aug 2011 14:15:26 -0400
From: Mike Tancsa <mike@sentex.net>
Organization: Sentex Communications
To: Ask Bjørn Hansen
Cc: embedded@freebsd.org, net@freebsd.org
Subject: Re: system locks up with vr driver on alix board
List-Id: Networking and TCP/IP with FreeBSD

On 8/16/2011 1:41 PM, Ask Bjørn Hansen wrote:
> Hi everyone,
>
> Over the weekend I upgraded a couple of NanoBSD based systems (on PC Engines Alix boards) from 7.4 to 8.2. The box is a small firewall and have been running stable for several years.
>
> Since then the busy systems consistently lock up.
> I had a 'top -bS -s1' running and it stopped a couple hours before the system locked up after showing basically 100% interrupts for a few minutes. (Divided ~50/50 between the two "vr" interfaces that gets most of the traffic). Another terminal tailing /var/log/messages kept running for another hour or so.
>
> An hour or two after the log stopped showing; the system stopped routing packets, but frustratingly kept sending CARP messages out so the secondary firewall didn't pick up the IP addresses to take over.
>
> Any ideas?

I am not sure whether CARP has something to do with it, as I have quite a few RELENG_8 systems out there running on Alix boxes (2 and 3 port) as well as on Soekris 5501s. I think the 7.4 and 8.2 drivers for vr are essentially the same. That being said, there are some updates to the driver in RELENG_8. I am not sure whether they make any difference to your issue.

http://svnweb.freebsd.org/base?view=revision&revision=223681

MFC r223405:
  Remove link state change callback handler. There is no need to
  register both status change and link state change callbacks.
  Implement checking valid link in state change callback and poll
  active link state in vr_tick(). This allows immediate detection of
  lost link as well as protecting driver from frequent link flips
  during link renegotiation. taskq implementation was removed because
  driver now needs to poll link state in vr_tick().
  While I'm here do not report current link state if interface is not
  running.

>
>
> - ask
>

--
-------------------
Mike Tancsa, tel +1 519 651 3400
Sentex Communications, mike@sentex.net
Providing Internet services since 1994 www.sentex.net
Cambridge, Ontario Canada   http://www.tancsa.com/