From owner-freebsd-current@FreeBSD.ORG Sun Oct 12 11:55:49 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id E6EFD16A4B3 for ; Sun, 12 Oct 2003 11:55:49 -0700 (PDT) Received: from ebb.errno.com (ebb.errno.com [66.127.85.87]) by mx1.FreeBSD.org (Postfix) with ESMTP id EB20B43FCB for ; Sun, 12 Oct 2003 11:55:47 -0700 (PDT) (envelope-from sam@errno.com) Received: from 66.127.85.91 ([66.127.85.91]) (authenticated bits=0) by ebb.errno.com (8.12.9/8.12.9) with ESMTP id h9CItj0x046542 (version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=NO); Sun, 12 Oct 2003 11:55:46 -0700 (PDT) (envelope-from sam@errno.com) From: Sam Leffler Organization: Errno Consulting To: Andre Guibert de Bruet , Josef Karthauser Date: Sun, 12 Oct 2003 11:56:53 -0700 User-Agent: KMail/1.5.3 References: <20031012124207.GA1530@genius.tao.org.uk> <20031012142234.GA2095@genius.tao.org.uk> <20031012140147.V26654@alpha.siliconlandmark.com> In-Reply-To: <20031012140147.V26654@alpha.siliconlandmark.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200310121156.53425.sam@errno.com> cc: current@freebsd.org Subject: Re: What's up with the IP stack? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 12 Oct 2003 18:55:50 -0000 On Sunday 12 October 2003 11:03 am, Andre Guibert de Bruet wrote: > On Sun, 12 Oct 2003, Josef Karthauser wrote: > > On Sun, Oct 12, 2003 at 02:48:01PM +0200, Soren Schmidt wrote: > > > It seems Josef Karthauser wrote: > > > > I've just built and installed a new kernel, the first since Aug 6th. > > > > There appears to be a problem with the IP stack. What happens is > > > > that everything is fine for a few hours, and then the IP stack stops > > > > working. I can no longer ping anything on the local network, my > > > > default route drops out (which is probably dhclient's doing). > > > > Perhaps it is ARP that is broken, it's hard to tell. All I know is > > > > that I need to reboot to make it work again. > > > > > > > > Is anyone else experiencing this kind of problem? > > > > > > Do you have dummynet included in the kernel ? > > > That has been broken for me since sam's latest commit as a backout > > > of ip_dummynet.c fixes the problem for me... > > > > No, I've not got dummynet in there. My current kernel config is: > > I experienced this a week ago. I found that ifconfig'ing the interface > down and back up again "fixed" the problem. I've since reverted to a > kernel compiled on September 25th. It would be good to know more details; I still don't have much to go on. Try to identify, for example, if the problem is specific to a particular device/interface or feature you're using (e.g dummynet). If you have ddb in your system, then when the system gets into a bad state break into the debugger and look for threads that are blocked on locks. If you have witness in your kernel then show locks would also be useful. If you don't have witness in your system then rebuild your kernel with it. The most recent round of changes were to lock the routing table. These went in 10/3 and were extensive. They could easily be the problem but w/o more info I can't really help. Sam