From owner-freebsd-current@FreeBSD.ORG Tue Mar 27 10:42:27 2007 Return-Path: X-Original-To: freebsd-current@freebsd.org Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id CE65016A400; Tue, 27 Mar 2007 10:42:27 +0000 (UTC) (envelope-from rwatson@FreeBSD.org) Received: from cyrus.watson.org (cyrus.watson.org [209.31.154.42]) by mx1.freebsd.org (Postfix) with ESMTP id A050E13C45B; Tue, 27 Mar 2007 10:42:27 +0000 (UTC) (envelope-from rwatson@FreeBSD.org) Received: from fledge.watson.org (fledge.watson.org [209.31.154.41]) by cyrus.watson.org (Postfix) with ESMTP id A8FEC472DB; Tue, 27 Mar 2007 05:42:26 -0500 (EST) Date: Tue, 27 Mar 2007 11:42:26 +0100 (BST) From: Robert Watson X-X-Sender: robert@fledge.watson.org To: Andre Oppermann In-Reply-To: <4608DC71.9080001@freebsd.org> Message-ID: <20070327114125.V42335@fledge.watson.org> References: <20070324124732.GA767@nagual.pp.ru> <200703251348.58972.nb_root@videotron.ca> <20070325194946.GC79938@kobe.laptop> <200703251620.20879.nb_root@videotron.ca> <20070325202749.GA1503@kobe.laptop> <460705AE.5040107@freebsd.org> <20070327045252.GA3256@nagual.pp.ru> <46099675.3040609@u.washington.edu> <20070327052810.GA772@nagual.pp.ru> <20070327054501.GA1026@nagual.pp.ru> <4608DC71.9080001@freebsd.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: Garrett Cooper , freebsd-current@freebsd.org Subject: Re: Latest -current complete lockup (tcp changes?) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 27 Mar 2007 10:42:27 -0000 On Tue, 27 Mar 2007, Andre Oppermann wrote: > Andrey Chernov wrote: >> On Tue, Mar 27, 2007 at 09:28:10AM +0400, Andrey Chernov wrote: >>> On Tue, Mar 27, 2007 at 10:11:01PM +0000, Garrett Cooper wrote: >>>>> The problem is deeper than that ((( >>>>> I still got the same lockup, just with more net activity. >>>>> I even try to completely disable sack, with the same result, so probem >>>>> is somewhere else. Last working kernel still from Mar 22. >>>> I'll give a CVSup / upgrade a try and see what happens. >>> Additional non-default details from my machine which may (or may not) >>> affect the thing: >> >> Yet one detail about lockup: external pings to the machine works in the >> lockup situation, but no any TCP services is available. > > I can only think of a TCP_INFO_LOCK() leak here blocking any further > progress. When I'm back in the office in about an hour I'll prepare a test > patch. If the tcbinfo lock were being leaked, then the netisr or ithread would remain permanently wedged preventing any IP services from working, and rapidly hosing the machine as the ithread wouldn't be doing its thing. I think another sort of problem is more likely. Robert N M Watson Computer Laboratory University of Cambridge