From owner-freebsd-hackers@FreeBSD.ORG Thu Jan 4 15:27:58 2007 Return-Path: X-Original-To: freebsd-hackers@freebsd.org Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 324F316A412; Thu, 4 Jan 2007 15:27:58 +0000 (UTC) (envelope-from bsd@bsdhome.com) Received: from ms-smtp-04.southeast.rr.com (ms-smtp-04.southeast.rr.com [24.25.9.103]) by mx1.freebsd.org (Postfix) with ESMTP id E428013C455; Thu, 4 Jan 2007 15:27:57 +0000 (UTC) (envelope-from bsd@bsdhome.com) Received: from neutrino.bsdhome.com (cpe-071-070-208-236.nc.res.rr.com [71.70.208.236]) by ms-smtp-04.southeast.rr.com (8.13.6/8.13.6) with ESMTP id l04FRtNa014764; Thu, 4 Jan 2007 10:27:55 -0500 (EST) Received: from neutrino.bsdhome.com (localhost [127.0.0.1]) by neutrino.bsdhome.com (8.13.1/8.13.1) with ESMTP id l04FRscl094958; Thu, 4 Jan 2007 10:27:54 -0500 (EST) (envelope-from bsd@neutrino.bsdhome.com) Received: (from bsd@localhost) by neutrino.bsdhome.com (8.13.1/8.13.1/Submit) id l04FRsNA094957; Thu, 4 Jan 2007 10:27:54 -0500 (EST) (envelope-from bsd) Date: Thu, 4 Jan 2007 10:27:54 -0500 From: Brian Dean To: John Baldwin Message-ID: <20070104152754.GA94609@neutrino.bsdhome.com> References: <20061214190510.GA26590@neutrino.bsdhome.com> <552E24DE-C1D1-41B1-83D2-157F0A3E0449@bleepsoft.com> <200612272350.43680.jhb@freebsd.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200612272350.43680.jhb@freebsd.org> User-Agent: Mutt/1.5.11 X-Virus-Scanned: Symantec AntiVirus Scan Engine Cc: freebsd-hackers@freebsd.org, "R. Tyler Ballance" , Brian Dean Subject: Re: Kernel hang on 6.x X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 04 Jan 2007 15:27:58 -0000 On Wed, Dec 27, 2006 at 11:50:43PM -0500, John Baldwin wrote: > The 'traceall' seemed to miss several threads actually (like pid > 18). Can you get a 'ps'? Also, are you able to get a kernel dump > when this happens? I can't ps that particular session since it is no longer available, however I can reproduce another one and generate a new set of debug output. One note, the "swap_pager: indefinite wait buffer: ..." timeout message may have been a result of a misconfigured secondary swap file, so that might be a red herring. However, we can still reliably reproduce the hang with 32 Gig swap, but we don't get any console messages associated with it. The system is set up as a test system so I'm not under any pressure to get it rebooted and back up when it hangs, so I have the ability to take some time to debug it. I believe that I can generate a kernel dump. We tried this yesterday but didn't have a dump device configured. I think we've got that set up now and plan to generate a kernel dump. I'm assuming that since the process size and swap size is so large, that the dump size is going to be very large also, on the order of 32 Gig. I beleive I can host this on a server and make it accessible to you if you are willing to download it. -Brian