From owner-freebsd-hackers@FreeBSD.ORG Wed Jan 10 23:06:24 2007
From: John Baldwin
To: "Brad L. Chisholm"
Date: Wed, 10 Jan 2007 17:53:24 -0500
References: <20070110215207.GA85834@bsdone.bsdwins.com>
In-Reply-To: <20070110215207.GA85834@bsdone.bsdwins.com>
Message-Id: <200701101753.24716.jhb@freebsd.org>
Cc: freebsd-hackers@freebsd.org
Subject: Re: Kernel hang on 6.x
List-Id: Technical Discussions relating to FreeBSD

On Wednesday 10 January 2007
16:52, Brad L. Chisholm wrote:
> On Thu, 4 Jan 2007 12:53:47 -0500, John Baldwin wrote:
> > On Thursday 04 January 2007 10:27, Brian Dean wrote:
> > >
> > > I believe that I can generate a kernel dump. We tried this yesterday
> > > but didn't have a dump device configured.
> >
> > If this is 6.x, turn on minidumps via the sysctl. The dump size normally is
> > the size of RAM. With minidumps it can be a lot smaller. If you get a dump,
> > let me know and I'll point you at some gdb scripts to generate 'ps' type
> > output, etc.
> >
>
> I work with Brian, and have been helping him analyze this problem. We have
> been able to generate kernel dumps, and have also done some additional
> analysis under ddb. Here is a summary of our analysis so far. Suggestions
> as to how to proceed from here are most welcome.

How much swap do you have? You might have run out of buckets in the swap_zone
before you ran out of swap space, in which case the kernel deadlocks rather
than killing the hog like it does when it runs out of swap space. I added a
printf to catch this on HEAD recently that will be MFC'd soonish. You can try
bumping up kern.maxswzone (loader tunable).

-- 
John Baldwin