From owner-freebsd-hackers@FreeBSD.ORG Thu Dec 28 06:22:18 2006 Return-Path: X-Original-To: freebsd-hackers@freebsd.org Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 0ECE316A403 for ; Thu, 28 Dec 2006 06:22:18 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (66-23-211-162.clients.speedfactory.net [66.23.211.162]) by mx1.freebsd.org (Postfix) with ESMTP id ACFF813C475 for ; Thu, 28 Dec 2006 06:22:17 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from zion.baldwin.cx (zion.baldwin.cx [192.168.0.7]) (authenticated bits=0) by server.baldwin.cx (8.13.6/8.13.6) with ESMTP id kBS5gZL0061275; Thu, 28 Dec 2006 00:42:37 -0500 (EST) (envelope-from jhb@freebsd.org) From: John Baldwin To: freebsd-hackers@freebsd.org Date: Wed, 27 Dec 2006 23:50:43 -0500 User-Agent: KMail/1.9.4 References: <20061214190510.GA26590@neutrino.bsdhome.com> <552E24DE-C1D1-41B1-83D2-157F0A3E0449@bleepsoft.com> In-Reply-To: <552E24DE-C1D1-41B1-83D2-157F0A3E0449@bleepsoft.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200612272350.43680.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-2.0.2 (server.baldwin.cx [192.168.0.1]); Thu, 28 Dec 2006 00:42:39 -0500 (EST) X-Virus-Scanned: ClamAV 0.88.3/2386/Wed Dec 27 13:32:31 2006 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=4.2 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.1.3 X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on server.baldwin.cx Cc: "R. Tyler Ballance" , Brian Dean Subject: Re: Kernel hang on 6.x X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 28 Dec 2006 06:22:18 -0000 On Thursday 14 December 2006 16:06, R. Tyler Ballance wrote: > > On Dec 14, 2006, at 1:05 PM, Brian Dean wrote: > > > Hi, > > > > We're experiencing a kernel hang on a 6.x quad processor Sun amd64 > > based system. We are able to reproduce it fairly reliably, but the > > environment to do so is not easily replicatable so I cannot provide a > > simple test case. However, I have been able to build a debug kernel > > and when the system "hangs", I can break to the debugger prompt. But > > once there, I'm not sure what to do to isolate where the system is > > hung up. I have confirmed that the hang occurs in both SMP and > > uniprocessor mode. Here are some system details: > > > I think you'll need to ship this machine to my house for further > um....erm, diagnostics, yes, that's it ;) > > > On a more serious topic, can you paste the output from: > > > ddb> show pcpu > ddb>allpcpu > ddb>traceall > ddb>show alllocks > ddb>show lockedvnods > > Just curious as to whether those would show more info, because you're > right, that trace is about as informative as new printer paper :) The 'traceall' seemed to miss several threads actually (like pid 18). Can you get a 'ps'? Also, are you able to get a kernel dump when this happens? -- John Baldwin