From owner-freebsd-stable@FreeBSD.ORG Wed Jun 29 23:12:32 2005 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 1B44216A487; Wed, 29 Jun 2005 23:12:32 +0000 (GMT) (envelope-from matt@atopia.net) Received: from neptune.atopia.net (neptune.atopia.net [209.128.231.90]) by mx1.FreeBSD.org (Postfix) with ESMTP id E481243D1F; Wed, 29 Jun 2005 23:12:29 +0000 (GMT) (envelope-from matt@atopia.net) Received: by neptune.atopia.net (Postfix, from userid 1001) id 6D85B413B; Wed, 29 Jun 2005 19:12:29 -0400 (EDT) Received: from localhost (localhost [127.0.0.1]) by neptune.atopia.net (Postfix) with ESMTP id 5AD115B; Wed, 29 Jun 2005 19:12:29 -0400 (EDT) Date: Wed, 29 Jun 2005 19:12:29 -0400 (EDT) From: Matt Juszczak To: Kris Kennaway In-Reply-To: <20050629100535.GC27557@xor.obsecurity.org> Message-ID: <20050629191215.I34468@neptune.atopia.net> References: <42BF8815.6090909@atopia.net> <20050627081933.GA97832@cell.sick.ru> <42C16394.4040904@atopia.net> <1119971279.36316.45.camel@buffy.york.ac.uk> <42C16C0E.9090002@atopia.net> <20050629100535.GC27557@xor.obsecurity.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: Gleb Smirnoff , freebsd-stable@freebsd.org Subject: Re: FreeBSD -STABLE servers repeatedly crashing. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 29 Jun 2005 23:12:32 -0000 On Wed, 29 Jun 2005, Kris Kennaway wrote: > On Tue, Jun 28, 2005 at 11:26:06AM -0400, Matt Juszczak wrote: > >>> OK, when it crashes next and is sat at the "db>" prompt, type "tr" and >>> press enter to get a trace. Copy this down (or have a serial console to >>> capture the output). Also, try typing "call doadump()" and see if that >>> succeeds in generating a crash dump. How were you trying to generate >>> one before? >>> >>> Gavin >>> >>> >> >> I can't type anything. The machine locks up. >> >> See: http://paste.atopia.net/126 >> >> After CPUID: 1, the machine locks cold and nothing else is printed to >> the screen. > > Try two things: > > 1) adding 'options KDB_STOP_NMI' to your kernel config. > > 2) If you still can't get it to break to DDB, then compile up a > debugging kernel, run kgdb on it (as described in the developers' > handbook), and list *(0xblah) where that address is the value of the > instruction pointer in the trap message (e.g. 0xc6644eff in your paste > above). That might at least be a start. > > Kris OK :) I'll try this next time it crashes. I actually disabled ipf a few nights ago and it hasn't crashed since... knock on wood.