From owner-freebsd-stable@FreeBSD.ORG Thu Dec 16 09:50:54 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 09A0E106564A for ; Thu, 16 Dec 2010 09:50:54 +0000 (UTC) (envelope-from avg@freebsd.org) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id 4E5AE8FC13 for ; Thu, 16 Dec 2010 09:50:52 +0000 (UTC) Received: from porto.topspin.kiev.ua (porto-e.starpoint.kiev.ua [212.40.38.100]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id LAA01485; Thu, 16 Dec 2010 11:50:45 +0200 (EET) (envelope-from avg@freebsd.org) Received: from localhost.topspin.kiev.ua ([127.0.0.1]) by porto.topspin.kiev.ua with esmtp (Exim 4.34 (FreeBSD)) id 1PTAU0-00015X-QO; Thu, 16 Dec 2010 11:50:44 +0200 Message-ID: <4D09E0F3.5040302@freebsd.org> Date: Thu, 16 Dec 2010 11:50:43 +0200 From: Andriy Gapon User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US; rv:1.9.2.13) Gecko/20101211 Lightning/1.0b2 Thunderbird/3.1.7 MIME-Version: 1.0 To: perryh@pluto.rain.com References: <4cfc72a5.3nAjkv8mdrO/NrKQ%perryh@pluto.rain.com> <4CFD0633.9060509@freebsd.org> <4d089a74.vwMJkPEIddt7PIxy%perryh@pluto.rain.com> <4D08AACB.6060306@freebsd.org> <4d09dd2e.iVncbZ/gHBXX0WdL%perryh@pluto.rain.com> In-Reply-To: <4d09dd2e.iVncbZ/gHBXX0WdL%perryh@pluto.rain.com> X-Enigmail-Version: 1.1.2 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: How to debug a double fault? (Re: Could MSGBUF_SIZE be made a loader tunable?) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 16 Dec 2010 09:50:54 -0000 on 16/12/2010 11:34 perryh@pluto.rain.com said the following: > Andriy Gapon wrote: >> on 15/12/2010 12:37 perryh@pluto.rain.com said the following: >>> Fatal double fault: >>> eip = 0xc07feb98 >>> esp = 0xc101e000 >>> ebp = 0xc101e004 >>> cpuid = 0; apic id = 00 >>> panic: double fault >>> cpuid = 0 >>> >>> How do I go about tracking this down? >> >> Do you have the standard debugging options in your kernel? > > No, it is 8.1-RELEASE GENERIC with only the name changed and the > (first attempt) msgbufsize patches applied. I was trying to > minimize changes to GENERIC, so as to minimize the opportunity > to screw something up, and I had this silly idea that something > this simple might "just work." > > It does occur to me to wonder whether any debugger would be > functional this early, before even the first line of the signon > message has been displayed. Is it possible, given the loader > messages, to come up with a base address which could be used to > compare the eip value with the kernel symbol table? Granted this > won't provide a traceback, but even knowing in which function it > crashed would be a start. You can research this approach, but I would just add KDB+DDB and get a stack trace without sweat. >> BTW, are you sure that you correctly placed initialization of >> msgbufsize ? > > I am not at all sure of that, and am not sufficiently familiar with > the sequence of events early in intiialization to know how to find > out -- although I suppose the observed crash might not be altogether > surprising if the kernel message buffer got allocated with a zero > size :( > > Apart from the name, msgbufsize is set up in exactly the same > way and place -- in init_param1() -- as maxswzone and maxbcache. > Perhaps that is not early enough; any idea what would be a better > example? I don't see any connection between msgbufsize and maxswzone, so I also don't know if that place is early enough. Just try to initialize the variable where it's defined and use TUNABLE_LONG. -- Andriy Gapon