From owner-freebsd-current@FreeBSD.ORG Thu Dec 11 17:15:29 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id D3C7016A4CE for ; Thu, 11 Dec 2003 17:15:29 -0800 (PST) Received: from rootlabs.com (root.org [67.118.192.226]) by mx1.FreeBSD.org (Postfix) with SMTP id A5AF743D09 for ; Thu, 11 Dec 2003 17:15:28 -0800 (PST) (envelope-from nate@rootlabs.com) Received: (qmail 51424 invoked by uid 1000); 12 Dec 2003 01:15:30 -0000 Date: Thu, 11 Dec 2003 17:15:30 -0800 (PST) From: Nate Lawson To: "Georg-W. Koltermann" In-Reply-To: <1071187926.1371.80.camel@hunter.muc.eu.mscsoftware.com> Message-ID: <20031211170630.V51376@root.org> References: <1071183306.1371.17.camel@hunter.muc.eu.mscsoftware.com> <1071187926.1371.80.camel@hunter.muc.eu.mscsoftware.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: freebsd-current@freebsd.org cc: acpi-jp@jp.FreeBSD.org Subject: Re: [acpi-jp 2879] 5.1R: page fault from AcpiNsMapHandleToNode X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 12 Dec 2003 01:15:29 -0000 On Fri, 12 Dec 2003, Georg-W. Koltermann wrote: > On Fr, 2003-12-12 at 00:29, Nate Lawson wrote: > > Please try 5.2-current if you can. However, I'm not sure this is fixed > > in it. More info is needed. > > Will try one of the RC isos soon anyway, but given that it takes some > hours to trigger I am not sure how much I'll get from that. > > > Parent pointer is invalid, as you show. The panic was triggered by an > > attempt to turn the fan on. > > I guessed that. > > > You should be able to trigger it quickly by > > turning off CPU idling (sysctl machdep.cpu_idle_hlt=0) and doing something > > CPU intensive (dd if=/dev/random of=/dev/null bs=1m). > > Notice that the fan turns on and off just fine during normal work, I use > this machine as a swdev system and it does get loaded during builds. > Didn't try turning off cpu idling yet, though. Hmm the fan briefly > turned off (???) for 10 secs when I started bzipping the dump, then went > back on, then switched to high. This is on AC where it usually keeps > running on low speed even if idle. It turns off when I run on battery. There are different ways of turning on the fan. acpi_thermal switches them on based on the _ACx objects, which is what caused the panic. So the fan being turned on by the BIOS or through some other mechanism won't trigger the fault. We can also lower the value at which acpi_thermal does the switch or just manually turn it on. Output from sysctl hw.acpi would help. > > Let's get some more > > debugging info. > > > > Please send acpidump -t -d > georg-CpqEvo800w.asl > > acpidump doesn't like these switches. I'll attach the raw acpi dump from > acpidump -o gwk.aml below, uuencoded. Oh, for 5.1 it's acpidump > georg-CpqEvo800w.asl > > Please send me a link to the bzip2 kernel.debug and dump you have from > > above. > > vmcore.bz2 is 336M, will put it on some server at work tomorrow. I > don't have that much space from my private ISP. Yes, 1GB RAM does have > a drawback! Don't bother, that's too big. I thought we have support for sparse kernel dumps? If not, that's a big todo. > > Also, please recompile your acpi.ko with ACPI_DEBUG, then set these in > > sysctl after your system has booted: > > debug.acpi.layer="ACPI_THERMAL ACPI_NAMESPACE" > > debug.acpi.level="ACPI_LV_VALUES ACPI_LV_OBJECTS" > > hw.acpi.verbose="1" > > > > Then trigger the panic as described above. You should get some helpful > > dmesg output. > > I'll try tomorrow (getting late here), but might be difficult to > trigger. Ok. You can manually turn on the active cooling with: sysctl hw.acpi.thermal.active=0 (or 1,2,3, etc.) This is another way to trigger the panic. -Nate