From owner-freebsd-stable@FreeBSD.ORG Fri Apr 25 15:44:55 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0181D1065675 for ; Fri, 25 Apr 2008 15:44:55 +0000 (UTC) (envelope-from gavin.atkinson@ury.york.ac.uk) Received: from mail-gw2.york.ac.uk (mail-gw2.york.ac.uk [144.32.128.247]) by mx1.freebsd.org (Postfix) with ESMTP id 6C8358FC22 for ; Fri, 25 Apr 2008 15:44:54 +0000 (UTC) (envelope-from gavin.atkinson@ury.york.ac.uk) Received: from mail-gw6.york.ac.uk (mail-gw6.york.ac.uk [144.32.129.26]) by mail-gw2.york.ac.uk (8.13.6/8.13.6) with ESMTP id m3PFinYi005735; Fri, 25 Apr 2008 16:44:49 +0100 (BST) Received: from buffy-128.york.ac.uk ([144.32.128.160] helo=buffy.york.ac.uk) by mail-gw6.york.ac.uk with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1JpQ6T-0000zH-SB; Fri, 25 Apr 2008 16:44:49 +0100 Received: from buffy.york.ac.uk (localhost [127.0.0.1]) by buffy.york.ac.uk (8.14.2/8.14.2) with ESMTP id m3PFineo050199; Fri, 25 Apr 2008 16:44:49 +0100 (BST) (envelope-from gavin.atkinson@ury.york.ac.uk) Received: (from ga9@localhost) by buffy.york.ac.uk (8.14.2/8.14.2/Submit) id m3PFimGr050198; Fri, 25 Apr 2008 16:44:48 +0100 (BST) (envelope-from gavin.atkinson@ury.york.ac.uk) X-Authentication-Warning: buffy.york.ac.uk: ga9 set sender to gavin.atkinson@ury.york.ac.uk using -f From: Gavin Atkinson To: Ryan Lackey In-Reply-To: <5bfab4bc0804211223w34d41326g7350c51da35a1721@mail.gmail.com> References: <5bfab4bc0804211223w34d41326g7350c51da35a1721@mail.gmail.com> Content-Type: text/plain Content-Transfer-Encoding: 7bit Date: Fri, 25 Apr 2008 16:44:48 +0100 Message-Id: <1209138288.48908.14.camel@buffy.york.ac.uk> Mime-Version: 1.0 X-Mailer: Evolution 2.10.1 FreeBSD GNOME Team Port X-York-MailScanner: Found to be clean X-York-MailScanner-From: gavin.atkinson@ury.york.ac.uk Cc: freebsd-stable@freebsd.org Subject: Re: HP DL145G2 boot problems with 7.0 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 25 Apr 2008 15:44:55 -0000 On Mon, 2008-04-21 at 12:23 -0700, Ryan Lackey wrote: > I have 4 HP DL145G2 boxes (dual opteron). I recently upgrded them from > 6.2-STABLE to 7.0-RELEASE using cvsup on each, compiling world + kernel. > > The upgrade was (fairly painless on two machines, and has broken the other > two machines. > > On boot, the broken machines hang somewhere after kbdc and psm, and before > serial driver initialization. > > Specifically, I see: > psm0: unable to allocate IRQ > psmcpnp0: PS/2 mouse port irq 12 on acpi0 > pm0: ps2 mouse irq 12 on atkbdc0 > ioapic0: routing intpin 12 (ISA IRQ 12) to vector 57 > psm0: giant-locked > psm0: thread > psm0: model intellimouse explorer.... > psm0: config; 0000000000 flags: 000000008, packet size:4 > psm0: syncmask:08, syncbits:00 > > After that, it hangs. > > I'm using a 7.0 world, mergemastered, and a kernel compiled on the broken > system, and also a kernel compiled on one of the working systems and copied > over, with the same config. > > (attached) > > dmesg from one of the working systems also attached -- same config modulo > RAM and CPU speed. > > If I boot using the 6.2 kernel and 7.0 userland, I can ssh in and some > things work, but others don't. That's unsurprising really, and is exactly why the upgrade instructions say to reboot with the new kernel before installing the new world. Doing that would have caught this problem before it was too late. > I'm using an IPKVM vs. serial console, so this is slightly more difficult to > debug, plus I have to ask someone > to reboot the machine, vs. using a power cycler. > > Any help in debugging this would be most appreciated. OK, recompile the kernel, adding the following options: options KDB options DDB When the machine hangs, hit Ctrl-Alt-Escape, this may or may not drp you into the debugger, depending on exactly how the machine is hanging. (if it doesn't, your only option may well be to try to get serial access to the machine) Assuming it does get you into the debugger, send the output of "bt" to the list. With that, hopefully there will be enough information to diagnose this issue. Thanks, Gavin