From owner-freebsd-stable@FreeBSD.ORG Sat Mar 1 20:07:02 2008
Delivered-To: freebsd-stable@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id 84FBD1065675
	for ; Sat, 1 Mar 2008 20:07:02 +0000 (UTC)
	(envelope-from jfb@mr-happy.com)
Received: from vexbert.mr-paradox.net (vexbert.mr-paradox.net [208.4.93.28])
	by mx1.freebsd.org (Postfix) with ESMTP id 2E9FA8FC21
	for ; Sat, 1 Mar 2008 20:07:02 +0000 (UTC)
	(envelope-from jfb@mr-happy.com)
Received: from crow.mr-happy.com (crow.mr-happy.com [10.1.0.2])
	by vexbert.mr-paradox.net (Postfix) with ESMTP id EC0B384452
	for ; Sat, 1 Mar 2008 14:44:04 -0500 (EST)
Received: by crow.mr-happy.com (Postfix, from userid 16139)
	id 939EA5C67; Sat, 1 Mar 2008 14:44:04 -0500 (EST)
Date: Sat, 1 Mar 2008 14:44:04 -0500
From: Jeff Blank
To: freebsd-stable@freebsd.org
Message-ID: <20080301194404.GA1571@mr-happy.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
X-Virus-Scanned: ClamAV 0.92/6069/Sat Mar 1 14:26:21 2008 on vexbert.mr-paradox.net
X-Virus-Status: Clean
Subject: 7.0-STABLE amd64 kernel trap during boot-time device probe
X-BeenThere: freebsd-stable@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Production branch of FreeBSD source code
X-List-Received-Date: Sat, 01 Mar 2008 20:07:02 -0000

Hello,

I posted this around 3 months ago and never received a response. The
problem still occurs with 7.0-STABLE (csup on 20080301). I may have
incorrectly referred to it as a panic last time, when the problem was
really a trap. The only steps I've taken since I originally posted this
were to upgrade world/kernel a couple of times due to patches and to
perform an "upgrade" install from the 7.0-RELEASE CD, none of which had
any effect.

Can anyone please help me out with this?

thank you,
Jeff

----- Forwarded message from Jeff Blank -----

Date: Fri, 7 Dec 2007 00:05:57 -0500
From: Jeff Blank
To: freebsd-stable@freebsd.org
Subject: 7.0-BETA4 amd64 panic during boot-time device probe

I've upgraded my AMD64 box from RELENG_6 (csup on Nov. 30) to RELENG_7
(csup around 01:30 UTC Dec. 7) and am getting a kernel panic when I try
to boot with seemingly any one module specified in /boot/loader.conf
(XXX_load=YES). It seems to occur near the end of device probing, just
before it detects the disks. This panic does not happen if no modules
are specified to be loaded in /boot/loader.conf. There is also no panic
if I boot without loader.conf modules but then load the modules with
kldload. This problem was originally happening when I was attempting to
go from 6-STABLE to 7.0-BETA4, and rebuilding 7.0B4 under 7.0B4 yields
the same result.

Here is console output from the panic and partial dmesg output from the
successful boot (similar up to a point, some context included). I
couldn't get my serial port to accept input at the debugger prompt, and
my keyboard (USB) can't even "Press a key on the console to reboot" when
I have a non-ddb/kdb/etc kernel, so I couldn't do anything once I got
into the debugger. Hopefully what's below has some useful
information--if not, I'll be happy to try to get it.

On the subject of the kernel debugger, I used GENERIC plus

options DDB
options DDB_NUMSYM
options GDB
options KDB
options KDB_TRACE

and set hint.sio.0.flags="0x80" in /boot/device.hints. What am I
missing to allow serial input when the debugger starts?

thanks for any help,
Jeff

=== panic ===
GDB: debug ports: sio
GDB: current port: sio
KDB: debugger backends: ddb gdb
KDB: current backend: ddb
Copyright (c) 1992-2007 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 7.0-BETA4 #0: Thu Dec 6 23:35:34 EST 2007
    root@crow.mr-happy.com:/usr/obj/usr/src/sys/GENERIC_DBG
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(tm) 64 X2 Dual Core Processor 4400+ (2211.34-MHz K8-class CPU)
  Origin = "AuthenticAMD" Id = 0x20f32 Stepping = 2
  Features=0x178bfbff
  Features2=0x1
  AMD Features=0xe2500800
  AMD Features2=0x3
  Cores per package: 2
usable memory = 1060421632 (1011 MB)
avail memory = 1021755392 (974 MB)
ACPI APIC Table:
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID: 0
 cpu1 (AP): APIC ID: 1
ioapic0: Changing APIC ID to 2
ioapic0 irqs 0-23 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
acpi0: on motherboard
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
acpi0: reservation of 0, a0000 (3) failed
acpi0: reservation of 100000, 3fef0000 (3) failed
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
cpu0: on acpi0
powernow0: on cpu0
device_attach: powernow0 attach returned 6
cpu1: on acpi0
powernow1: on cpu1
device_attach: powernow1 attach returned 6
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
pci0: at device 0.0 (no driver attached)
isab0: at device 1.0 on pci0
isa0: on isab0
pci0: at device 1.1 (no driver attached)
ohci0: mem 0xdc004000-0xdc004fff irq 21 at device 2.0 on pci0
ohci0: [GIANT-LOCKED]
ohci0: [ITHREAD]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: on ohci0
usb0: USB revision 1.0
uhub0: on usb0
uhub0: 10 ports with 10 removable, self powered
ehci0: mem 0xfeb00000-0xfeb000ff irq 22 at device 2.1 on pci0
ehci0: [GIANT-LOCKED]
ehci0: [ITHREAD]
usb1: EHCI version 1.0
usb1: companion controller, 4 ports each: usb0
usb1: on ehci0
usb1: USB revision 2.0
uhub1: on usb1
uhub1: 10 ports with 10 removable, self powered
pcm0: port 0xdc00-0xdcff,0xe000-0xe0ff mem 0xdc003000-0xdc003fff irq 23 at device 4.0 on pci0
pcm0: [ITHREAD]
pcm0:
atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf000-0xf00f at device 6.0 on pci0
ata0: on atapci0
ata0: [ITHREAD]
ata1: on atapci0
ata1: [ITHREAD]
atapci1: port 0x9f0-0x9f7,0xbf0-0xbf3,0x970-0x977,0xb70-0xb73,0xd800-0xd80f mem 0xdc002000-0xdc002fff irq 21 at device 7.0 on pci0
atapci1: [ITHREAD]
ata2: on atapci1
ata2: [ITHREAD]
ata3: on atapci1
ata3: [ITHREAD]
atapci2: port 0x9e0-0x9e7,0xbe0-0xbe3,0x960-0x967,0xb60-0xb63,0xc400-0xc40f mem 0xdc001000-0xdc001fff irq 22 at device 8.0 on pci0
atapci2: [ITHREAD]
ata4: on atapci2
ata4: [ITHREAD]
ata5: on atapci2
ata5: [ITHREAD]
pcib1: at device 9.0 on pci0
pci5: on pcib1
sio0: configured irq 18 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: configured irq 18 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: <3COM PCI FaxModem> port 0xa000-0xa007 irq 18 at device 8.0 on pci5
sio0: moving to sio4
sio4: type 16550A
sio4: [FILTER]
fwohci0: mem 0xdb008000-0xdb0087ff,0xdb004000-0xdb007fff irq 16 at device 11.0 on pci5
fwohci0: [FILTER]
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channels is 4.
fwohci0: EUI64 00:11:d8:00:00:72:dc:3e
fwohci0: Phy 1394a available S400, 2 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: on fwohci0
dcons_crom0: on firewire0
dcons_crom0: bus_addr 0x9fe740
fwe0: on firewire0
if_fwe0: Fake Ethernet address: 02:11:d8:72:dc:3e
fwe0: Ethernet address: 02:11:d8:72:dc:3e
fwip0: on firewire0
fwip0: Firewire address: 00:11:d8:00:00:72:dc:3e @ 0xfffe00000000, S400, maxrec 2048
sbp0: on firewire0
fwohci0: Initiate bus reset
fwohci0: BUS reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
skc0: port 0xa400-0xa4ff mem 0xdb000000-0xdb003fff irq 17 at device 12.0 on pci5
skc0: Marvell Yukon Lite Gigabit Ethernet rev. (0x9)
sk0: on skc0
sk0: Ethernet address: 00:15:f2:1e:44:77
miibus0: on sk0
e1000phy0: PHY 0 on miibus0
e1000phy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX-FDX, auto
skc0: [ITHREAD]
nfe0: port 0xb000-0xb007 mem 0xdc000000-0xdc000fff irq 23 at device 10.0 on pci0
miibus1: on nfe0
e1000phy1: PHY 9 on miibus1
e1000phy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX-FDX, auto
nfe0: Ethernet address: 00:15:f2:1e:31:0c
nfe0: [FILTER]
pcib2: at device 11.0 on pci0
pci4: on pcib2
pcib3: at device 12.0 on pci0
pci3: on pcib3
pcib4: at device 13.0 on pci0
pci2: on pcib4
pcib5: at device 14.0 on pci0
pci1: on pcib5
vgapci0: port 0x9000-0x90ff mem 0xd0000000-0xd7ffffff,0xd9000000-0xd900ffff irq 18 at device 0.0 on pci1
vgapci1: mem 0xd9010000-0xd901ffff at device 0.1 on pci1
acpi_tz0: on acpi0
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
can't re-use a leaf (%desc)!
can't re-use a leaf (%driver)!
can't re-use a leaf (%location)!
can't re-use a leaf (%pnpinfo)!
can't re-use a leaf (%parent)!
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x90 on acpi0
sio0: type 16550A, console
sio0: [FILTER]
ppc0: port 0x378-0x37f,0x778-0x77b irq 7 drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/16 bytes threshold
ppbus0: on ppc0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
plip0: on ppbus0
ppc0: [GIANT-LOCKED]
ppc0: [ITHREAD]
orm0: at iomem 0xd0000-0xd3fff on isa0
atkbdc0: at port 0x60,0x64 on isa0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
ums0: on uhub0
ums0: 5 buttons and Z dir.
ukbd0: on uhub0
kbd2 at ukbd0
Timecounters tick evfirewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
ery 1.000 msec


Fatal trap 12: page fault while in kernel mode
cpuid = 0; apic id = 00
fault virtual address = 0x258
fault code = supervisor read data, page not present
instruction pointer = 0x8:0xffffffff8047aa7e
stack pointer = 0x10:0xffffffffa0677b40
frame pointer = 0x10:0xffffffffa0677b60
code segment = base 0x0, limit 0xfffff, type 0x1b
 = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 23 (irq21: ohci0+)
[thread pid 23 tid 100029 ]
Stopped at 0xffffffff8047aa7e = _mtx_lock_sleep+0x4e: movl 0x258(%rcx),%esi
db>
=== end panic ===

=== no panic ===
[...]
ums0: on uhub0
ums0: 5 buttons and Z dir.
ukbd0: on uhub0
kbd2 at ukbd0
Timecounters tick every 1.000 msec
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
acd0: DMA limited to UDMA33, device found non-ATA66 cable
acd0: DVDR at ata0-master UDMA33
ad4: 238475MB at ata2-master SATA300
ad8: 157066MB at ata4-master SATA300
ad10: 157066MB at ata5-master SATA300
ar0: 314133MB status: READY
ar0: disk0 READY using ad8 at ata4-master
ar0: disk1 READY using ad10 at ata5-master
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/ad4s1a
[continue successful boot]
=== end no panic ===

----- End forwarded message -----
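
One rough way to read the trap output above is to compare the reported fault virtual address with the memory operand of the instruction the debugger stopped at. The Python sketch below does that arithmetic. The helper name `implied_base` is hypothetical, and it assumes a plain AT&T-syntax `displacement(%base)` operand with no index register or scale. A result of 0 would be consistent with a NULL pointer being dereferenced inside `_mtx_lock_sleep`, but that is only an inference from the printed values, not a confirmed diagnosis.

```python
import re
from typing import Tuple


def implied_base(fault_addr: int, operand: str) -> Tuple[str, int]:
    """Return (register, implied value) for an operand like '0x258(%rcx)'.

    Assumption: the operand is a simple displacement(base) form, so the
    effective address is base + displacement and the base register must
    have held fault_addr - displacement when the fault occurred.
    """
    m = re.fullmatch(r"(-?0x[0-9a-fA-F]+)\(%(\w+)\)", operand)
    if m is None:
        raise ValueError(f"unsupported operand: {operand!r}")
    displacement = int(m.group(1), 16)
    register = m.group(2)
    return register, fault_addr - displacement


# Values copied from the trap output: "fault virtual address = 0x258"
# and "movl 0x258(%rcx),%esi".
register, value = implied_base(0x258, "0x258(%rcx)")
print(f"%{register} = {value:#x}")  # prints "%rcx = 0x0"
```

If serial input to the debugger can be made to work, the same check could be done directly at the db> prompt by inspecting the register contents instead of inferring them.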