From owner-freebsd-stable@FreeBSD.ORG Thu Sep 21 21:01:43 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 9413A16A415 for ; Thu, 21 Sep 2006 21:01:43 +0000 (UTC) (envelope-from mail@maxlor.com) Received: from popeye1.ggamaur.net (popeye1.ggamaur.net [213.160.40.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id 56DED43D66 for ; Thu, 21 Sep 2006 21:01:28 +0000 (GMT) (envelope-from mail@maxlor.com) Received: from maxlor.mine.nu (maxlor@c-213-160-32-54.customer.ggaweb.ch [213.160.32.54]) by popeye1.ggamaur.net (8.13.7/8.13.7/Submit) with ESMTP id k8LL1PQq028906 for ; Thu, 21 Sep 2006 23:01:26 +0200 (CEST) (envelope-from mail@maxlor.com) Received: from localhost (unknown [127.0.0.1]) by maxlor.mine.nu (Postfix) with ESMTP id 44FDE2E12D for ; Thu, 21 Sep 2006 23:01:25 +0200 (CEST) X-Virus-Scanned: amavisd-new at atlantis.intranet Received: from maxlor.mine.nu ([127.0.0.1]) by localhost (atlantis.intranet [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3BW4Gr-7aeql for ; Thu, 21 Sep 2006 23:01:24 +0200 (CEST) Received: from mini.intranet (mini.intranet [10.0.0.17]) by maxlor.mine.nu (Postfix) with ESMTP id DCBCD2E128 for ; Thu, 21 Sep 2006 23:01:24 +0200 (CEST) From: Benjamin Lutz To: freebsd-stable@freebsd.org Date: Thu, 21 Sep 2006 23:01:18 +0200 User-Agent: KMail/1.9.1 References: <200609192217.44712.mail@maxlor.com> In-Reply-To: <200609192217.44712.mail@maxlor.com> X-Face: $Ov27?7*N,h60fIEfNJdb!m,@#4T/d; 1hw|W0zvsHM(a$Yn6BYQ0^SEEXvi8>D`|V*F"=?iso-8859-1?q?=5F+R=0A?= 2@Aq>+mNb4`,'[[%z9v0Fa~]AD1}xQO3|>b.z&}l#R-_(P`?@Mz"kS; XC>Eti,i3>%@g?4f,=?iso-8859-1?q?=5Cc7=7CGh=0A?= =?iso-8859-1?q?_wb=26ky=24b2PJ=5E=5C0b83NkLsFKv=7CsmL/cI4UD=25Tu8alAD?= MIME-Version: 1.0 Content-Type: multipart/signed; boundary="nextPart1390711.J0h2XYPAOn"; protocol="application/pgp-signature"; micalg=pgp-sha1 Content-Transfer-Encoding: 7bit Message-Id: <200609212301.24257.mail@maxlor.com> X-Scanned-By: MIMEDefang 2.57 on 213.160.40.60 Subject: Re: Kernel panic on boot (how to debug?) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 21 Sep 2006 21:01:43 -0000 --nextPart1390711.J0h2XYPAOn Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Hello, I just updated to FreeBSD 6.2-PRERELEASE/amd64 as of 2 hours ago. I'm seein= g=20 the same behaviour with both the panic and the weird device name.=20 By sheer luck I figured out how to get into DDB though :), so I can now=20 provide a backtrace. Fatal trap 18: integer divide fault while in kernel mode instruction pointer =3D 0x8:0xffffffff801c02d5 stack pointer =3D 0x10:0xffffffff80864a80 frame pointer =3D 0x10:0xffffffff80864ad0 code segment =3D base 0x0, limit 0xfffff, type 0x1b =3D DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags =3D interrupt enabled, resume, IOPL =3D 0 current process =3D 0 (swapper) trap number =3D 18 panic: integer divide fault Uptime: 1s Cannot dump. No dump device defined. Automatic reboot in 15 seconds - press a key on the console to abort --> Press a key on the console to reboot, --> or switch off the system now. KDB: enter: manual escape to debugger [thread pid 0 tid 0 ] Stopped at kdb_enter+0x31: leave db> bt Tracing pid 0 tid 0 td 0xffffffff806ee240 kdb_enter() at kdb_enter+0x31 scgetc() at scgetc+0x57e sccngetch() at sccngetch+0xf2 sccncheckc() at sccncheckc+0xe cncheckc() at cncheckc+0x3c cngetc() at cngetc+0x17 shutdown_panic() at shutdown_panic+0x79 boot() at boot+0x59a panic() at panic+0x1f9 trap_fatal() at trap_fatal+0x378 trap() at trap+0x41e calltrap() at calltrap+0x5 --- trap 0x12, rip =3D 0xffffffff801c02d5, rsp =3D 0xffffffff80864a80, rb= p =3D 0xffffffff80864ad0 --- ata_raid_promise_read_meta() at ata_raid_promise_read_meta+0x95 ata_raid_read_metadata() at ata_raid_read_metadata+0x2f4 ata_raid_subdisk_attach() at ata_raid_subdisk_attach+0x42 device_attach() at device_attach+0x5a device_probe_and_attach() at device_probe_and_attach+0xde bus_generic_attach() at bus_generic_attach+0x1a ad_attach() at ad_attach+0x285 device_attach() at device_attach+0x5a device_probe_and_attach() at device_probe_and_attach+0xde bus_generic_attach() at bus_generic_attach+0x1a ata_identify() at ata_identify+0x201 ata_boot_attach() at ata_boot_attach+0x6b run_interrupt_driven_config_hooks() at run_interrupt_driven_config_hooks+0xa7 mi_startup() at mi_startup+0xb6 btext() at btext+0x2c While the Mainboard (MSI K8N Neo2 Platinum) does have RAID support, it is=20 disabled in the BIOS, and the Controller is from nVidia, not Promise, afaik. Please let me know what other info I can provide.=20 The previous mail is quoted below for reference. Cheers Benjamin On Tuesday 19 September 2006 22:17, Benjamin Lutz wrote: > Since I added a new P-ATA disk to my computer, FreeBSD 6.1/amd64 panics > while booting: > > [...] > acd0: DVDROM at ata0-master UDMA33 > acd1: DVDR at ata0-slave UDMA33 > ad2: 78533MB at ata1-master UDMA100 > > > Fatal trap 18: integer divide fault while in kernel mode > instruction pointer =3D 0x8:0xffffffff801bd9a5 > stack pointer =3D 0x10:0xffffffff80845a90 > frame pointer =3D 0x10:0xffffffff80845ae0 > code segment =3D base 0x0, limit 0xfffff, type 0x1b > =3D DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags =3D interrupt enabled, resume, IOPL =3D 0 > current process =3D 0 (swapper) > trap number =3D 18 > panic: integer divide fault > Uptime: 1s > Cannot dump. No dump device defined. > Automatic reboot in 15 seconds - press a key on the console to abort > > If I just disable the second IDE channel, the system boots fine. FreeBSD > 5.5/i386, which is also installed on this computer, boots fine with or > without that disk. > > I've been trying to get into the kernel debugger to get a backtrace, but > without luck. I've hooked up a serial console, but I just can't get into > DDB. I've tried various option (adding debug.debugger_on_panic=3D"1" > to /boot/loader.conf, the BREAK_TO_DEBUGGER and KDB_UNATTENDED kernel > options as well as specifying "boot -d" at the boot loader - without any > effect. Since the panic happens before init is started, my dump device > configuration in /etc/rc.conf isn't read either. > > How can I get meaningful debugging data out of this system? > > Oh, btw, notice the weird device name string for acd0. It's supposed to s= ay > , and in the BIOS as well > as under FreeBSD 5.5/i386 it does. The erroneous strings shows up on > FreeBSD 6.1 with or without the second IDE channel being enabled. > > Any idea what might cause this? > > The full boot log follows bellow. > > Cheers > Benjamin > > > KDB: debugger backends: ddb > KDB: current backend: ddb > Copyright (c) 1992-2006 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD 6.1-RELEASE-p7 #4: Tue Sep 19 21:56:53 CEST 2006 > root@merlin.intranet:/usr/obj/usr/src/sys/MERLIN64 > Timecounter "i8254" frequency 1193182 Hz quality 0 > CPU: AMD Athlon(tm) 64 Processor 3500+ (2210.09-MHz K8-class CPU) > Origin =3D "AuthenticAMD" Id =3D 0xff0 Stepping =3D 0 > > Features=3D0x78bfbff,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2> AMD > Features=3D0xe0500800 > real memory =3D 1073676288 (1023 MB) > avail memory =3D 1026371584 (978 MB) > ACPI APIC Table: > ioapic0 irqs 0-23 on motherboard > kbd1 at kbdmux0 > acpi0: on motherboard > acpi0: Power Button (fixed) > Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0 > cpu0: on acpi0 > powernow0: on cpu0 > acpi_button0: on acpi0 > pcib0: port 0xcf8-0xcff,0xcf0-0xcf3 on acpi0 > pci0: on pcib0 > agp0: mem 0xf0000000-0xf7ffffff at > device 0.0 on pci0 > isab0: at device 1.0 on pci0 > isa0: on isab0 > pci0: at device 1.1 (no driver attached) > ohci0: mem 0xfe02f000-0xfe02ffff irq 21 at > device 2.0 on pci0 > ohci0: [GIANT-LOCKED] > usb0: OHCI version 1.0, legacy support > usb0: SMM does not respond, resetting > usb0: on ohci0 > usb0: USB revision 1.0 > uhub0: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub0: 4 ports with 4 removable, self powered > ohci1: mem 0xfe02e000-0xfe02efff irq 22 at > device 2.1 on pci0 > ohci1: [GIANT-LOCKED] > usb1: OHCI version 1.0, legacy support > usb1: SMM does not respond, resetting > usb1: on ohci1 > usb1: USB revision 1.0 > uhub1: nVidia OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub1: 4 ports with 4 removable, self powered > ehci0: mem 0xfe02d000-0xfe02d0ff > irq 23 at device 2.2 on pci0 > ehci0: [GIANT-LOCKED] > usb2: EHCI version 1.0 > usb2: companion controllers, 4 ports each: usb0 usb1 > usb2: on ehci0 > usb2: USB revision 2.0 > uhub2: nVidia EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 > uhub2: 8 ports with 8 removable, self powered > uhub3: vendor 0x0409 product 0x013e, class 9/0, rev 2.00/0.07, addr 2 > uhub3: multiple transaction translators > uhub3: 4 ports with 4 removable, self powered > pci0: at device 6.0 (no driver attached) > atapci0: port > 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xe000-0xe00f at device 8.0 on pci0 > ata0: on atapci0 > ata1: on atapci0 > atapci1: port > 0x9e0-0x9e7,0xbe0-0xbe3,0x960-0x967,0xb60-0xb63,0xcc00-0xcc0f,0xc800-0xc8= 7f > irq 22 at device 9.0 on pci0 > ata2: on atapci1 > ata3: on atapci1 > atapci2: port > 0x9f0-0x9f7,0xbf0-0xbf3,0x970-0x977,0xb70-0xb73,0xb400-0xb40f,0xb000-0xb0= 7f > irq 23 at device 10.0 on pci0 > ata4: on atapci2 > ata5: on atapci2 > pcib1: at device 11.0 on pci0 > pci1: on pcib1 > pci1: at device 0.0 (no driver attached) > pcib2: at device 14.0 on pci0 > pci2: on pcib2 > pcm0: port 0x9c00-0x9c1f irq 18 at device 8.0 on pci2 > pcm0: > pci2: at device 9.0 (no driver attached) > pci2: at device 9.1 (no driver attached) > fwohci0: port 0x9400-0x947f mem > 0xfdfff000-0xfdfff7ff irq 19 at device 12.0 on pci2 > fwohci0: OHCI version 1.0 (ROM=3D1) > fwohci0: No. of Isochronous channels is 4. > fwohci0: EUI64 00:10:dc:00:00:7a:42:db > fwohci0: Phy 1394a available S400, 3 ports. > fwohci0: Link S400, max_rec 2048 bytes. > firewire0: on fwohci0 > fwe0: on firewire0 > if_fwe0: Fake Ethernet address: 02:10:dc:7a:42:db > fwe0: Ethernet address: 02:10:dc:7a:42:db > fwe0: if_start running deferred for Giant > sbp0: on firewire0 > fwohci0: Initiate bus reset > fwohci0: node_id=3D0xc800ffc0, gen=3D1, CYCLEMASTER mode > firewire0: 1 nodes, maxhop <=3D 0, cable IRM =3D 0 (me) > firewire0: bus manager 0 (me) > re0: port 0x9000-0x90ff mem > 0xfdffe000-0xfdffe0ff irq 16 at device 13.0 on pci2 > miibus0: on re0 > rgephy0: on miibus0 > rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, > 1000baseTX-FDX, auto > re0: Ethernet address: 00:11:09:65:fc:0e > fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acp= i0 > fdc0: [FAST] > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on > acpi0 sio0: type 16550A, console > ppc0: port 0x378-0x37f,0x778-0x77b irq 7 > on acpi0 > ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode > ppbus0: on ppc0 > lpt0: on ppbus0 > lpt0: Interrupt-driven port > ppi0: on ppbus0 > atkbdc0: port 0x60,0x64 irq 1 on acpi0 > atkbd0: flags 0x1 irq 1 on atkbdc0 > kbd0 at atkbd0 > atkbd0: [GIANT-LOCKED] > orm0: at iomem 0xc0000-0xce7ff,0xd0000-0xd3fff on isa0 > sc0: at flags 0x100 on isa0 > sc0: VGA <16 virtual consoles, flags=3D0x300> > vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 > sio1: configured irq 3 not in bitmap of probed irqs 0 > sio1: port may not be enabled > Timecounter "TSC" frequency 2210091822 Hz quality 800 > Timecounters tick every 1.000 msec > acd0: DVDROM at ata0-master UDMA33 > acd1: DVDR at ata0-slave UDMA33 > ad2: 78533MB at ata1-master UDMA100 > > > Fatal trap 18: integer divide fault while in kernel mode > instruction pointer =3D 0x8:0xffffffff801bd9a5 > stack pointer =3D 0x10:0xffffffff80845a90 > frame pointer =3D 0x10:0xffffffff80845ae0 > code segment =3D base 0x0, limit 0xfffff, type 0x1b > =3D DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags =3D interrupt enabled, resume, IOPL =3D 0 > current process =3D 0 (swapper) > trap number =3D 18 > panic: integer divide fault > Uptime: 1s > Cannot dump. No dump device defined. > Automatic reboot in 15 seconds - press a key on the console to abort > Rebooting... --nextPart1390711.J0h2XYPAOn Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) iD8DBQBFEv2kzZEjpyKHuQwRAvchAJ0aduczWbZBNbhaofXPRT2kG0l5vACfdIER vXQTSWIF311y6Tk11C0TmxE= =evH4 -----END PGP SIGNATURE----- --nextPart1390711.J0h2XYPAOn--