Date: Mon, 10 Jan 2011 17:34:48 -0500 From: Mark Saad <nonesuch@longcount.org> To: stable@freebsd.org Subject: Enabling DDB prevent kernel from panicing Message-ID: <AANLkTinp76kxbRu6y0=Qfe9PiuDUPiUuU7zbQ24nkp8B@mail.gmail.com>
next in thread | raw e-mail | index | archive | help
All This was originally posted to hackers@ I have a good question that I cant find an answer for. I believe found a kernel bug in 7.3-RELEASE that prevents me from booting 64-bit kernels on HP's DL360 G4p . The kernel dies with "Fatal trap 12: page fault while in kernel mode " . The hardware works fine in 7.2-RELEASE amd64, 7.1-RELEASE amd64, and 6.4-RELEASE amd64 . In 7.3-RELEASE amd64 I can not boot from cd or pxe correctly using the stock 7.3-RELEASE amd64 kernel however i386 works fine. To see if this issue was some how fixed in 7.3-RELEASE-p4 amd64 I rebuilt a GENERIC kernel using patches sources and tried to boot and I got the same crash. Next I rebuilt the kernel with KDB and DDB to see if I could get a core-dump of the system. I also set loader.conf to kernel="kernel.DEBUG" kern.dumpdev="/dev/da0s1b" Next I pxebooted the box and the system does not crash on boot up, it will easily load a nfs root and work fine. So I copied my debug kernel, and loader.conf to the local disk and rebooted and it boots fine from the local disk . Rebooting the server and running off the local disks and debug kernel, I cant find any issues. Reboot the box into a GENERIC 7.3-RELEASE-p4 kernel and it crashes With this error Fatal trap 12: page fault while in kernel mode cpuid = 0; apic id = 00 fault virtual address = 0x0 fault code = supervisor write data, page not present instruction pointer = 0x8:0xffffffff800070fa stack pointer = 0x10:0xffffffff8153cbe0 frame pointer = 0x10:0xffffffff8153cc50 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 0 (swapper) [thread pid 0 tid 100000 ] Stopped at bzero+0xa: repe stosq %es:(%rdi) It was recommended to comment out the sio hints in /boot/device.hints I did this and I can properly boot a GENERIC 7.3-RELEASE kernel. I reran this same test using 7.4-RC1 the system boots with out any changes to anything. So my question, does anyone know what changed in stable/7 after the creation of 7.3-RELEASE that could have fixed this or does anyone know what could be causing this issue. The sio code does not look like its been changed in a long while . Do we still need s the hits for the sio ports anyway does omitting them from the hints file cause any major issues, I can use the serial port for a console and to connect to to other serial devices with out any issues. -- mark saad | nonesuch@longcount.org
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?AANLkTinp76kxbRu6y0=Qfe9PiuDUPiUuU7zbQ24nkp8B>