Date: Tue, 12 Nov 2002 22:37:55 -0800 (PST)
From: Dave Cornejo <dave@dogwood.com>
To: freebsd-current@freebsd.org
Subject: current SMP kernel crashes (different?)
Message-ID: <200211130637.gAD6btTC017406@white.dogwood.com>

I've had a problem with a SuperMicro 2010H server crashing when
attempting to run an SMP kernel. I've noticed a lot of this lately,
but this one seems to be crashing in the clock code. Below is the
console output from power-up to crash. If I use a UP kernel of the
same vintage there is no problem.

If I'm reading the code correctly, this seems to be a problem in APIC
mode 8254 detection.

Does anyone have any idea why this is happening? Any magical hints I
could use to get past this? I've tried disabling ACPI to no avail.

thanks,
dave c

Rebooting...
Console: serial port
BIOS drive A: is disk0
BIOS drive C: is disk1
BIOS drive D: is disk2
BIOS 639kB/1047488kB available memory

FreeBSD/i386 bootstrap loader, Revision 1.1
(dave@white.dogwood.com, Fri Nov 8 11:05:00 PST 2002)
Loading /boot/defaults/loader.conf
/boot/kernel/kernel text=0x3edda0 data=0x56898+0x82bec syms=[0x4+0x4d580+0x4+0x5bcf9]
/
Hit [Enter] to boot immediately, or any other key for command prompt.
Booting [/boot/kernel/kernel]...
/boot/kernel/acpi.ko text=0x38abc data=0x1a58+0xb48 syms=[0x4+0x5770+0x4+0x73b9]
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #2: Tue Nov 12 21:19:51 PST 2002
    dave@white.dogwood.com:/usr/obj/usr/src/sys/WHITE
Preloaded elf kernel "/boot/kernel/kernel" at 0xc06bc000.
Preloaded elf module "/boot/kernel/acpi.ko" at 0xc06bc0a8.
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (999.53-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x686  Stepping = 6
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 1073676288 (1023 MB)
avail memory = 1036230656 (988 MB)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 4, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 5, version: 0x000f0011, at 0xfec01000
Initializing GEOMetry subsystem
Pentium Pro MTRR support enabled
ACPI-0623: *** Warning: Type override - [DEB_] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [MLIB] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [IO__] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [DATA] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [SIO_] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [SB__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [ICNT] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [ACPI] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [OSB4] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [BIOS] had invalid type (Integer) for Scope operator, changed to (Scope)
ACPI-0623: *** Warning: Type override - [CMOS] had invalid type (Integer) for Scope operator, changed to (Scope)
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <RCC RCCNILE > on motherboard
acpi0: power button is handled as a fixed feature programming model.
acpi0: sleep button is handled as a fixed feature programming model.
Timecounter "ACPI-safe" frequency 3579545 Hz
Using $PIR table, 7 entries at 0xc00f5370
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x508-0x50b on acpi0
acpi_cpu0: <CPU> on acpi0
acpi_cpu1: <CPU> on acpi0
acpi_cpu2: <CPU> on acpi0
acpi_cpu3: <CPU> on acpi0
acpi_button0: <Sleep Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
---- initial configuration ------------------------
\_SB_.LNUS irq 0: [ 10] 0.15.0
---- before setting priority for links ------------
\_SB_.LNUS:
    interrupts: 10
    penalty: 110
    references: 1
    priority: 0
---- before fixup boot-disabled links -------------
\_SB_.LNUS:
    interrupts: 10
    penalty: 110
    references: 1
    priority: 110
---- after fixup boot-disabled links --------------
\_SB_.LNUS:
    interrupts: 10
    penalty: 110
    references: 1
    priority: 110
---- arbitrated configuration ---------------------
\_SB_.LNUS irq 0: [ 10] 0.15.0
pci0: <ACPI PCI bus> on pcib0
IOAPIC #1 intpin 12 -> irq 2
IOAPIC #1 intpin 10 -> irq 5
IOAPIC #1 intpin 11 -> irq 7
IOAPIC #1 intpin 15 -> irq 9
pcib1: <ACPI PCI-PCI bridge> at device 0.1 on pci0
---- initial configuration ------------------------
---- before setting priority for links ------------
\_SB_.LNUS:
    interrupts: 10
    penalty: 110
    references: 1
    priority: 110
---- before fixup boot-disabled links -------------
\_SB_.LNUS:
    interrupts: 10
    penalty: 110
    references: 1
    priority: 110
---- after fixup boot-disabled links --------------
\_SB_.LNUS:
    interrupts: 10
    penalty: 110
    references: 1
    priority: 110
---- arbitrated configuration ---------------------
pci1: <ACPI PCI bus> on pcib1
IOAPIC #1 intpin 14 -> irq 10
pci1: <display, VGA> at device 0.0 (no driver attached)
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xc800-0xc83f mem 0xfe800000-0xfe8fffff,0xfeafb000-0xfeafbfff irq 2 at device 4.0 on pci0
fxp0: Ethernet address 00:30:48:11:69:84
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ahc0: <Adaptec aic7899 Ultra160 SCSI adapter> port 0xd000-0xd0ff mem 0xfeafc000-0xfeafcfff irq 5 at device 5.0 on pci0
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
ahc1: <Adaptec aic7899 Ultra160 SCSI adapter> port 0xd800-0xd8ff mem 0xfeaff000-0xfeafffff irq 7 at device 5.1 on pci0
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs
fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0xd400-0xd43f mem 0xfe900000-0xfe9fffff,0xfeafd000-0xfeafdfff irq 9 at device 6.0 on pci0
fxp1: Ethernet address 00:30:48:11:6e:27
inphy1: <i82555 10/100 media interface> on miibus1
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: <PCI-ISA bridge> port 0x580-0x58f at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 ATA33 controller> port 0xffa0-0xffaf at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
ohci0: <OHCI (generic) USB controller> mem 0xfffff000-0xffffffff irq 0 at device 15.2 on pci0
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
pcib2: <ACPI Host-PCI bridge> on acpi0
acpi0: couldn't attach pci bus
device_probe_and_attach: pcib2 attach returned 6
atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
fdc0: cmd 3 failed at out byte 1 of 3
sio0 port 0x3f8-0x3ff irq 4 on acpi0
sio0: type 16550A, console
sio1 port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
pcib2: <ACPI Host-PCI bridge> on acpi0
acpi0: couldn't attach pci bus
device_probe_and_attach: pcib2 attach returned 6
fdc0: cmd 3 failed at out byte 1 of 3
orm0: <Option ROMs> at iomem 0xcf000-0xcffff,0xc9000-0xcefff,0xc8000-0xc8fff,0xc0000-0xc7fff on isa0
fdc0: <Enhanced floppy controller (i82077, NE72065 or clone)> at port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
pmtimer0 on isa0
ppc0: parallel port not found.
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 10.000 msec
APIC_IO: Testing 8254 interrupt delivery

Fatal trap 12: page fault while in kernel mode
cpuid = 0; lapic.id = 00000000
fault virtual address = 0x6dbc00
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc02d7383
stack pointer = 0x10:0xc06decf8
frame pointer = 0x10:0xc06ded18
code segment = base 0x0, limit 0xfffff, type 0x1b
 = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 0 (swapper)
kernel: type 12 trap, code=0
Stopped at _mtx_lock_flags+0x43: cmpl $0xc05216c0,0(%ebx)
db> trace
_mtx_lock_flags(6dbc00,0,c04c3c00,138,c02d767d) at _mtx_lock_flags+0x43
ithread_remove_handler(c06ded80,c06ded78,c046c427,c06ded80,0) at ithread_remove_handler+0x53
inthand_remove(c06ded80,0,c04e8e36,445,a0) at inthand_remove+0x11
cpu_initclocks(c06ded98,c02bcf75,0,6db000,6dbc00) at cpu_initclocks+0x327
initclocks(0,6db000,6dbc00,6db000,0) at initclocks+0x1c
mi_startup() at mi_startup+0xb5
begin() at begin+0x2c
db>

_mtx_lock_flags+0x43 -> sys/kern/kern_mutex.c:324
ithread_remove_handler+0x53 -> sys/kern/kern_intr.c:314
inthand_remove+0x11 -> sys/i386/isa/intr_machdep.c:705
cpu_initclocks+0x327 -> sys/i386/isa/clock.c:1096
initclocks+0x1c -> sys/kern/kern_clock.c:153
mi_startup+0xb5 -> sys/kern/init_main.c:217

--
Dave Cornejo @ Dogwood Media, Fremont, California (also dcornejo@ieee.org)
"There aren't any monkeys chasing us..." - Xochi

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-current" in the body of the message