Date: Sun, 10 Nov 2002 22:46:30 -0400 (AST)
From: "Marc G. Fournier" <scrappy@hub.org>
To: freebsd-stable@freebsd.org
Subject: Changes on Oct 28th break -STABLE kernel ...
Message-ID: <20021110222143.D11716-100000@hub.org>
Evening ...

After spending today doing an incremental upgrade, starting from Oct12th (my last stable), going to Oct12th, Oct19 and Oct27th (all successful), I drop'd down to a 'daily incremental' ...

Oct28th (the kernel I am now running) appears to work great, but as soon as I go to Oct29th, I can't get it to come up ...

From cvsup, the changes between Oct28th (midnight) and Oct29th are:

Connected to cvsup.FreeBSD.org
Updating collection src-all/cvs
 Edit src/etc/MAKEDEV
 Edit src/sys/conf/files
 Edit src/sys/dev/amr/amr.c
 Edit src/sys/dev/amr/amr_cam.c
 Edit src/sys/dev/amr/amr_compat.h
 Edit src/sys/dev/amr/amr_disk.c
 Edit src/sys/dev/amr/amr_pci.c
 Edit src/sys/dev/amr/amr_tables.h
 Edit src/sys/dev/amr/amrio.h
 Edit src/sys/dev/amr/amrreg.h
 Edit src/sys/dev/amr/amrvar.h
 Edit src/sys/modules/amr/Makefile
 Edit src/sys/netinet/ip_fw.c
Finished successfully

The AMR driver is what I'm running:

venus# grep amr /var/run/dmesg.boot
amr0: flushing cache...done
amr0: <AMI MegaRAID> mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1
amr0: <Series 475 40 Logical Drive Firmware> Firmware E161, BIOS 3.13, 32MB RAM
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
Mounting root from ufs:/dev/amrd0s1a

I've made no changes to my kernel config (or the hardware) between the Oct28 and Oct29th kernels ...

This server is a remote server, so 'hands on' debugging is difficult ... the folks at Rackspace try and provide what they can, based on what is on the console, which, in this case:

"Your kernal failed when tring to boot the second processor."

So, it seems that the 'hang @ "SMP: AP CPU #1 Launched!"' problem started with code around the Oct28/Oct29th period ...

According to /var/run/dmesg.boot, the following is what happens after the "SMP: AP CPU #1 Launched!" message comes up on a good boot:

SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a

and, on a good boot, the dmesg.boot looks like:

Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.7-STABLE #20: Sun Nov 10 18:55:29 CST 2002
    root@venus.hub.org:/usr/obj/usr/src/sys/kernel
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1262.67-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x6b1 Stepping = 1
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 4227858432 (4128768K bytes)
avail memory = 4120436736 (4023864K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 4, version: 0x000f0011, at 0xfec00000
 io1 (APIC): apic id: 5, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc02a7000.
Pentium Pro MTRR support enabled
Using $PIR table, 10 entries at 0xc00f51c0
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 4 -> irq 5
IOAPIC #1 intpin 5 -> irq 9
pci0: <PCI bus> on pcib0
pci0: <ATI Mach64-GR graphics accelerator> at 1.0 irq 2
pci0: <unknown card> (vendor=0x8086, dev=0x1229) at 4.0 irq 5
pci0: <unknown card> (vendor=0x8086, dev=0x1229) at 5.0 irq 9
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
pci0: <Unknown PCI ATA controller> at 15.1
pci0: <OHCI USB controller> at 15.2 irq 10
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 11 -> irq 11
IOAPIC #1 intpin 7 -> irq 16
IOAPIC #1 intpin 8 -> irq 17
pci1: <PCI bus> on pcib1
amr0: <AMI MegaRAID> mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1
amr0: <Series 475 40 Logical Drive Firmware> Firmware E161, BIOS 3.13, 32MB RAM
sym0: <896> port 0xe400-0xe4ff mem 0xfebc8000-0xfebc9fff,0xfebe0000-0xfebe03ff irq 16 at device 3.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-40, SE, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <896> port 0xe800-0xe8ff mem 0xfebe8000-0xfebe9fff,0xfebf0000-0xfebf03ff irq 17 at device 3.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc9800-0xca7ff,0xca800-0xcb7ff,0xcb800-0xcbfff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to accept, logging disabled
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a

The motherboard is: Thunder Le-T with BIOS version 1.06 ...

Thoughts? Anything else I can provide to narrow down, and fix, the problem?

Thanks ...

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message