Date: Sun, 10 Nov 2002 22:46:30 -0400 (AST)
From: "Marc G. Fournier" <scrappy@hub.org>
To: freebsd-stable@freebsd.org
Subject: Changes on Oct 28th break -STABLE kernel ...
Message-ID: <20021110222143.D11716-100000@hub.org>
Evening ...
After spending today doing an incremental upgrade, starting from Oct12th
(my last stable), going to Oct12th, Oct19 and Oct27th (all successful), I
dropped down to a 'daily incremental' ... Oct28th (the kernel I am now
running) appears to work great, but as soon as I go to Oct29th, I can't
get it to come up ...
From cvsup, the changes between Oct28th (midnight) and Oct29th are:
Connected to cvsup.FreeBSD.org
Updating collection src-all/cvs
Edit src/etc/MAKEDEV
Edit src/sys/conf/files
Edit src/sys/dev/amr/amr.c
Edit src/sys/dev/amr/amr_cam.c
Edit src/sys/dev/amr/amr_compat.h
Edit src/sys/dev/amr/amr_disk.c
Edit src/sys/dev/amr/amr_pci.c
Edit src/sys/dev/amr/amr_tables.h
Edit src/sys/dev/amr/amrio.h
Edit src/sys/dev/amr/amrreg.h
Edit src/sys/dev/amr/amrvar.h
Edit src/sys/modules/amr/Makefile
Edit src/sys/netinet/ip_fw.c
Finished successfully
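As a rough way to see where the Oct28 -> Oct29 changes are concentrated, the "Edit" lines above can be grouped by directory. The following is only a sketch: the helper names (changed_files, group_by_area) and the depth parameter are made up for illustration, and the log text is simply the cvsup output pasted from this message.

```python
from collections import Counter

# cvsup output copied from the message above
CVSUP_LOG = """\
Connected to cvsup.FreeBSD.org
Updating collection src-all/cvs
Edit src/etc/MAKEDEV
Edit src/sys/conf/files
Edit src/sys/dev/amr/amr.c
Edit src/sys/dev/amr/amr_cam.c
Edit src/sys/dev/amr/amr_compat.h
Edit src/sys/dev/amr/amr_disk.c
Edit src/sys/dev/amr/amr_pci.c
Edit src/sys/dev/amr/amr_tables.h
Edit src/sys/dev/amr/amrio.h
Edit src/sys/dev/amr/amrreg.h
Edit src/sys/dev/amr/amrvar.h
Edit src/sys/modules/amr/Makefile
Edit src/sys/netinet/ip_fw.c
Finished successfully
"""


def changed_files(log):
    """Return the paths from lines of the form 'Edit <path>' (hypothetical helper)."""
    return [line.split(None, 1)[1]
            for line in log.splitlines()
            if line.startswith("Edit ")]


def group_by_area(paths, depth=4):
    """Count changed files per directory prefix, at most `depth` components deep."""
    counts = Counter()
    for path in paths:
        parts = path.split("/")
        # Use only the directory components, never the file name itself.
        area = "/".join(parts[:-1][:depth])
        counts[area] += 1
    return counts


if __name__ == "__main__":
    for area, n in group_by_area(changed_files(CVSUP_LOG)).most_common():
        print(f"{n:2d}  {area}")
```

Grouped this way, 9 of the 13 edited files are under src/sys/dev/amr, with single edits to src/etc, src/sys/conf, src/sys/modules/amr and src/sys/netinet. That only shows where the changes are, not which one is responsible.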
The AMR driver is what I'm running:
venus# grep amr /var/run/dmesg.boot
amr0: flushing cache...done
amr0: <AMI MegaRAID> mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1
amr0: <Series 475 40 Logical Drive Firmware> Firmware E161, BIOS 3.13, 32MB RAM
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
Mounting root from ufs:/dev/amrd0s1a
I've made no changes to my kernel config (or the hardware) between the
Oct28 and Oct29th kernels ...
This server is a remote server, so 'hands on' debugging is difficult ...
the folks at Rackspace try to provide what they can, based on what is on
the console, which, in this case, was:
"Your kernal failed when tring to boot the second processor."
So, it seems that the 'hang @ "SMP: AP CPU #1 Launched!"' problem
started with code around the Oct28/Oct29th period ...
According to /var/run/dmesg.boot, the following is what happens after
the "SMP: AP CPU #1 Launched!" message comes up on a good boot:
SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a
and, on a good boot, the dmesg.boot looks like:
Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 4.7-STABLE #20: Sun Nov 10 18:55:29 CST 2002
root@venus.hub.org:/usr/obj/usr/src/sys/kernel
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1262.67-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0x6b1 Stepping = 1
Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory = 4227858432 (4128768K bytes)
avail memory = 4120436736 (4023864K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor motherboard
cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
io0 (APIC): apic id: 4, version: 0x000f0011, at 0xfec00000
io1 (APIC): apic id: 5, version: 0x000f0011, at 0xfec01000
Preloaded elf kernel "kernel" at 0xc02a7000.
Pentium Pro MTRR support enabled
Using $PIR table, 10 entries at 0xc00f51c0
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 6 -> irq 2
IOAPIC #1 intpin 4 -> irq 5
IOAPIC #1 intpin 5 -> irq 9
pci0: <PCI bus> on pcib0
pci0: <ATI Mach64-GR graphics accelerator> at 1.0 irq 2
pci0: <unknown card> (vendor=0x8086, dev=0x1229) at 4.0 irq 5
pci0: <unknown card> (vendor=0x8086, dev=0x1229) at 5.0 irq 9
isab0: <ServerWorks IB6566 PCI to ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
pci0: <Unknown PCI ATA controller> at 15.1
pci0: <OHCI USB controller> at 15.2 irq 10
pcib1: <ServerWorks NB6635 3.0LE host to PCI bridge> on motherboard
IOAPIC #1 intpin 11 -> irq 11
IOAPIC #1 intpin 7 -> irq 16
IOAPIC #1 intpin 8 -> irq 17
pci1: <PCI bus> on pcib1
amr0: <AMI MegaRAID> mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1
amr0: <Series 475 40 Logical Drive Firmware> Firmware E161, BIOS 3.13, 32MB RAM
sym0: <896> port 0xe400-0xe4ff mem 0xfebc8000-0xfebc9fff,0xfebe0000-0xfebe03ff irq 16 at device 3.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-40, SE, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym1: <896> port 0xe800-0xe8ff mem 0xfebe8000-0xfebe9fff,0xfebf0000-0xfebf03ff irq 17 at device 3.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc9800-0xca7ff,0xca800-0xcb7ff,0xcb800-0xcbfff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to accept, logging disabled
Waiting 15 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a
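For comparing against whatever the console shows on a failing boot, the lines that normally follow the AP launch message can be pulled out of a saved dmesg. This is a minimal sketch: lines_after is a hypothetical helper, and the sample text is the tail of the good boot shown above. On the machine itself the input would be the contents of /var/run/dmesg.boot.

```python
MARKER = "SMP: AP CPU #1 Launched!"

# Tail of the good-boot dmesg from this message
SAMPLE = """\
amrd0: <MegaRAID logical drive> on amr0
amrd0: 105000MB (215040000 sectors) RAID 5 (optimal)
SMP: AP CPU #1 Launched!
sa0 at sym0 bus 0 target 0 lun 0
sa0: <SONY SDX-700C 0101> Removable Sequential Access SCSI-2 device
sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit)
Mounting root from ufs:/dev/amrd0s1a
"""


def lines_after(dmesg_text, marker):
    """Return every line that follows the first line containing `marker`.

    Returns an empty list if the marker never appears.
    """
    lines = dmesg_text.splitlines()
    for i, line in enumerate(lines):
        if marker in line:
            return lines[i + 1:]
    return []


if __name__ == "__main__":
    for line in lines_after(SAMPLE, MARKER):
        print(line)
```

On the good boot, the steps after the marker are the sa0 tape attach on sym0 and the root mount from amrd0s1a, so a hang at the marker means none of those messages were printed.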
The motherboard is: Thunder Le-T with BIOS version 1.06 ...
Thoughts? Anything else I can provide to narrow down, and fix, the
problem?
thanks ...
