From owner-freebsd-stable Sun Nov 10 18:46:36 2002 Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 99DD937B404 for ; Sun, 10 Nov 2002 18:46:32 -0800 (PST) Received: from hub.org (hub.org [64.49.215.141]) by mx1.FreeBSD.org (Postfix) with ESMTP id A3E2443E77 for ; Sun, 10 Nov 2002 18:46:31 -0800 (PST) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [64.49.215.141]) by hub.org (Postfix) with ESMTP id F1C678A18A0 for ; Sun, 10 Nov 2002 22:46:30 -0400 (AST) Date: Sun, 10 Nov 2002 22:46:30 -0400 (AST) From: "Marc G. Fournier" To: freebsd-stable@freebsd.org Subject: Changes on Oct 28th break -STABLE kernel ... Message-ID: <20021110222143.D11716-100000@hub.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Evening ... After spending today doing an incremental upgrade, starting from Oct12th (my last stable), going to Oct12th, Oct19 and Oct27th (all sucessful), I drop'd down to a 'daily incremental' ... Oct28th (the kernel I am now running) appears to work great, but as soon as I go to Oct29th, I can't get it to come up ... From cvsup, the changes between Oct28th (midnight) and Oct29th are: Connected to cvsup.FreeBSD.org Updating collection src-all/cvs Edit src/etc/MAKEDEV Edit src/sys/conf/files Edit src/sys/dev/amr/amr.c Edit src/sys/dev/amr/amr_cam.c Edit src/sys/dev/amr/amr_compat.h Edit src/sys/dev/amr/amr_disk.c Edit src/sys/dev/amr/amr_pci.c Edit src/sys/dev/amr/amr_tables.h Edit src/sys/dev/amr/amrio.h Edit src/sys/dev/amr/amrreg.h Edit src/sys/dev/amr/amrvar.h Edit src/sys/modules/amr/Makefile Edit src/sys/netinet/ip_fw.c Finished successfully The AMR driver is what I'm running: venus# grep amr /var/run/dmesg.boot amr0: flushing cache...done amr0: mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1 amr0: Firmware E161, BIOS 3.13, 32MB RAM amrd0: on amr0 amrd0: 105000MB (215040000 sectors) RAID 5 (optimal) Mounting root from ufs:/dev/amrd0s1a I've made not changes to my kernel config (or the hardware) between the Oct28 and Oct29th kernels ... This server is a remote server, so 'hands on' debugging is difficult ... the folks at Rackspace try and provide what they can, based on what is on the console, which, in this case: "Your kernal failed when tring to boot the second processor." So, it seems that the 'hang @ "SMP: AP CPU #1 Launched!"' problem started with code around the Oct28/Oct29th period ... According to /var/run/dmesg.boot, the following is what happens after the "SMP: AP CPU #1 Launched!" message comes up on a good boot: SMP: AP CPU #1 Launched! sa0 at sym0 bus 0 target 0 lun 0 sa0: Removable Sequential Access SCSI-2 device sa0: 80.000MB/s transfers (40.000MHz, offset 31, 16bit) Mounting root from ufs:/dev/amrd0s1a and, on a good boot, the dmesg.boot looks like: Copyright (c) 1992-2002 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.7-STABLE #20: Sun Nov 10 18:55:29 CST 2002 root@venus.hub.org:/usr/obj/usr/src/sys/kernel Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (1262.67-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x6b1 Stepping = 1 Features=0x383fbff real memory = 4227858432 (4128768K bytes) avail memory = 4120436736 (4023864K bytes) Programming 16 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 Programming 16 pins in IOAPIC #1 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 4, version: 0x000f0011, at 0xfec00000 io1 (APIC): apic id: 5, version: 0x000f0011, at 0xfec01000 Preloaded elf kernel "kernel" at 0xc02a7000. Pentium Pro MTRR support enabled Using $PIR table, 10 entries at 0xc00f51c0 npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard IOAPIC #1 intpin 6 -> irq 2 IOAPIC #1 intpin 4 -> irq 5 IOAPIC #1 intpin 5 -> irq 9 pci0: on pcib0 pci0: at 1.0 irq 2 pci0: (vendor=0x8086, dev=0x1229) at 4.0 irq 5 pci0: (vendor=0x8086, dev=0x1229) at 5.0 irq 9 isab0: at device 15.0 on pci0 isa0: on isab0 pci0: at 15.1 pci0: at 15.2 irq 10 pcib1: on motherboard IOAPIC #1 intpin 11 -> irq 11 IOAPIC #1 intpin 7 -> irq 16 IOAPIC #1 intpin 8 -> irq 17 pci1: on pcib1 amr0: mem 0xfc1f0000-0xfc1fffff irq 11 at device 2.0 on pci1 amr0: Firmware E161, BIOS 3.13, 32MB RAM sym0: <896> port 0xe400-0xe4ff mem 0xfebc8000-0xfebc9fff,0xfebe0000-0xfebe03ff irq 16 at device 3.0 on pci1 sym0: Symbios NVRAM, ID 7, Fast-40, SE, parity checking sym0: open drain IRQ line driver, using on-chip SRAM sym0: using LOAD/STORE-based firmware. sym0: handling phase mismatch from SCRIPTS. sym1: <896> port 0xe800-0xe8ff mem 0xfebe8000-0xfebe9fff,0xfebf0000-0xfebf03ff irq 17 at device 3.1 on pci1 sym1: Symbios NVRAM, ID 7, Fast-40, LVD, parity checking sym1: open drain IRQ line driver, using on-chip SRAM sym1: using LOAD/STORE-based firmware. sym1: handling phase mismatch from SCRIPTS. orm0: