From owner-freebsd-stable@FreeBSD.ORG Fri Sep 23 05:07:45 2005 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 880F316A41F for ; Fri, 23 Sep 2005 05:07:45 +0000 (GMT) (envelope-from smartweb@leadhill.net) Received: from natco3.natcotech.com (natco3.natcotech.com [205.167.142.103]) by mx1.FreeBSD.org (Postfix) with ESMTP id 1415243D46 for ; Fri, 23 Sep 2005 05:07:44 +0000 (GMT) (envelope-from smartweb@leadhill.net) Received: from ibm.nlcc.us (ldhl-ras1-dial-12-28-24-103.natcotech.com [12.28.24.103]) by natco3.natcotech.com (Postfix) with ESMTP id 53FC762A5D for ; Fri, 23 Sep 2005 00:07:43 -0500 (CDT) Received: (qmail 9023 invoked by uid 89); 23 Sep 2005 05:07:42 -0000 Received: from unknown (HELO ?192.168.0.2?) (192.168.0.2) by ibm.nlcc.us with SMTP; 23 Sep 2005 05:07:42 -0000 Message-ID: <43338D9D.7070109@leadhill.net> Date: Fri, 23 Sep 2005 00:07:41 -0500 From: Billy Newsom User-Agent: Mozilla Thunderbird 1.0.2 (Windows/20050317) X-Accept-Language: en-us, en MIME-Version: 1.0 To: FreeBSD Stable Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: critical BOOT failure updating to latest 5-Stable (5.4) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 23 Sep 2005 05:07:45 -0000 I have a usable kernel that I built from 5-Stable sources on July 4th, 2005. But the last two days, I tried to compile and install the lastest 5-Stable, and neither one would boot. During the boot, the entire normal dmesg is output (the part which is in "bold" on the CRT), except the last line which normally says, Mounting root from ufs:/dev/ad0s1a the system freezes with the disk LED "on" on the computer. So the last message I see on the screen during the failed boot is this: SMP: AP CPU #1 Launched! Okay, since I tried two kernels, and both froze up at the same place, but there was one minor difference, here it is... I saw this error the very last time I booted to the second bad kernel: ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=0 This error was right below the line SMP: AP CPU #1 Launched! Okay, that is definitely an indication of some sort of timing or driver issue. But as far as I know, this hard disk has no other known problems, and obviously it boots flawlessly on the other 5.4 kernels and has for a long, long time. Has anything changed in the bootstrap code for the last two months in 5-Stable? I have not left the machine frozen for more than a couple of minutes, but during that time Scroll Lock/Scroll Back won't work. But it's possible the system is possibly in a long (but finite) state of disk reading and it's not completely frozen. I am inclusing the dmesg from a good boot. It looks like the bad boot messages are the same, at least as far as I can tell. Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.4-STABLE #0: Mon Jul 4 18:28:47 CDT 2005 root@me.me.me:/usr/obj/usr/src/sys/BILLYSMP3 Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Pentium Pro (199.31-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x617 Stepping = 7 Features=0xfbff real memory = 536870912 (512 MB) avail memory = 515719168 (491 MB) MPTable: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 1 cpu1 (AP): APIC ID: 0 ioapic0: Assuming intbase of 0 ioapic0 irqs 0-23 on motherboard npx0: on motherboard npx0: INT 16 interface cpu0 on motherboard cpu1 on motherboard pcib0: pcibus 0 on motherboard pci0: on pcib0 isab0: at device 1.0 on pci0 isa0: on isab0 atapci0: port 0xfff0-0xffff,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 1.1 on pci0 ata0: channel #0 on atapci0 ata1: channel #1 on atapci0 uhci0: port 0x5400-0x541f irq 19 at device 1.2 on pci0 usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered rl0: port 0x5000-0x50ff mem 0x50000000-0x500000ff irq 18 at device 6.0 on pci0 miibus0: on rl0 rlphy0: on miibus0 rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto rl0: Ethernet address: 00:50:ba:4d:57:6f pci0: at device 11.0 (no driver attached) orm0: at iomem 0xc0000-0xc7fff on isa0 fdc0: at port 0x3f0-0x3f5 irq 6 drq 2 on isa0 fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: at port 0x64,0x60 on isa0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <6 virtual consoles, flags=0x300> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/8 bytes threshold ppbus0: on ppc0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 ep0: <3Com 3C509B-TPO EtherLink III (PnP)> at port 0x210-0x21f irq 5 on isa0 ep0: Ethernet address: 00:a0:26:77:23:83 unknown: can't assign resources (port) unknown: can't assign resources (port) unknown: can't assign resources (port) unknown: can't assign resources (port) unknown: can't assign resources (port) Timecounters tick every 10.000 msec IP Filter: v3.4.35 initialized. Default = pass all, Logging = enabled Interrupt storm detected on "irq5: ep0"; throttling interrupt source ad0: 8809MB [17898/16/63] at ata0-master WDMA2 ad2: 2014MB [4092/16/63] at ata1-master WDMA2 acd0: DVDROM at ata1-slave PIO4 SMP: AP CPU #1 Launched! Mounting root from ufs:/dev/ad0s1a