From owner-freebsd-stable@FreeBSD.ORG Sun Mar 9 00:09:40 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CC6A81065671; Sun, 9 Mar 2008 00:09:40 +0000 (UTC) (envelope-from eilko@bos-zuidema.nl) Received: from hobbes.brasapen.org (tafkam.xs4all.nl [82.95.217.25]) by mx1.freebsd.org (Postfix) with ESMTP id 369F88FC28; Sun, 9 Mar 2008 00:09:40 +0000 (UTC) (envelope-from eilko@bos-zuidema.nl) Received: from localhost (localhost [127.0.0.1]) by hobbes.brasapen.org (Postfix) with ESMTP id B3D4218984; Sun, 9 Mar 2008 00:51:33 +0100 (CET) X-Virus-Scanned: amavisd-new at bos-zuidema.nl Received: from hobbes.brasapen.org ([127.0.0.1]) by localhost (wilma.brasapen.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id gXumdABVwpS6; Sun, 9 Mar 2008 00:50:53 +0100 (CET) Received: from webmail.home.brasapen.org (webmail.home.brasapen.org [172.20.1.10]) by hobbes.brasapen.org (Postfix) with ESMTP id 4F555189B8; Sun, 9 Mar 2008 00:50:50 +0100 (CET) Received: by webmail.home.brasapen.org (Postfix, from userid 1001) id 97D7B1CC2B; Sun, 9 Mar 2008 00:50:49 +0100 (CET) Date: Sun, 9 Mar 2008 00:50:49 +0100 From: Eilko Bos To: Michael Haro Message-ID: <20080308235049.GA74522@webmail.home.brasapen.org> References: <479BAC09.7040505@freebsd.se> <20080126223750.GA8397@marshal.spacemarines.us> <479BC21D.10607@skyrush.com> <4cd036390801270001u72363b72v84231956b173bf73@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4cd036390801270001u72363b72v84231956b173bf73@mail.gmail.com> User-Agent: Mutt/1.5.16 (2007-06-09) Cc: Remco van Bekkum , Joe Peterson , Nikolaj Farrell , freebsd-stable@freebsd.org Subject: Re: ad8: TIMEOUT - WRITE_DMA errors UFS 7.0-RC1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 09 Mar 2008 00:09:41 -0000 >From the keyboard of Michael Haro, written on Sun, Jan 27, 2008 at 12:01:03AM -0800: > > Can anyone else using 7.0 who hasn't already (especially those using ZFS) > > check his/her /var/log/messages for disk TIMEOUTs or other disk error > > messages? If this is widespread, I think the chances re slim that it is a > > hardware problem in every case. > > I've had this problem with Hitachi sata drives using a promise sata controller. I am using 2 160Gb Maxtor disks in geom_mirror. With 6.3 it runs fine. I upgraded to 7.0-RELEASE and after install problems started. Disk TIMEOUTs freezed the box as soon as I initiated a lot of disk activity (e.g. make buildworld of building a kernel). I 'downgraded' the box to 6.3 again (had to rebuild the mirror because it was touched by a newer gmirror) and now the problems have gone again. I have the strong impression it is not hardware bot rather 7.0-RELEASE related. Actually I want to get rid of the box at home (want to carry it to a datacenter) but if it can be helpfull I am willing to have it for another week or two at home to upgrade/downgrade/etc. with it. My dmesg (6.3 again): -------------------------- Copyright (c) 1992-2008 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 6.3-RELEASE #0: Wed Jan 16 04:45:45 UTC 2008 root@dessler.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel Pentium III (1000.04-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x68a Stepping = 10 Features=0x383fbff real memory = 1610592256 (1535 MB) avail memory = 1564733440 (1492 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): APIC ID: 3 cpu1 (AP): APIC ID: 0 ioapic0 irqs 0-15 on motherboard ioapic1 irqs 16-31 on motherboard kbd1 at kbdmux0 ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) hptrr: HPT RocketRAID controller driver v1.1 (Jan 16 2008 04:43:12) acpi0: on motherboard acpi0: Power Button (fixed) Timecounter "ACPI-safe" frequency 3579545 Hz quality 850 acpi_timer0: <32-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0 cpu0: on acpi0 cpu1: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 fxp0: port 0xd800-0xd83f mem 0xfe000000-0xfe000fff,0xfd800000-0xfd8fffff at device 2.0 on pci0 miibus0: on fxp0 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp0: Ethernet address: 00:e0:18:47:01:35 em0: port 0xd400-0xd43f mem 0xfd000000-0xfd01ffff,0xfc800000-0xfc81ffff irq 17 at device 4.0 on pci0 em0: Ethernet address: 00:07:e9:3e:e7:90 pci0: at device 7.0 (no driver attached) isab0: port 0xe800-0xe80f at device 15.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xb800-0xb80f at device 15.1 on pci0 ata0: on atapci0 ata1: on atapci0 ohci0: mem 0xfa000000-0xfa000fff irq 9 at device 15.2 on pci0 ohci0: [GIANT-LOCKED] usb0: OHCI version 1.0, legacy support usb0: on ohci0 usb0: USB revision 1.0 uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 4 ports with 4 removable, self powered pcib1: on acpi0 pci1: on pcib1 sym0: <1010-33> port 0xb400-0xb4ff mem 0xf9800000-0xf98003ff,0xf9000000-0xf9001fff irq 24 at device 5.0 on pci1 sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking sym0: open drain IRQ line driver, using on-chip SRAM sym0: using LOAD/STORE-based firmware. sym0: handling phase mismatch from SCRIPTS. sym0: [GIANT-LOCKED] sym1: <1010-33> port 0xb000-0xb0ff mem 0xf8800000-0xf88003ff,0xf8000000-0xf8001fff irq 25 at device 5.1 on pci1 sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking sym1: open drain IRQ line driver, using on-chip SRAM sym1: using LOAD/STORE-based firmware. sym1: handling phase mismatch from SCRIPTS. sym1: [GIANT-LOCKED] atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: model IntelliMouse, device ID 3 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A ppc0: port 0x378-0x37f,0x778-0x77a irq 7 drq 3 on acpi0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/8 bytes threshold ppbus0: on ppc0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 pmtimer0 on isa0 orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xc97ff,0xcc000-0xcc7ff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounters tick every 1.000 msec hptrr: no controller detected. Waiting 5 seconds for SCSI devices to settle (noperiph:sym0:0:-1:-1): SCSI BUS reset delivered. (noperiph:sym1:0:-1:-1): SCSI BUS reset delivered. ad0: 152627MB at ata0-master UDMA33 ad1: 152627MB at ata0-slave UDMA33 acd0: CDROM at ata1-master PIO4 - - - - - - - - - - - - - - Rebuilding the mirror: GEOM_MIRROR: Device gm0 created (id=24929696). GEOM_MIRROR: Device gm0: provider ad0 detected. GEOM_MIRROR: Device gm0: provider ad0 activated. GEOM_MIRROR: Device gm0: provider mirror/gm0 launched. GEOM_MIRROR: Kernel module is too old to handle metadata from ad1. SMP: AP CPU #1 Launched! Trying to mount root from ufs:/dev/mirror/gm0s1a em0: link state changed to UP GEOM_MIRROR: Device gm0: provider ad1 detected. GEOM_MIRROR: Device gm0: rebuilding provider ad1. Grtz, -- Eilko Bos.