From owner-freebsd-questions@FreeBSD.ORG Wed Sep 7 01:00:17 2005 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id DCD3116A41F for ; Wed, 7 Sep 2005 01:00:17 +0000 (GMT) (envelope-from freebsd-questions@m.gmane.org) Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by mx1.FreeBSD.org (Postfix) with ESMTP id 87C1D43D49 for ; Wed, 7 Sep 2005 01:00:08 +0000 (GMT) (envelope-from freebsd-questions@m.gmane.org) Received: from list by ciao.gmane.org with local (Exim 4.43) id 1ECoH8-0006V6-AX for freebsd-questions@freebsd.org; Wed, 07 Sep 2005 02:58:54 +0200 Received: from anthonychavez.org ([166.70.126.66]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 07 Sep 2005 02:58:54 +0200 Received: from acc by anthonychavez.org with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 07 Sep 2005 02:58:54 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-questions@freebsd.org From: Anthony Chavez Date: Tue, 06 Sep 2005 18:57:47 -0600 Lines: 135 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha1; protocol="application/pgp-signature" X-Complaints-To: usenet@sea.gmane.org X-Gmane-NNTP-Posting-Host: anthonychavez.org X-PGP-Key: http://anthonychavez.org/pubkey.asc User-Agent: Gnus/5.1006 (Gnus v5.10.6) Emacs/22.0.50 (darwin) Cancel-Lock: sha1:kzIiOObeEXzK4ePNeIZCxQePi9U= Sender: news Subject: Re: Stress testing and TIMEOUT - WRITE_DMA X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 07 Sep 2005 01:00:18 -0000 --=-=-= On Fri, 26 Aug 2005 03:21:35 -0600 Anthony Chavez wrote: > My question is simply this: is the fact that I received 4 TIMEOUT > warnings in the space of roughly 2 weeks significant cause for concern? Apparently, the fact that the stress tool produced so few warnings may have given me a false sense of security. I'm being treated to the following messages (81 in total) today, after 8 days uptime: Sep 6 11:35:27 mybox kernel: ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=8348191 ... Sep 6 18:59:09 mybox kernel: ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=8348383 Sep 6 19:04:58 mybox kernel: ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=61749183 The READ_DMA timeouts are happening very infrequently, but it's worth mentioning that I'm seeing them now in addition. This is quite disturbing, particularly when the machine in question is *in*production.* Has anyone who has experienced this pain found solace in 5-STABLE's ATA drivers? dmesg below. -- Anthony Chavez http://anthonychavez.org/ mailto:acc@anthonychavez.org jabber:acc@jabber.anthonychavez.org Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.4-RELEASE-p6 #0: Fri Aug 26 02:23:19 MDT 2005 root@mybox:/usr/obj/usr/src/sys/GENERIC ACPI APIC Table: Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Celeron(R) CPU 2.40GHz (2392.25-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf29 Stepping = 9 Features=0xbfebfbff real memory = 266813440 (254 MB) avail memory = 251445248 (239 MB) ioapic0: Changing APIC ID to 1 ioapic0 irqs 0-23 on motherboard npx0: on motherboard npx0: INT 16 interface acpi0: on motherboard acpi0: Power Button (fixed) Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0 cpu0: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 agp0: mem 0xfeb80000-0xfebfffff,0xe8000000-0xefffffff irq 16 at device 2.0 on pci0 agp0: detected 892k stolen memory agp0: aperture size is 128M uhci0: port 0xff80-0xff9f irq 16 at device 29.0 on pci0 usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xff60-0xff7f irq 19 at device 29.1 on pci0 usb1: on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered uhci2: port 0xff40-0xff5f irq 18 at device 29.2 on pci0 usb2: on uhci2 usb2: USB revision 1.0 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub2: 2 ports with 2 removable, self powered pci0: at device 29.7 (no driver attached) pcib1: at device 30.0 on pci0 pci1: on pcib1 pci1: at device 5.0 (no driver attached) xl0: <3Com 3c900-TPO Etherlink XL> port 0xddc0-0xddff irq 18 at device 6.0 on pci1 xl0: selecting 10baseT transceiver, half duplex xl0: Ethernet address: 00:60:97:74:a8:6d bfe0: mem 0xfe9fe000-0xfe9fffff irq 17 at device 9.0 on pci1 miibus0: on bfe0 bmtphy0: on miibus0 bmtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto bfe0: Ethernet address: 00:12:3f:d4:21:75 isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0xffa0-0xffaf,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 irq 18 at device 31.1 on pci0 ata0: on atapci0 ata1: on atapci0 pci0: at device 31.3 (no driver attached) pci0: at device 31.5 (no driver attached) fdc0: port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0 atkbdc0: port 0x64,0x60 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A ppc0: port 0x778-0x77f,0x378-0x37f irq 7 on acpi0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/8 bytes threshold ppbus0: on ppc0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 orm0: at iomem 0xcd000-0xcffff,0xcb800-0xccfff,0xc0000-0xcb7ff on isa0 pmtimer0 on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounter "TSC" frequency 2392248384 Hz quality 800 Timecounters tick every 10.000 msec ad0: 76293MB at ata0-master UDMA100 acd0: CDROM at ata1-master UDMA33 ATA PseudoRAID loaded Mounting root from ufs:/dev/ad0s1a --=-=-= Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.0 (Darwin) iQEVAwUAQx47D/AIdTFWAbdTAQrvZAgAtPd2gV7idLu5bTacQ450qWHd1uqOdyfw zJ/pyLVpZi5zsSaZ87az1hkHO55vrlHF/D3ZPIF/RfIxemYU7hx8BVcYJfPcb5Y0 qmiWtLAfFc9cshXZBu25U5lqw0jHTFImwMPAS7k9+ACuhxfaQmSSGAJKIqptJs/q eJU+n8GIxy167npuSjQ8f0Dq2vHk21zWSSVf89TFa+AAHgM7ffCOZy5HY7jGzVlb RlHBRu0YGoYkryEI6ALxCta1I210d+LdUjaJiaxWsKNd73o6+oEmfomRmtt66BtK oAkz5r4EGIQQbFZpQJSBzKHR4w/9GzfilGMn7JzV1rOIw6Zt3Mmb3A== =65pu -----END PGP SIGNATURE----- --=-=-=--