From owner-freebsd-current@FreeBSD.ORG Sun Apr 24 01:21:57 2005 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 8A2BB16A4CE for ; Sun, 24 Apr 2005 01:21:57 +0000 (GMT) Received: from gromit.dlib.vt.edu (gromit.dlib.vt.edu [128.173.49.29]) by mx1.FreeBSD.org (Postfix) with ESMTP id 2667943D60 for ; Sun, 24 Apr 2005 01:21:57 +0000 (GMT) (envelope-from paul@gromit.dlib.vt.edu) Received: from zappa.Chelsea-Ct.Org (pool-70-110-10-69.roa.east.verizon.net [70.110.10.69]) by gromit.dlib.vt.edu (8.13.3/8.13.3) with ESMTP id j3O1LsZv019426 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK) for ; Sat, 23 Apr 2005 21:21:55 -0400 (EDT) (envelope-from paul@gromit.dlib.vt.edu) Received: from zappa.Chelsea-Ct.Org (localhost.Chelsea-Ct.Org [127.0.0.1]) by zappa.Chelsea-Ct.Org (8.13.3/8.13.3) with ESMTP id j3O1LmZ8071740 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK) for ; Sat, 23 Apr 2005 21:21:48 -0400 (EDT) (envelope-from paul@gromit.dlib.vt.edu) Received: (from paul@localhost) by zappa.Chelsea-Ct.Org (8.13.3/8.13.3/Submit) id j3O1Llxn071739 for freebsd-current@freebsd.org; Sat, 23 Apr 2005 21:21:47 -0400 (EDT) (envelope-from paul@gromit.dlib.vt.edu) X-Authentication-Warning: zappa.Chelsea-Ct.Org: paul set sender to paul@gromit.dlib.vt.edu using -f From: Paul Mather To: freebsd-current@freebsd.org Content-Type: text/plain Content-Transfer-Encoding: 7bit Date: Sat, 23 Apr 2005 21:21:46 -0400 Message-Id: <1114305707.71309.40.camel@zappa.Chelsea-Ct.Org> Mime-Version: 1.0 X-Mailer: Evolution 2.2.2 FreeBSD GNOME Team Port Subject: Fatal TIMEOUT - WRITE_DMA errors return with ATA Mk.III X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 24 Apr 2005 01:21:57 -0000 Since somewhere in the 5.x release cycle, my system has fallen prey to the "TIMEOUT - WRITE_DMA" errors which result in the drive becoming detached (which causes my geom_mirror to break and require rebuilding). According to smartctl and disk diagnostics, there's nothing wrong with my drives. Plus, the problem does not manifest itself under 4-STABLE. (I'm not the only one to have reported this problem.) Lately, I'd had success using a patch posted to freebsd-current by Ian Dowse. The "TIMEOUT - WRITE_DMA" errors still occurred, but they weren't fatal. I updated my kernel and world recently, and, alas, the "TIMEOUT - WRITE_DMA" problem has returned once more: ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=49981679 ad0: FAILURE - device detached subdisk0: detached ad0: detached GEOM_MIRROR: Cannot update metadata on disk ad0 (error=5). GEOM_MIRROR: Cannot update metadata on disk ad0 (error=6). GEOM_MIRROR: Device raid1: provider ad0 disconnected. GEOM_MIRROR: Request failed (error=6). ad0[WRITE(offset=3847741440, length=16384)] Ian's patch was against the pre-ATA Mk.III regime. I doubt it is applicable to the ATA Mk.III rewrite. :-( Here is my system (re: ATA), FWIW: FreeBSD 6.0-CURRENT #0: Mon Apr 18 12:25:24 EDT 2005 paul@zappa.Chelsea-Ct.Org:/usr/obj/usr/src/sys/ZAPPA Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel Pentium III (698.39-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x683 Stepping = 3 Features=0x383f9ff real memory = 536870912 (512 MB) avail memory = 520253440 (496 MB) [[...]] atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x1440-0x144f at device 7.1 on pci0 ata0: on atapci0 ata1: on atapci0 [[...]] ad0: 24405MB at ata0-master UDMA33 acd0: DVDR at ata0-slave UDMA33 ad2: 24405MB at ata1-master UDMA33 acd1: CDRW at ata1-slave PIO4 My kernel has ATAPICAM support compiled in. Here is the pciconf -vl output for my ATA controller: atapci0@pci0:7:1: class=0x010180 card=0x00000000 chip=0x71118086 rev=0x01 hdr=0x00 vendor = 'Intel Corporation' device = '82371AB/EB/MB PIIX4/4E/4M IDE Controller' class = mass storage subclass = ATA It is in a Dell Dimension XPS T700r. Is there any way to up the number of retries to, say, 5, to see if this helps? Oh, well, welcome back "gmirror rebuild..." :-) Cheers, Paul. -- e-mail: paul@gromit.dlib.vt.edu "Without music to decorate it, time is just a bunch of boring production deadlines or dates by which bills must be paid." --- Frank Vincent Zappa