From owner-freebsd-stable@FreeBSD.ORG Mon Oct 13 02:10:01 2003
Delivered-To: freebsd-stable@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
    by hub.freebsd.org (Postfix) with ESMTP id DC41916A4B3
    for ; Mon, 13 Oct 2003 02:10:01 -0700 (PDT)
Received: from mta03-svc.ntlworld.com (mta03-svc.ntlworld.com [62.253.162.43])
    by mx1.FreeBSD.org (Postfix) with ESMTP id 6C28443F85
    for ; Mon, 13 Oct 2003 02:10:00 -0700 (PDT)
    (envelope-from scott@fishballoon.org)
Received: from llama.fishballoon.org ([81.104.195.124])
    by mta03-svc.ntlworld.com ESMTP
    <20031013090958.EWEM6394.mta03-svc.ntlworld.com@llama.fishballoon.org>
    for ; Mon, 13 Oct 2003 10:09:58 +0100
Received: from scott by llama.fishballoon.org with local (Exim 4.20)
    id 1A8yhS-000M9L-Ol
    for freebsd-stable@freebsd.org; Mon, 13 Oct 2003 10:09:10 +0100
Date: Mon, 13 Oct 2003 10:09:10 +0100
From: Scott Mitchell
To: freebsd-stable@freebsd.org
Message-ID: <20031013090910.GA84877@llama.fishballoon.org>
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="YiEDa0DAkWCtVeE4"
Content-Disposition: inline
User-Agent: Mutt/1.4.1i
X-Operating-System: FreeBSD 4.8-RELEASE-p13 i386
Sender: Scott Mitchell
Subject: ATA failure with 4.6.2 & 250GB drive?
X-BeenThere: freebsd-stable@freebsd.org
X-Mailman-Version: 2.1.1
Precedence: list
List-Id: Production branch of FreeBSD source code
X-List-Received-Date: Mon, 13 Oct 2003 09:10:02 -0000

--YiEDa0DAkWCtVeE4
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

Hi all,

Just installed a Maxtor 250GB PATA drive in one of our servers, to be used
as a backup staging area. This was actually a replacement for an identical
drive that appeared to have died after a month of service.

Anyway, 2 days after this drive was installed I started seeing this in the
daily logs:

> ad1s1e: hard error reading fsbn 850845887 of 425422912-425422943 (ad1s1 bn 850845887; cn 52962 tn 180 sn 17) trying PIO mode
> ad1s1e: hard error reading fsbn 850845887 of 425422912-425422943 (ad1s1 bn 850845887; cn 52962 tn 180 sn 17) status=59 error=40
> ad1s1e: hard error reading fsbn 850845887 of 425422912-425422943 (ad1s1 bn 850845887; cn 52962 tn 180 sn 17) status=59 error=40
> ad1s1e: hard error reading fsbn 850845887 of 425422912-425422943 (ad1s1 bn 850845887; cn 52962 tn 180 sn 17) status=59 error=40
...

Several hundred of these are appearing every day, although the backup jobs
do seem to be completing OK, and I'm not seeing any _write_ errors, only
these identical read errors.

I might be reading these messages wrong, but errors on block 850845887 seem
a bit suspicious when the drive only has 490234752 blocks. So, do I really
have another bad drive here? Or do I just need to upgrade this machine to
get a newer ata driver? RELENG_4_6 appears to have support for 48-bit ATA
addressing, so I wasn't expecting any problems, but maybe there's something
else I need?

Any advice much appreciated - this is a production machine so I don't want
to take it out of service for an upgrade unless I really need to, but
equally I would like this disk to work.

Attached: dmesg.boot and output of 'atacontrol cap', fdisk and disklabel
for the offending drive.

Cheers,

    Scott

--
===========================================================================
Scott Mitchell           | PGP Key ID | "Eagles may soar, but weasels
Cambridge, England       | 0x54B171B9 |  don't get sucked into jet engines"
scott at fishballoon.org | 0xAA775B8B |              -- Anon

--YiEDa0DAkWCtVeE4
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="dmesg.boot"

Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.6.2-RELEASE-p26 #1: Thu Oct 9 09:44:39 BST 2003
    rsm@kokako:/usr/obj/usr/src/sys/KOKAKO
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (696.41-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x683 Stepping = 3
  Features=0x383fbff
real memory = 536805376 (524224K bytes)
avail memory = 517312512 (505188K bytes)
Preloaded elf kernel "kernel" at 0xc04d9000.
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 12 entries at 0xc00fdf00
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
pcib2: at device 1.0 on pci0
pci1: on pcib2
pcib3: at device 15.0 on pci1
pci2: on pcib3
fxp0: port 0x3000-0x303f mem 0xf4200000-0xf42fffff,0xf4300000-0xf4300fff irq 5 at device 7.0 on pci2
fxp0: Ethernet address 00:02:b3:16:7d:b9
inphy0: on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ahc0: port 0x2000-0x20ff mem 0xf4100000-0xf4100fff irq 11 at device 12.0 on pci0
aic7896/97: Ultra2 Wide Channel A, SCSI Id=7, 32/253 SCBs
ahc1: port 0x2400-0x24ff mem 0xf4101000-0xf4101fff irq 11 at device 12.1 on pci0
aic7896/97: Ultra2 Wide Channel B, SCSI Id=7, 32/253 SCBs
fxp1: port 0x2800-0x283f mem 0xf4000000-0xf40fffff,0xf4102000-0xf4102fff irq 10 at device 14.0 on pci0
fxp1: Ethernet address 00:d0:b7:89:92:a7
inphy1: on miibus1
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: at device 18.0 on pci0
isa0: on isab0
atapci0: port 0x2860-0x286f at device 18.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
uhci0: port 0x2840-0x285f irq 10 at device 18.2 on pci0
usb0: on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
intpm0: port 0x1040-0x104f irq 9 at device 18.3 on pci0
intpm0: I/O mapped 1040
intpm0: intr IRQ 9 enabled revision 0
smbus0: on intsmb0
smb0: on smbus0
intpm0: PM I/O mapped c00
pci0: at 20.0
pcib1: on motherboard
pci3: on pcib1
orm0: