From owner-freebsd-questions@FreeBSD.ORG Fri Mar 19 12:01:11 2004 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id C216F16A4D0 for ; Fri, 19 Mar 2004 12:01:11 -0800 (PST) Received: from clunix.cl.msu.edu (clunix.cl.msu.edu [35.9.2.10]) by mx1.FreeBSD.org (Postfix) with ESMTP id 52ED543D1F for ; Fri, 19 Mar 2004 12:01:11 -0800 (PST) (envelope-from jerrymc@clunix.cl.msu.edu) Received: (from jerrymc@localhost) by clunix.cl.msu.edu (8.11.7p1+Sun/8.11.7) id i2JK0cU25855; Fri, 19 Mar 2004 15:00:38 -0500 (EST) From: Jerry McAllister Message-Id: <200403192000.i2JK0cU25855@clunix.cl.msu.edu> To: joey@mingrone.org (Joey Mingrone) Date: Fri, 19 Mar 2004 15:00:38 -0500 (EST) In-Reply-To: <200403191547.05545.joey@mingrone.org> from "Joey Mingrone" at Mar 19, 2004 03:45:21 PM X-Mailer: ELM [version 2.5 PL2] MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit cc: freebsd-questions@freebsd.org Subject: Re: drive / IDE controller questions X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 19 Mar 2004 20:01:11 -0000 > > Hi, > > I've been having problems with my western digital drive > (http://www.wdc.com/en/products/products.asp?DriveID=24) for about a year. > I've posted messages about this before, but I have some new information that > may be relevant. > > The problem started out when I would randomly hear the drive restarting. It > would make a high pitch sound that would increase in pitch and last a second > or two (The same sound the drive makes when you power on the system). > Messages similar to the ones below would also appear in /var/log/messages: > > ad0: WRITE command timeout tag=0 serv=0 - resetting > /kernel: ata0: resetting devices .. ata0-slave: ATA identify retries exceeded > > ad0: READ command timeout tag=0 serv=0 - resetting > > At first, these drive resets would only happen once or twice and everything > would go back to normal. Eventually they would happen over and over until > the system became unusable and wouldn't boot up even after running fsck in > singal user mode with all the partitions unmount or mounted read only. I > found a temporary solution was to run the western digital diagnostic tools > and then run fsck. I would usually have a small amount of data > corruption/loss but the system would be usable again. Things would be fine > for a week or longer, but the problem would eventually reappear. Finally I > contacted WD and they sent me a replacement drive (It was the same model, but > a refurbished drive that I'm guessing was sent back by someone else). After > a few tries I managed to get dd to copy the contents of the old drive to the > new drive and for quite some time everything was great. But, the other day > the new drive began to have the same problem as the old one. > > So now, I'm guessing the problem may lie with the controller on the main board > (http://usa.asus.com/mb/socketa/a7v266-e/overview.htm). Another early guess might also be overheating problems. I am not sure how you would check other than trying to make it cooler. ////jerry > > My main board has the optional raid controller. Would it be possible to use > that controller with my current drive, or would the secondary controller be a > better solution? > > Other notes: > > I've tried a few different IDE cables. > I don't think it has anything to do with the OS since the resets have happened > before the OS starts to boot and in the BIOS. > > I've included a uname -a, dmesg output and my kernel config file below. Any > tips would be much appreciated. > > Thanks, > > Joey > > uname -a: > FreeBSD *hostname* 4.9-RELEASE-p2 FreeBSD 4.9-RELEASE-p2 #0: Mon Feb 9 > 03:08:50 AST 2004 > root@*hostname*:/usr/obj/usr/src/sys/KERNEL_CONFIG_2004-02-09 i386 > > dmesg: > Copyright (c) 1992-2003 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD 4.9-RELEASE-p2 #0: Mon Feb 9 03:08:50 AST 2004 > root@*hostname*:/usr/obj/usr/src/sys/2004-02-09 > Timecounter "i8254" frequency 1193182 Hz > CPU: AMD Athlon(TM) XP1600+ (1410.21-MHz 686-class CPU) > Origin = "AuthenticAMD" Id = 0x662 Stepping = 2 > Features=0x383f9ff > AMD Features=0xc0480000 > real memory = 536788992 (524208K bytes) > avail memory = 518852608 (506692K bytes) > Preloaded elf kernel "kernel" at 0xc0364000. > Preloaded userconfig_script "/boot/kernel.conf" at 0xc036409c. > Pentium Pro MTRR support enabled > md0: Malloc disk > Using $PIR table, 10 entries at 0xc00f1480 > apm0: on motherboard > apm0: found APM BIOS v1.2, connected at v1.2 > npx0: on motherboard > npx0: INT 16 interface > pcib0: on motherboard > pci0: on pcib0 > agp0: mem 0xf8000000-0xfbffffff at device 0.0 > on pci0 > pcib1: at device 1.0 on pci0 > pci1: on pcib1 > pci1: at 0.0 irq 11 > atapci0: port > 0xb400-0xb43f,0xb800-0xb803,0xd000-0xd007,0xd400-0xd403,0xd800-0xd807 mem > 0xf3800000-0xf381ffff irq 5 at device 6.0 on pci0 > ata2: at 0xd800 on atapci0 > ata3: at 0xd000 on atapci0 > pcm0: port 0xb000-0xb01f irq 10 at device 15.0 on pci0 > pcm0: > rl0: port 0xa400-0xa4ff mem 0xf3000000-0xf30000ff > irq 5 at device 16.0 on pci0 > rl0: Ethernet address: 00:e0:4c:00:30:22 > miibus0: on rl0 > rlphy0: on miibus0 > rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto > isab0: at device 17.0 on pci0 > isa0: on isab0 > atapci1: port 0xa000-0xa00f irq 0 at device 17.1 > on pci0 > ata0: at 0x1f0 irq 14 on atapci1 > ata1: at 0x170 irq 15 on atapci1 > uhci0: port 0x9800-0x981f irq 5 at device 17.2 on > pci0 > usb0: on uhci0 > usb0: USB revision 1.0 > uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub0: 2 ports with 2 removable, self powered > uhci1: port 0x9400-0x941f irq 5 at device 17.3 on > pci0 > usb1: on uhci1 > usb1: USB revision 1.0 > uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub1: 2 ports with 2 removable, self powered > uhci2: port 0x9000-0x901f irq 5 at device 17.4 on > pci0 > usb2: on uhci2 > usb2: USB revision 1.0 > uhub2: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > uhub2: 2 ports with 2 removable, self powered > orm0: