From owner-freebsd-hackers@FreeBSD.ORG Sun Dec 19 00:05:47 2004
Message-ID: <41C4C659.8070605@comcast.net>
Date: Sat, 18 Dec 2004 19:07:53 -0500
From: Gary Corcoran
To: freebsd-hackers@freebsd.org
In-Reply-To: <1103408865.90538.29.camel@red.nativenerds.com>
Subject: Re: Multiple hard disk failures - coincidence ?

Ed Stover wrote:
> Have you run the low level disk tools from Maxtor on your failed drives?
> One day out of the blue my 80Gig maxtors started giving out hard error
> failures, so I downloaded a floppy image from maxtor and used it to scan
> and repair my drives. I rebooted in single user mode and fscked my
> drives and rescued the data from lostnfound. and everything has been Aok
> ever since.

Thanks to everyone who responded. I was doubtful that heat was the
problem, since my case is well cooled and there are fans blowing
directly over (most of) the drives, but I opened the case today and was
surprised. One of the two fans in the front of the case that blow
directly over five of the disks had completely *stopped*! And yes, the
disks behind the stopped fan were the ones giving me errors, and they
were extra warm (but not as toasty as my old SCSI drives in my
firewall!).

I don't know why or how it stopped. I nudged the fan to see if it had
seized up, and it moved easily and started spinning! I moved the front
panel fan control up to 'high' (from 'medium') and it started putting
out a nice flow of air over the disks. It has been cooling the drives
for a few hours now, and they seem to be back to 'normal' temperature,
but they are still showing exactly the same "hard error" sectors.

Unfortunately one of the drives has errors in sectors 96-103
(fsbn 255), so I can't even 'ls' the root directory. Are those sectors
likely to be part of the superblock (which hopefully has a backup on
disk?), or are they more likely part of the root directory?

Thanks for reminding me about the Maxtor disk tools. I downloaded the
latest version and am running it now to analyze the worst (no ls) drive
first.

Gary
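
For the superblock question above, a rough arithmetic check can be sketched as follows. It is only a sketch under stated assumptions: that the reported sector numbers are relative to the start of the filesystem partition, that sectors are 512 bytes, and that the primary superblock sits at the offsets FreeBSD's sys/ufs/ffs/fs.h defines (SBLOCK_UFS1 = 8192, SBLOCK_UFS2 = 65536, SBLOCKSIZE = 8192). The function names are made up for illustration. It says nothing about whether the bad sectors belong to the root directory.

```python
# Sketch: does a range of bad sectors overlap the primary UFS superblock?
# Assumptions (not confirmed by the message above): sector numbers are
# partition-relative and sectors are 512 bytes.

SECTOR_SIZE = 512   # assumed sector size in bytes
SBLOCKSIZE = 8192   # superblock size in bytes (fs.h)

# Byte offsets of the primary superblock from the start of the partition (fs.h).
SBLOCK_OFFSETS = {"UFS1": 8192, "UFS2": 65536}


def superblock_sectors(offset, sector_size=SECTOR_SIZE):
    """Return (first, last) sector covered by a superblock at byte `offset`."""
    first = offset // sector_size
    last = (offset + SBLOCKSIZE - 1) // sector_size
    return first, last


def overlaps(a_first, a_last, b_first, b_last):
    """True if two inclusive sector ranges share at least one sector."""
    return a_first <= b_last and b_first <= a_last


def check(bad_first, bad_last):
    """Map each filesystem type to whether the bad range hits its superblock."""
    return {
        name: overlaps(bad_first, bad_last, *superblock_sectors(offset))
        for name, offset in SBLOCK_OFFSETS.items()
    }


if __name__ == "__main__":
    for name, hit in check(96, 103).items():
        first, last = superblock_sectors(SBLOCK_OFFSETS[name])
        print(f"{name}: superblock in sectors {first}-{last}, overlap: {hit}")
```

Under those assumptions the primary superblock would occupy sectors 16-31 (UFS1) or 128-143 (UFS2), so the range 96-103 would not touch it. If the sector numbers are actually relative to the whole disk, the partition's starting offset has to be subtracted first.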