Date:      Mon, 12 Nov 2007 11:03:47 -0500
From:      Jerry McAllister <jerrymc@msu.edu>
To:        David Newman <dnewman@networktest.com>
Cc:        freebsd-questions@freebsd.org
Subject:   Re: dealing with a failing drive
Message-ID:  <20071112160347.GA98697@gizmo.acns.msu.edu>
In-Reply-To: <4736593E.1090905@networktest.com>
References:  <4736593E.1090905@networktest.com>

On Sat, Nov 10, 2007 at 05:22:06PM -0800, David Newman wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> I'd welcome suggestions on how (or whether) to try to revive a SCSI
> drive that's failing.

To answer 'whether': don't. Get your data off it as soon as
possible, and wipe it if it holds anything sensitive at all.

If it is part of a mirror or RAID 5 array, you should be able to just
replace it; otherwise, back it up immediately and quit using it.

Generally, once you start seeing hard errors regularly, the drive
is on its last legs. The errors only increase. You may be
able to do things to get past this one error, but more will be
coming.

So, the answer to 'how' is also: don't.
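
To see how fast the errors are piling up while you copy the data off,
you can tally the ida0 lines from the log. The following is only a rough
Python sketch, not anything the driver or syslogd provides: the function
name count_ida_errors and the SAMPLE text are made up for illustration
(SAMPLE is copied from the log excerpt quoted below), and it assumes each
"last message repeated N times" line refers to the kernel message logged
immediately before it.

```python
import re

# Matches kernel lines such as:
#   Nov 10 09:00:40 mail kernel: ida0: hard write error
KERNEL_RE = re.compile(r"kernel: (ida\d+): (.+)$")
# Matches syslogd's summary of duplicate lines:
#   Nov 10 09:01:48 mail last message repeated 35 times
REPEAT_RE = re.compile(r"last message repeated (\d+) times?$")

# Sample input taken from the log excerpt in the quoted message.
SAMPLE = """\
Nov 10 09:00:40 mail kernel: ida0: hard write error
Nov 10 09:00:40 mail kernel: ida0: invalid request
Nov 10 09:01:48 mail last message repeated 35 times
Nov 10 09:03:49 mail last message repeated 571 times
Nov 10 09:12:27 mail last message repeated 796 times
"""


def count_ida_errors(lines):
    """Return {(device, message): count}, expanding 'repeated N times' lines."""
    counts = {}
    last_key = None  # the most recent ida message, if still "current"
    for line in lines:
        line = line.rstrip()
        m = KERNEL_RE.search(line)
        if m:
            last_key = (m.group(1), m.group(2))
            counts[last_key] = counts.get(last_key, 0) + 1
            continue
        m = REPEAT_RE.search(line)
        if m and last_key is not None:
            # Assumption: the repeat count belongs to the preceding ida line.
            counts[last_key] += int(m.group(1))
        else:
            # Some unrelated line; later repeats no longer refer to ida.
            last_key = None
    return counts


if __name__ == "__main__":
    for (dev, msg), n in sorted(count_ida_errors(SAMPLE.splitlines()).items()):
        print(f"{dev}: {msg}: {n}")
```

On the sample this gives 1 hard write error and 1403 invalid requests
(1 + 35 + 571 + 796). To run it on a live system, pass it the open log
file instead of SAMPLE; /var/log/messages is the usual default on
FreeBSD, but check syslog.conf on your machine.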

////jerry

> 
> This is on FreeBSD 6.2-RELENG on a Compaq Proliant DL320, onboard RAID
> and two SCSI drives in a RAID1 array.
> 
> Today this system rebooted and hung on Compaq's "what do you want the
> RAID controller to do?" message. I told it to fix any errors.
> 
> When I brought the system back up (after running fsck in single-user
> mode), the log had lots of errors like this:
> 
> Nov 10 09:00:40 mail kernel: ida0: hard write error
> Nov 10 09:00:40 mail kernel: ida0: invalid request
> Nov 10 09:01:48 mail last message repeated 35 times
> Nov 10 09:03:49 mail last message repeated 571 times
> Nov 10 09:12:27 mail last message repeated 796 times
> 
> I vaguely remember trying about a year ago to load a SMART utility from
> the ports collection but it wouldn't work on drives in a RAID array.
> 
> Is there some other way to:
> 
> a) diagnose/fix the errant disk here?
> b) monitor the health of disks on a Compaq controller so it doesn't get
> to this point to begin with?
> 
> thanks in advance
> 
> dn
> 
> 
> 
> 
> 
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.1 (Darwin)
> 
> iD8DBQFHNlk+yPxGVjntI4IRAntlAJ9FWA2ez+BdnViq7mrIpkLBTLm/CgCfRyEA
> czDvMn6+8KjlI3V0iBG4U3I=
> =36+k
> -----END PGP SIGNATURE-----


