From: David Newman
Organization: Network Test Inc.
Date: Sat, 10 Nov 2007 17:22:06 -0800
To: freebsd-questions@freebsd.org
Subject: dealing with a failing drive
Message-ID: <4736593E.1090905@networktest.com>
List-Id: User questions

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

I'd welcome suggestions on how (or whether) to try to revive a SCSI
drive that's failing. This is on FreeBSD 6.2-RELENG on a Compaq Proliant
DL320, onboard RAID and two SCSI drives in a RAID1 array.

Today this system rebooted and hung on Compaq's "what do you want the
RAID controller to do?" message. I told it to fix any errors.

When I brought the system back up (after running fsck in single-user
mode), the log had lots of errors like this:

  Nov 10 09:00:40 mail kernel: ida0: hard write error
  Nov 10 09:00:40 mail kernel: ida0: invalid request
  Nov 10 09:01:48 mail last message repeated 35 times
  Nov 10 09:03:49 mail last message repeated 571 times
  Nov 10 09:12:27 mail last message repeated 796 times

I vaguely remember trying about a year ago to load a SMART utility from
the ports collection but it wouldn't work on drives in a RAID array.

Is there some other way to:

a) diagnose/fix the errant disk here?

b) monitor the health of disks on a Compaq controller so it doesn't get
to this point to begin with?

thanks in advance

dn

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (Darwin)

iD8DBQFHNlk+yPxGVjntI4IRAntlAJ9FWA2ez+BdnViq7mrIpkLBTLm/CgCfRyEA
czDvMn6+8KjlI3V0iBG4U3I=
=36+k
-----END PGP SIGNATURE-----
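
As a possible stopgap for (b), the kernel messages quoted above could at least be tallied from syslog so that a rising error count gets noticed early. The following is only a rough sketch, assuming log lines in exactly the format shown in this message; the function name `count_ida_errors` and the sample data are illustrative, and this does not query the controller or the drives themselves.

```python
import re

# Matches kernel lines such as:
#   Nov 10 09:00:40 mail kernel: ida0: hard write error
ERROR_RE = re.compile(r"kernel: (ida\d+): (.+)$")

# Matches syslog's compression of identical consecutive lines:
#   Nov 10 09:01:48 mail last message repeated 35 times
REPEAT_RE = re.compile(r"last message repeated (\d+) times?$")


def count_ida_errors(lines):
    """Return {(device, message): count}, expanding 'repeated N times' lines.

    Hypothetical helper: a repeat line is credited to the most recent
    ida message only when no unrelated line came in between.
    """
    counts = {}
    last_key = None  # key of the most recent ida message, if any
    for line in lines:
        line = line.rstrip()
        m = ERROR_RE.search(line)
        if m:
            last_key = (m.group(1), m.group(2))
            counts[last_key] = counts.get(last_key, 0) + 1
            continue
        r = REPEAT_RE.search(line)
        if r and last_key is not None:
            counts[last_key] += int(r.group(1))
        else:
            last_key = None  # an unrelated line breaks the repeat chain
    return counts


# The five log lines quoted in the message, used as sample input.
SAMPLE = [
    "Nov 10 09:00:40 mail kernel: ida0: hard write error",
    "Nov 10 09:00:40 mail kernel: ida0: invalid request",
    "Nov 10 09:01:48 mail last message repeated 35 times",
    "Nov 10 09:03:49 mail last message repeated 571 times",
    "Nov 10 09:12:27 mail last message repeated 796 times",
]

sample_counts = count_ida_errors(SAMPLE)
for (device, message), n in sorted(sample_counts.items()):
    print(f"{device}: {message}: {n}")
```

The same function could be fed an open log file (on a default FreeBSD install, /var/log/messages) from a periodic job that sends mail when any count is nonzero. That would only report errors after the driver has logged them, so it complements rather than replaces a tool that can read the controller's own status.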