From owner-freebsd-questions@FreeBSD.ORG Mon Jun 16 12:56:27 2008
Date: Mon, 16 Jun 2008 08:55:34 -0400
From: Bill Moran
To: z.szalbot@lc-words.com
Cc: freebsd-questions@freebsd.org
Subject: Re: RAID 1 / disk error / Offline uncorrectable sectors
Message-Id: <20080616085534.ca7524f8.wmoran@potentialtech.com>
In-Reply-To: <48565E7A.50807@lc-words.com>
References: <48565E7A.50807@lc-words.com>

In response to Zbigniew Szalbot:
>
> A couple of days ago smartd let me know about a disk problem.
>
> Jun 14 01:13:38 relay kernel: ad12: FAILURE - READ_DMA48
> status=51 error=40 LBA=374468863
> Jun 14 01:13:38 relay kernel: ar0: WARNING - mirror protection lost.
> RAID1 array in DEGRADED mode
> Jun 14 01:14:19 relay kernel: ad12: WARNING - WRITE_DMA taskqueue
> timeout - completing request directly
> Jun 14 01:14:19 relay kernel: ad12: WARNING - WRITE_DMA48 freeing
> taskqueue zombie request
> Jun 14 01:37:38 relay smartd[683]: Device: /dev/ad12, 1 Currently
> unreadable (pending) sectors
> Jun 14 01:37:38 relay smartd[683]: Device: /dev/ad12, 1 Offline
> uncorrectable sectors
>
> If I do smartctl -a /dev/ad12 I get
>
> 197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always   -  1
> 198 Offline_Uncorrectable   0x0010  100  100  000  Old_age  Offline  -  1
>
> My understanding is that RAID 1 no longer works because of this error.
> There is a bad sector on HD (Offline uncorrectable sectors) and the best
> we can do is replace the drive? Does it make sense to try to turn RAID 1
> on ignoring this error (however, this is done in BIOS so the machine
> would have to be taken down in order to do that)? It seems serious
> enough for me not to ignore it but then I know close to nothing about HDs.

Replace the hard drive.

Every modern hard drive keeps extra space available to "remap" bad
sectors.  This happens magically behind the scenes without you ever
knowing about it.  Once you've hit "uncorrectable" errors, it means
your re-mappable sectors are used up, and that means the drive is on
its last legs.

--
Bill Moran
http://www.potentialtech.com
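
For anyone who wants to watch for the two counters quoted above without
reading the full report by hand, here is a rough Python sketch. It is
not part of the original exchange. It assumes the attribute rows use the
usual ten-column smartctl layout (ID, name, flag, value, worst, thresh,
type, updated, when-failed, raw value), as in the quoted output. The
function names and the WATCHED table are made up for illustration.

```python
# Attribute IDs and names taken from the smartctl output quoted in this thread.
WATCHED = {197: "Current_Pending_Sector", 198: "Offline_Uncorrectable"}


def parse_attributes(text):
    """Return {attribute_id: raw_value} for smartctl-style attribute rows.

    Assumes ten whitespace-separated columns per row, with the raw value
    in the tenth column. Header lines and anything else are skipped.
    """
    attrs = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 10 or not fields[0].isdigit():
            continue  # not an attribute row
        if not fields[9].isdigit():
            continue  # raw value is not a plain integer; ignore in this sketch
        attrs[int(fields[0])] = int(fields[9])
    return attrs


def bad_sector_warnings(text):
    """List the watched attributes whose raw count is above zero."""
    attrs = parse_attributes(text)
    return [
        f"{name}: {attrs[attr_id]}"
        for attr_id, name in WATCHED.items()
        if attrs.get(attr_id, 0) > 0
    ]


# The two rows from the original post, rejoined onto single lines.
SAMPLE = """\
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always   -  1
198 Offline_Uncorrectable   0x0010  100  100  000  Old_age  Offline  -  1
"""

if __name__ == "__main__":
    for warning in bad_sector_warnings(SAMPLE):
        print(warning)
```

In practice the text would come from running smartctl against the disk
(/dev/ad12 in this thread) and passing its output to
bad_sector_warnings(). Any non-empty result is a reason to plan a
replacement, as advised above.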