From owner-freebsd-stable@FreeBSD.ORG Sun Aug 3 13:18:09 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 89E35106567A for ; Sun, 3 Aug 2008 13:18:09 +0000 (UTC) (envelope-from jdc@parodius.com) Received: from mx01.sc1.parodius.com (mx01.sc1.parodius.com [72.20.106.3]) by mx1.freebsd.org (Postfix) with ESMTP id 7882C8FC1B for ; Sun, 3 Aug 2008 13:18:09 +0000 (UTC) (envelope-from jdc@parodius.com) Received: by mx01.sc1.parodius.com (Postfix, from userid 1000) id 5AD4B1CC0B8; Sun, 3 Aug 2008 06:18:09 -0700 (PDT) Date: Sun, 3 Aug 2008 06:18:09 -0700 From: Jeremy Chadwick To: Eugene Butusov Message-ID: <20080803131809.GA65161@eos.sc1.parodius.com> References: <4894CE6D.2000204@gmail.com> <20080802215814.GA20164@eos.sc1.parodius.com> <4894DC4C.7030001@gmail.com> <20080802230626.GA24435@eos.sc1.parodius.com> <48958A31.1000305@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <48958A31.1000305@gmail.com> User-Agent: Mutt/1.5.18 (2008-05-17) Cc: FreeBSD-STABLE-LIST Subject: Re: 7-STABLE, gjournal and fsck. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 03 Aug 2008 13:18:09 -0000 On Sun, Aug 03, 2008 at 12:36:33PM +0200, Eugene Butusov wrote: > Jeremy Chadwick wrote: >> On Sun, Aug 03, 2008 at 12:14:36AM +0200, Eugene Butusov wrote: >>> 2) smartctl -a /dev/ad4 >>> ... >>> 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 >>> 200 Multi_Zone_Error_Rate 0x0008 100 253 051 Old_age Offline - 0 >>> ... >> >> The other SMART stats look okay, but can you run an offline test which >> should update counters 198 and 200? smartctl -t offline /dev/ad4 should >> do the trick. It may take some time, especially if the disk is being >> used. >> >> And for clarification: just because the test is called "offline" does >> not mean it brings the disk offline (it doesn't). :-) > > I did smartctl -t offline /dev/ad4, here is the results: > > 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always > - 0 > 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline > - 0 > 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always > - 0 > 200 Multi_Zone_Error_Rate 0x0008 200 200 051 Old_age Offline > - 0 > > I think the problem lies in fsck itself. Somehow it is was unable to > deal with journaled filesystems It has failed to read them and mark them > as clean. Yep, I agree, SMART-wise your disk looks fine. It's worth noting that there's been reports of the ATA subsystem reporting I/O errors on LBAs which don't even appear to be valid. I'm left wondering if somehow that's what's happening here. I'm not sure why fsck would handle most of the gjournal'd blocks but not others. -- | Jeremy Chadwick jdc at parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |