From owner-freebsd-current@FreeBSD.ORG Sat Jun 13 15:13:13 2009 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6EA38106566B; Sat, 13 Jun 2009 15:13:13 +0000 (UTC) (envelope-from serenity@exscape.org) Received: from ch-smtp01.sth.basefarm.net (ch-smtp01.sth.basefarm.net [80.76.149.212]) by mx1.freebsd.org (Postfix) with ESMTP id F0B4D8FC15; Sat, 13 Jun 2009 15:13:12 +0000 (UTC) (envelope-from serenity@exscape.org) Received: from c83-253-252-234.bredband.comhem.se ([83.253.252.234]:59868 helo=mx.exscape.org) by ch-smtp01.sth.basefarm.net with esmtp (Exim 4.69) (envelope-from ) id 1MFUul-0003IQ-68; Sat, 13 Jun 2009 17:13:05 +0200 Received: from [192.168.1.5] (macbookpro [192.168.1.5]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by mx.exscape.org (Postfix) with ESMTPSA id 04E5B6AC12; Sat, 13 Jun 2009 17:13:03 +0200 (CEST) Message-Id: From: Thomas Backman To: Pawel Jakub Dawidek In-Reply-To: <20090613150627.GB1848@garage.freebsd.pl> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v935.3) Date: Sat, 13 Jun 2009 17:13:00 +0200 References: <920A69B1-4F06-477E-A13B-63CC22A13120@exscape.org> <3c1674c90906121401s19105167vf4535566321b45de@mail.gmail.com> <20090613150627.GB1848@garage.freebsd.pl> X-Mailer: Apple Mail (2.935.3) X-Originating-IP: 83.253.252.234 X-Scan-Result: No virus found in message 1MFUul-0003IQ-68. X-Scan-Signature: ch-smtp01.sth.basefarm.net 1MFUul-0003IQ-68 e6c89ce1cbc939550c23aa3c0065ffd8 Cc: freebsd-fs@freebsd.org, Kip Macy , FreeBSD Current Subject: Re: ZFS: Silent/hidden errors, nothing logged anywhere X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 13 Jun 2009 15:13:14 -0000 On Jun 13, 2009, at 05:06 PM, Pawel Jakub Dawidek wrote: > On Fri, Jun 12, 2009 at 02:01:57PM -0700, Kip Macy wrote: >> On Fri, Jun 12, 2009 at 10:32 AM, Thomas >> Backman wrote: >>> OK, so I filed a PR late May (kern/135050): >>> http://www.freebsd.org/cgi/query-pr.cgi?pr=135050 . >>> I don't know if this is a "feature" or a bug, but it really should >>> be >>> considered the latter. The data could be repaired in the >>> background without >>> the user ever knowing - until the disk dies completely. I'd prefer >>> to have >>> warning signs (i.e. checksum errors) so that I can buy a >>> replacement drive >>> *before* that. >>> >>> Not only does this mean that errors can go unnoticed, but also >>> that it's >>> impossible to figure out which disk is broken, if ZFS has >>> *temporarily* >>> repaired the broken data! THAT is REALLY bad! >>> Is this something that we can expect to see changed before 8.0- >>> RELEASE? >> >> >> I'm fairly certain that we've discussed this already. Solaris uses >> FMA >> - I don't think that I'll get to a "real fix" any time soon. The time >> that I do have will go to addressing stability problems (memory >> over-allocation, NFS interaction, control directory mounts) all of >> which cause panics. Maintaining them persistently in the label >> doesn't >> make sense - when do you drop them? Would a simple log message >> about >> the number of checksum errors suffice? > > We do log such errors. Solaris uses FMA and for FreeBSD I use devd. > You > can find the following entry in /etc/devd.conf: > ... > If you see nothing in your logs, there must be a bug with reporting > the > problem somewhere or devd is not running (it should be enabled by > default). Awesome! After checking further I did indeed find a bunch of such messages in messages.0.bz2. One thing less to worry about, I guess. :) Regards, Thomas