From owner-freebsd-fs@FreeBSD.ORG  Fri Oct 15 21:43:12 2010
Return-Path: <owner-freebsd-fs@FreeBSD.ORG>
Delivered-To: freebsd-fs@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id 52C08106566C
	for <freebsd-fs@freebsd.org>; Fri, 15 Oct 2010 21:43:12 +0000 (UTC)
	(envelope-from jdc@koitsu.dyndns.org)
Received: from qmta10.westchester.pa.mail.comcast.net
	(qmta10.westchester.pa.mail.comcast.net [76.96.62.17])
	by mx1.freebsd.org (Postfix) with ESMTP id F07C88FC12
	for <freebsd-fs@freebsd.org>; Fri, 15 Oct 2010 21:43:11 +0000 (UTC)
Received: from omta16.westchester.pa.mail.comcast.net ([76.96.62.88])
	by qmta10.westchester.pa.mail.comcast.net with comcast
	id JyTc1f0021uE5Es5A9jC0D; Fri, 15 Oct 2010 21:43:12 +0000
Received: from koitsu.dyndns.org ([98.248.41.155])
	by omta16.westchester.pa.mail.comcast.net with comcast
	id K9jB1f0023LrwQ23c9jB5S; Fri, 15 Oct 2010 21:43:12 +0000
Received: by icarus.home.lan (Postfix, from userid 1000)
	id DFC389B418; Fri, 15 Oct 2010 14:43:09 -0700 (PDT)
Date: Fri, 15 Oct 2010 14:43:09 -0700
From: Jeremy Chadwick <freebsd@jdc.parodius.com>
To: Scott Charron <shewless@unleashed-web.org>
Message-ID: <20101015214309.GA19087@icarus.home.lan>
References: <AANLkTi=RMKLpVzv48Pu1Wk=afCL0NU-v-D0p2_A5JdNe@mail.gmail.com>
	<201010151932.o9FJWXb3066388@lurza.secnetix.de>
	<AANLkTi=jcf7vAAc6yPurpjrLP9tVxVtOkdrV7ZSdc7HM@mail.gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <AANLkTi=jcf7vAAc6yPurpjrLP9tVxVtOkdrV7ZSdc7HM@mail.gmail.com>
User-Agent: Mutt/1.5.21 (2010-09-15)
Cc: freebsd-fs@freebsd.org
Subject: Re: consistent file system inconsistencies (tried replacing drive)
X-BeenThere: freebsd-fs@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Filesystems <freebsd-fs.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-fs>,
	<mailto:freebsd-fs-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-fs>
List-Post: <mailto:freebsd-fs@freebsd.org>
List-Help: <mailto:freebsd-fs-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-fs>,
	<mailto:freebsd-fs-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Fri, 15 Oct 2010 21:43:12 -0000

On Fri, Oct 15, 2010 at 05:19:22PM -0400, Scott Charron wrote:
> >  > Also I was under the impression soft-updates would actually require
> >  > a little more disk access time and thus make the problem slightly
> >  > worse.
> >
> > No, soft-updates doesn't require more disk access time in
> > general.  It caches and re-orders meta data updates, so it
> > can even save disk access time.  But the important thing
> > is that soft-updates re-orders the meta data updates in a
> > way that guarantees that it is in a consistent state at
> > any time (provided that the disk's firware cooperates
> > correctly).  This means that there won't be _unexpected_
> > inconsistencies after a crash, and fsck will be able to
> > run without user-intervention.  (NB:  If you want to avoid
> > fsck completely, you will have to use journalling, or go
> > to a ZFS-only system without any UFS file systems.)
> 
> Maybe I should just go to ZFS... it's fully supported now even for
> root right? Will that be more robust against power outages??

IMHO, no.  Some would argue that a ZFS mirror or raidz pool would be
sufficient, and I disagree.  People need to remember the situation:
*power is being completely lost to the entire system abruptly*.  Not
just one device (e.g. yanking a disk off the bus without warning), but
literally everything.

A ZFS mirror would require 2 or more disks (or devices; that could mean
slice or partition), while ZFS raidz would require 3 or more.  A single
disk or device using ZFS gets you the ability to detect when there's
read/write errors, but no way to fix them.

I don't think it matters anyway because......

> > If you still get unexpected inconsistencies even though
> > you use soft-updates everywhere, then something else must
> > be wrong.  Maybe your hard disk doesn't play along nicely.
> > The usual recommendation is to disable the write-cache
> > on hard disks.  This will make your system slower, though.
> 
> Remember I'm using a USB stick here :)

Which makes this entire thread even more crazy.

The only thing I think that might help you "more" (but again, absolutely
no guarantee of filesystem integrity) would be to mount all of your
filesystems with the "sync" option.  Please see mount(8).

Be aware this will make I/O very, very, very slow.  It will induce a
flush-to-disk on every single write operation as I understand it.

You should really just solve the actual problem of power being lost
abruptly.

> > If you see filesystem problems on your non-root filesystem as well, e.g.
> > ones with SU (soft-updates) applied, I would recommend setting
> > background_fsck="no" in your /etc/rc.conf.  There are some old threads
> > documenting how background filesystem checks don't always fix all
> > problems before the system starts actually using the filesystem.  There
> > were reports of people finding that manual fsck would detect issues that
> > background fsck wouldn't fix.  YMMV.
> 
> Is this recommended? Should I schedule regular manual fscks?

The thread is long, and you should take some time to read it in full
(please do not skim it, and please read what Kirk wrote).  The original
thread ("panic: handle_written_inodeblock: bad size") thread gets split
off into a sub-thread ("background fsck considered harmful?").  This is
the best I can find on short notice:

http://lists.freebsd.org/pipermail/freebsd-stable/2010-July/thread.html#57849
http://lists.freebsd.org/pipermail/freebsd-stable/2010-July/thread.html#57876
http://lists.freebsd.org/pipermail/freebsd-fs/2010-July/thread.html#8927

It's recommended (by myself and some others) because it's been proven
background fsck doesn't always repair errors.  Some feel the opinion is
too harsh, and that's a legitimate opinion.  The "solution" is to "use
journalled soft-updates when they become available in FreeBSD 9.0".

But again, I'll take a moment to recommend that you solve the power loss
problem altogether by investing in a UPS.  Sudden power loss is going to
blow/fry your hardware eventually (especially if you experience
repetitive brown outs like we do here -- literally "flickering power"
outages where power is rapidly being toggled on/off at 0.5s intervals
over the course of 15-20 seconds.  I *have* had consumer home appliances
die permanently because of this.  Let's hear it for Silicon Valley power
grids...)

-- 
| Jeremy Chadwick                                   jdc@parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.              PGP: 4BD6C0CB |