Date: Tue, 22 Jan 2013 18:36:41 +1100
From: Peter Jeremy <peter@rulingia.com>
To: Wojciech Puchar
Cc: freebsd-fs, FreeBSD Hackers
Subject: Re: ZFS regimen: scrub, scrub, scrub and scrub again.
Message-ID: <20130122073641.GH30633@server.rulingia.com>

On 2013-Jan-21 12:12:45 +0100, Wojciech Puchar wrote:
>That's why i use properly tuned UFS, gmirror, and prefer not to use
>gstripe but have multiple filesystems

When I started using ZFS, I didn't fully trust it, so I kept a
gmirrored UFS root (including a full src tree). Over time, I found
that gmirror plus UFS was giving me more problems than ZFS. In
particular, I was seeing behaviour suggesting that the mirrors were
out of sync, even though gmirror insisted they were in sync.
Unfortunately, there is no way to get gmirror to verify that the
mirrors actually match, or to get UFS to check the correctness of
data or metadata (fsck can only check metadata consistency). I've
since moved to a ZFS root.
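Concretely (a sketch, not a transcript from my machines; the provider
name "gm0" and pool name "tank" are only examples):

  # gmirror reports each component's state, but it has no subcommand
  # that actually compares the components' contents:
  gmirror status gm0

  # ZFS can be asked to read every allocated block and verify it
  # against its checksum, repairing from redundancy if a block
  # fails verification:
  zpool scrub tank
  zpool status -v tank   # scrub progress plus READ/WRITE/CKSUM error counts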
>Which is marketing, not truth. If you want bullet-proof recoverability,
>UFS beats everything i've ever seen.

I've seen the opposite. One big difference is that ZFS is designed to
ensure it returns the data that was written to it, whereas UFS simply
returns whatever bytes it finds where it thinks it wrote your data.
One side effect of this is that ZFS is far fussier about hardware
quality: since it checksums everything, it is likely to pick up
glitches that UFS doesn't notice.

>If you want FAST crash recovery, use softupdates+journal, available in
>FreeBSD 9.

I'll admit that I haven't used SU+J, but one downside of SU+J is that
it prevents the use of snapshots, which in turn prevents the (safe)
use of dump(8), the official tool for UFS backups, on live
filesystems.

>> of fuss. Even if you dislodge a drive ... so that it's missing the last
>> 'n' transactions, ZFS seems to figure this out (which I thought was extra
>> cudos).
>
>Yes this is marketing. practice is somehow different. as you discovered
>yourself.

Most of the time, this works as designed. It's possible there are
bugs in the implementation.

>While RAID-Z is already a king of bad performance,

I don't believe RAID-Z is any worse than RAID5. Do you have any
actual measurements to back up your claim?

> i assume you mean two POOLS, not 2 RAID-Z sets. if you mixed 2 different
>RAID-Z pools you would spread load unevenly and make performance even
>worse.

There's no real reason why you couldn't have two different vdevs in
the same pool.

>> A full scrub of my drives weighs in at 36 hours or so.
>
>which is funny as ZFS is marketed as doing this efficient (like checking
>only used space).

It _does_ only check used space, but it does so in logical order
rather than physical order. For a fragmented pool, this means random
accesses.

>Even better - use UFS.

Then you'll never know that your data has been corrupted.

>For both bullet proof recoverability and performance.

Use ZFS.

-- 
Peter Jeremy