Date: Thu, 07 Aug 2014 03:31:43 -0500 From: Scott Bennett <bennett@sdf.org> To: freebsd@qeng-ho.org Cc: freebsd-questions@freebsd.org Subject: Re: gvinum raid5 vs. ZFS raidz Message-ID: <201408070831.s778VhJc015365@sdf.org> In-Reply-To: <53E1FF5F.1050500@qeng-ho.org> References: <201408020621.s726LsiA024208@sdf.org> <alpine.BSF.2.11.1408020356250.1128@wonkity.com> <53DCDBE8.8060704@qeng-ho.org> <201408060556.s765uKJA026937@sdf.org> <53E1FF5F.1050500@qeng-ho.org>
next in thread | previous in thread | raw e-mail | index | archive | help
Arthur Chance <freebsd@qeng-ho.org> wrote: > On 06/08/2014 06:56, Scott Bennett wrote: > > Arthur Chance <freebsd@qeng-ho.org> wrote: > >> > >> [stuff deleted --SB] > > I wonder if what varies is the amount of space taken up by the > > checksums. If there's a checksum for each block, then the block size > > would change the fraction of the space lost to checksums, and the parity > > for the checksums would thus also change. Enough to matter? Maybe. > > I'm not a file system guru, but my (high level) understanding is as > follows. Corrections from anyone more knowledgeable welcome. > > 1. UFS and ZFS both use tree structures to represent files, with the > data stored at the leaves and bookkeeping stored in the higher nodes. > Therefore the overhead scales as the log of the data size, which is a > negligible fraction for any sufficiently large amount of data. > > 2. UFS doesn't have data checksums, it relies purely on the hardware > checksums. (This is the area I'm least certain of.) What hardware checksums are there? I wasn't aware that this sort of hardware kept any. > > 3. ZFS keeps its checksums in a Merkel tree > (http://en.wikipedia.org/wiki/Merkle_tree) so the checksums are held in > the bookkeeping blocks, not in the data blocks. This simply changes the > constant multiplier in front of the logarithm for the overhead. Also, I > believe ZFS doesn't use fixed size data blocks, but aggregates writes > into blocks of up to 128K. > > Personally, I don't worry about the overheads of checksumming as the > cost of the parity stripe(s) in raidz is dominant. It's a cost well > worth paying though - I have a 3 disk raidz1 pool and a disk went bad > within 3 months of building it (the manufacturer turned out to be having > a few problems at the time) but I didn't lose a byte. > Good testimonial. I'm not worried about the checksum space either. I figure the benefits make it cheap at the price. Of more concern to me now is how I'm going to come up with at least two more 2 TB drives to set up a raidz2 with a tolerably small fraction of the total space being tied up in combined ZFS overhead (i.e., bookkeeping, parity, checksums, etc.) Scott Bennett, Comm. ASMELG, CFIAG ********************************************************************** * Internet: bennett at sdf.org *xor* bennett at freeshell.org * *--------------------------------------------------------------------* * "A well regulated and disciplined militia, is at all times a good * * objection to the introduction of that bane of all free governments * * -- a standing army." * * -- Gov. John Hancock, New York Journal, 28 January 1790 * **********************************************************************
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?201408070831.s778VhJc015365>