From owner-freebsd-current@FreeBSD.ORG Thu Jul 24 19:19:54 2008 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 47C851065671 for ; Thu, 24 Jul 2008 19:19:54 +0000 (UTC) (envelope-from sven@dmv.com) Received: from smtp-gw-cl-d.dmv.com (smtp-gw-cl-d.dmv.com [216.240.97.42]) by mx1.freebsd.org (Postfix) with ESMTP id 0A0768FC15 for ; Thu, 24 Jul 2008 19:19:53 +0000 (UTC) (envelope-from sven@dmv.com) Received: from [216.240.97.46] (lanshark.dmv.com [216.240.97.46]) by smtp-gw-cl-d.dmv.com (8.12.10/8.12.10) with ESMTP id m6OIqfN0047987; Thu, 24 Jul 2008 14:52:42 -0400 (EDT) (envelope-from sven@dmv.com) From: Sven Willenberger To: Maurice Volaski In-Reply-To: References: Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-Zl3QXM2dyhM6PkBzcIMF" Date: Thu, 24 Jul 2008 14:52:41 -0400 Message-Id: <1216925561.6489.6.camel@lanshark.dmv.com> Mime-Version: 1.0 X-Mailer: Evolution 2.10.3 X-Scanned-By: MIMEDefang 2.48 on 216.240.97.42 Cc: freebsd-current@freebsd.org, pjd@freebsd.org Subject: Re: Would ZFS and gmirror work well together in a two-node failover cluster? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 24 Jul 2008 19:19:54 -0000 --=-Zl3QXM2dyhM6PkBzcIMF Content-Type: text/plain Content-Transfer-Encoding: quoted-printable On Fri, 2008-07-18 at 12:06 -0400, Maurice Volaski wrote: > I am looking to put together a two-node high-availability cluster=20 > where each node has identical data storage consisting of a set of=20 > internal data drives (separate from the boot drive). I want ZFS to=20 > manage the drives as a JDBOD in a RAIDZ2 configuration. Thus, if an=20 > individual drive misbehaves or fails, ZFS detects and handles the=20 > fault. >=20 > But I'm also looking to mirror this entire setup in real time to a=20 > second identical server. >=20 > Basically, my question is can this work well on FreeBSD while taking=20 > full advantage of ZFS? >=20 > Specifically, my understanding is that the only way to handle the=20 > real time mirror is with gmirror and ggated, but it's not clear how=20 > gmirror would interact with ZFS. >=20 My findings have been that ZFS and ggate[cd] do *not* play nicely together so I would concur taht gmirror/ggate[cd] would be the way to create a real-time mirror. (For those interested, I have posted some information on how ggate[cd] and zpool will cause lockups rendering it unsuitable for real-time multi-host mirroring over on -STABLE). > I am assuming that gmirror operates only on individual drives, so if=20 > I had a set of 24 drives on each server, there would be 24 mirrored=20 > drive pairs. >=20 > One concern I have is that this setup could run into trouble with=20 > gmirror's potentially sabotaging ZFS's RAIDZ2. For example, when a=20 > drive starts failing, won't gmirror see it before ZFS does and take=20 > the unfavorable action of substituting the corresponding drive in the=20 > failover server in subsequent I/O, leaving ZFS's RAIDZ2 out of the=20 > loop? >=20 > This is just one particular scenario, but in general, it's not=20 > entirely clear that it's possible to have fine-grained control of=20 > when, how much and in what direction gmirror manages synchronization=20 > among drive pairs. >=20 > -Maurice As was suggested in other followups, it would seem that zfs send/recv may be a viable option depending on whethere it is granular enough (timewise) to be practical. As far as failover goes (e.g. yank power cord out) your best bet would be using CARP as a virtual IP and using devd or ifstated to trigger an event on CARP interface change (from BACKUP to MASTER or vice-versa). I would be interested in seeing how well the zfs option of snapshots would work as rebuilding a 500GB gmirror is really a lengthy process ... Sven --=-Zl3QXM2dyhM6PkBzcIMF Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (GNU/Linux) iD8DBQBIiM95Snmnd8q3JGsRAkWCAJ9wqgLFhdygQbnV07k2sXChdhznAACdGcAV Slp9cGV01W3JaBWYGBciDN0= =TwiU -----END PGP SIGNATURE----- --=-Zl3QXM2dyhM6PkBzcIMF--