Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 30 Jun 2016 17:37:47 +0200
From:      Julien Cigar <julien@perdition.city>
To:        Ben RUBSON <ben.rubson@gmail.com>
Cc:        freebsd-fs@freebsd.org
Subject:   Re: HAST + ZFS + NFS + CARP
Message-ID:  <20160630153747.GB5695@mordor.lan>
In-Reply-To: <AD42D8FD-D07B-454E-B79D-028C1EC57381@gmail.com>
References:  <20160630144546.GB99997@mordor.lan> <71b8da1e-acb2-9d4e-5d11-20695aa5274a@internetx.com> <AD42D8FD-D07B-454E-B79D-028C1EC57381@gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help

--mxv5cy4qt+RJ9ypb
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Thu, Jun 30, 2016 at 05:28:41PM +0200, Ben RUBSON wrote:
>=20
> > On 30 Jun 2016, at 17:14, InterNetX - Juergen Gotteswinter <jg@internet=
x.com> wrote:
> >=20
> >=20
> >=20
> > Am 30.06.2016 um 16:45 schrieb Julien Cigar:
> >> Hello,
> >>=20
> >> I'm always in the process of setting a redundant low-cost storage for=
=20
> >> our (small, ~30 people) team here.
> >>=20
> >> I read quite a lot of articles/documentations/etc and I plan to use HA=
ST
> >> with ZFS for the storage, CARP for the failover and the "good old NFS"
> >> to mount the shares on the clients.
> >>=20
> >> The hardware is 2xHP Proliant DL20 boxes with 2 dedicated disks for the
> >> shared storage.
> >>=20
> >> Assuming the following configuration:
> >> - MASTER is the active node and BACKUP is the standby node.
> >> - two disks in each machine: ada0 and ada1.
> >> - two interfaces in each machine: em0 and em1
> >> - em0 is the primary interface (with CARP setup)
> >> - em1 is dedicated to the HAST traffic (crossover cable)
> >> - FreeBSD is properly installed in each machine.
> >> - a HAST resource "disk0" for ada0p2.
> >> - a HAST resource "disk1" for ada1p2.
> >> - a zpool create zhast mirror /dev/hast/disk0 /dev/hast/disk1 is creat=
ed
> >>  on MASTER
> >>=20
> >> A couple of questions I am still wondering:
> >> - If a disk dies on the MASTER I guess that zpool will not see it and
> >>  will transparently use the one on BACKUP through the HAST ressource..
> >=20
> > thats right, as long as writes on $anything have been successful hast is
> > happy and wont start whining
> >=20
> >>  is it a problem?=20
> >=20
> > imho yes, at least from management view
> >=20
> >> could this lead to some corruption?
> >=20
> > probably, i never heard about anyone who uses that for long time in
> > production
> >=20
> > At this stage the
> >>  common sense would be to replace the disk quickly, but imagine the
> >>  worst case scenario where ada1 on MASTER dies, zpool will not see it=
=20
> >>  and will transparently use the one from the BACKUP node (through the=
=20
> >>  "disk1" HAST ressource), later ada0 on MASTER dies, zpool will not=20
> >>  see it and will transparently use the one from the BACKUP node=20
> >>  (through the "disk0" HAST ressource). At this point on MASTER the two=
=20
> >>  disks are broken but the pool is still considered healthy ... What if=
=20
> >>  after that we unplug the em0 network cable on BACKUP? Storage is
> >>  down..
> >> - Under heavy I/O the MASTER box suddently dies (for some reasons),=20
> >>  thanks to CARP the BACKUP node will switch from standy -> active and=
=20
> >>  execute the failover script which does some "hastctl role primary" for
> >>  the ressources and a zpool import. I wondered if there are any
> >>  situations where the pool couldn't be imported (=3D data corruption)?
> >>  For example what if the pool hasn't been exported on the MASTER before
> >>  it dies?
> >> - Is it a problem if the NFS daemons are started at boot on the standby
> >>  node, or should they only be started in the failover script? What
> >>  about stale files and active connections on the clients?
> >=20
> > sometimes stale mounts recover, sometimes not, sometimes clients need
> > even reboots
> >=20
> >> - A catastrophic power failure occur and MASTER and BACKUP are suddent=
ly
> >>  powered down. Later the power returns, is it possible that some
> >>  problem occur (split-brain scenario ?) regarding the order in which t=
he
> >=20
> > sure, you need an exact procedure to recover
> >=20
> >>  two machines boot up?
> >=20
> > best practice should be to keep everything down after boot
> >=20
> >> - Other things I have not thought?
> >>=20
> >=20
> >=20
> >=20
> >> Thanks!
> >> Julien
> >>=20
> >=20
> >=20
> > imho:
> >=20
> > leave hast where it is, go for zfs replication. will save your butt,
> > sooner or later if you avoid this fragile combination
>=20
> I was also replying, and finishing by this :
> Why don't you set your slave as an iSCSI target and simply do ZFS mirrori=
ng ?

Yes that's another option, so a zpool with two mirrors (local +=20
exported iSCSI) ?

> ZFS would then know as soon as a disk is failing.
> And if the master fails, you only have to import (-f certainly, in case o=
f a master power failure) on the slave.
>=20
> Ben
>=20
> _______________________________________________
> freebsd-fs@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"

--=20
Julien Cigar
Belgian Biodiversity Platform (http://www.biodiversity.be)
PGP fingerprint: EEF9 F697 4B68 D275 7B11  6A25 B2BB 3710 A204 23C0
No trees were killed in the creation of this message.
However, many electrons were terribly inconvenienced.

--mxv5cy4qt+RJ9ypb
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQIcBAABCgAGBQJXdTzLAAoJELK7NxCiBCPAU+EP/2sN1DkF36296xw1+mRA0dIz
rqyGM7fBZOhHCSqZ7YzzCKvpb28Pg5qe+oMD/tJ2N0sp5O07nG84sYMGFqEAirE1
LZaDoiMItRYobsIFpNjrMagmX5l9kiMy1aFo//eTFfY/J6Bzka2m7kErOtuhOkiP
gp7r02Rsr3dztjAa60Mpo0XYv2A9AOa2KkNJ0wQm2vdzXjzFIYomGvK7Eg0cDdMH
cfZWVSEDl5LundgtX7b1NY8gG04eP7c2lgOUTBO++D5p4SjzwZ07px56tSX9pqP1
TMG38CtUmA0w0MDDdXE6KiFg0ZYmFPQSmmV1SD/O34iuP1wo4BkIiOPf7lIYoxYQ
O/TZbD8VKbYAEKNChpTjFlKgGYFKPLK1IL3H6x0G4UtLC2SnUFlel4eH5LvZmlvz
hhTpjX9Ybjdn3Y4YXbnCr02rhwacapVghnwJQ8kjPHp/azLtAI7eyHtMWdZv2m2I
fxGw3npMM1CgF0xCF56CdM8t4XyqD5XMyU2RCYFNeTx/5t7MbjktCdK53QDlxHT9
3kuGYJG6MhLdjXuedl1QFyb91x3di7BjNDxt0qmrIMCsjO58J+0GOGmP5Te/sBlA
vWYMkfwKcX8lNuqT3xKESrclBofyboGsFRQBK5tbnCkfQEr4w4AlpVM3SPH7LspV
wYacJIrWBrAlEAunATQ8
=OY/+
-----END PGP SIGNATURE-----

--mxv5cy4qt+RJ9ypb--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20160630153747.GB5695>