Date: Sun, 24 Mar 2013 08:54:48 -0700 From: Jeremy Chadwick <jdc@koitsu.org> To: Quartz <quartz@sneakertech.com> Cc: freebsd-fs@freebsd.org Subject: Re: ZFS question Message-ID: <20130324155448.GA4122@icarus.home.lan> In-Reply-To: <20130324153342.GA3687@icarus.home.lan> References: <20130321044557.GA15977@icarus.home.lan> <514AA192.2090006@sneakertech.com> <20130321085304.GB16997@icarus.home.lan> <20130324153342.GA3687@icarus.home.lan>
next in thread | previous in thread | raw e-mail | index | archive | help
On Sun, Mar 24, 2013 at 08:33:42AM -0700, Jeremy Chadwick wrote:
> However, commands like "zpool status"
...and seems a typo I made in vim caused the rest of my sentence to get
deleted before I sent it out.  This should have read:
> However, commands like "zpool status" work just fine, but things like
> "zpool destroy" and so on indefinitely block ("mount drain"), which to
> me makes some degree of sense.
To expand: for example, you've lost 3 disks of a 4-disk raidz2 pool,
your data is buggered and you'll need to recover from backups.
Yes, you will need to reboot for the ZFS layer to effectively "un-wedge"
itself from whatever catatonic state its in.  No argument: this is a bug
somewhere, and my guess is that it relates to the confused state of the
devices in CAM-land.  But regardless, I think if you were to lose 3 of 4
disks on a raidz2 pool you'd have much more serious things to be worried
about than "well crap I have to issue a reboot".
And yes, I did test a reboot in the scenario I described -- the system
did reboot without physically pressing the button.  But then again, for
remotely-managed systems, administrators should have the ability to
remotely power-cycle or force resets (e.g. drop to DDB via serial
console and force a reset).
People who run servers remotely yet lack this capability are
intentionally choosing to live dangerously and I do not condone such.
These folks also make me wonder how they update world without remote
console access, since to do it right you *must* drop to single-user for
the installworld phase.
I learned my lesson of "assuming" installworld would work from
multi-user long ago when it broke one time and I ended up with a system
with broken /libexec/ld-elf* binaries.  Having to go to the datacenter
30 minutes away at 3 in the morning taught me to follow instructions.
-- 
| Jeremy Chadwick                                   jdc@koitsu.org |
| UNIX Systems Administrator                http://jdc.koitsu.org/ |
| Mountain View, CA, US                                            |
| Making life hard for others since 1977.             PGP 4BD6C0CB |
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20130324155448.GA4122>
