From owner-freebsd-stable@FreeBSD.ORG Sat Jul 2 21:43:22 2011 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B8CC81065670; Sat, 2 Jul 2011 21:43:22 +0000 (UTC) (envelope-from tts@personalmis.com) Received: from mail-bw0-f54.google.com (mail-bw0-f54.google.com [209.85.214.54]) by mx1.freebsd.org (Postfix) with ESMTP id 9C7D38FC08; Sat, 2 Jul 2011 21:43:21 +0000 (UTC) Received: by bwa20 with SMTP id 20so4957917bwa.13 for ; Sat, 02 Jul 2011 14:43:20 -0700 (PDT) Received: by 10.204.9.203 with SMTP id m11mr4122816bkm.56.1309643000463; Sat, 02 Jul 2011 14:43:20 -0700 (PDT) Received: from mail-bw0-f54.google.com (mail-bw0-f54.google.com [209.85.214.54]) by mx.google.com with ESMTPS id ek1sm4166176bkb.21.2011.07.02.14.43.18 (version=SSLv3 cipher=OTHER); Sat, 02 Jul 2011 14:43:18 -0700 (PDT) Received: by bwa20 with SMTP id 20so4957882bwa.13 for ; Sat, 02 Jul 2011 14:43:15 -0700 (PDT) MIME-Version: 1.0 Received: by 10.204.9.195 with SMTP id m3mr282764bkm.29.1309642995570; Sat, 02 Jul 2011 14:43:15 -0700 (PDT) Received: by 10.204.117.198 with HTTP; Sat, 2 Jul 2011 14:43:15 -0700 (PDT) In-Reply-To: <8639ioadji.fsf@kopusha.home.net> References: <8639ioadji.fsf@kopusha.home.net> Date: Sat, 2 Jul 2011 14:43:15 -0700 Message-ID: From: Timothy Smith To: Mikolaj Golub Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Pawel Jakub Dawidek , freebsd-stable@freebsd.org Subject: Re: HAST + ZFS: no action on drive failure X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 02 Jul 2011 21:43:22 -0000 Hello Mikolaj, So, just to be clear, if a local drive fails in my pool, but the corresponding remote drive remains available, then hastd will both write to and read from the remote drive? That's really very cool! I looked more closely at the hastd(8) man page. There is some indication of what you say, but not so clear: "Read operations (BIO_READ) are handled locally unless I/O error occurs or local version of the data is not up-to-date yet (synchronization is in progress)." Perhaps this can be modified a bit? Adding, "or the local disk is unavailable. In such a case, the I/O operation will be handled by the remote resource." It does makes sense however, since HAST is base on the idea of raid. This feature increases the redundancy of the system greatly. My boss will be very impressed, as am I! I did notice however that when the pulled drive is reinserted, I need to change the associated hast resource to init, then back to primary to allow hastd to once again use it (perhaps the same if the secondary drive is failed?). Unless it will do this on it's own after some time? I did not wait more than a few minutes. But this is easy enough to script or to monitor the log and present a notification to admin at such a time. Thank you so much for the help! On Sat, Jul 2, 2011 at 8:49 AM, Mikolaj Golub wrote: > > On Thu, 30 Jun 2011 20:02:19 -0700 Timothy Smith wrote: > > TS> First posting here, hopefully I'm doing it right =) > > TS> I also posted this to the FreeBSD forum, but I know some hast folks > monitor > TS> this list regularly and not so much there, so... > > TS> Basically, I'm testing failure scenarios with HAST/ZFS. I got two > nodes, > TS> scripted up a bunch of checks and failover actions between the nodes. > TS> Looking good so far, though more complex that I expected. It would be > cool > TS> to post it somewher to get some pointers/critiques, but that's another > TS> thing. > > TS> Anyway, now I'm just seeing what happens when a drive fails on primary > node. > TS> Oddly/sadly, NOTHING! > > TS> Hast just keeps on a ticking, and doesn't change the state of the > failed > TS> drive, so the zpool has no clue the drive is offline. The > TS> /dev/hast/ remains. The hastd does log some errors to the > system > TS> log like this, but nothing more. > > TS> messages.0:Jun 30 18:39:59 nas1 hastd[11066]: [ada6] (primary) Unable > to > TS> flush activemap to disk: Device not configured. > TS> messages.0:Jun 30 18:39:59 nas1 hastd[11066]: [ada6] (primary) Local > request > TS> failed (Device not configured): WRITE(4736512, 512). > > Although the request to local drive failed it succeeded on remote node, so > data was not lost, it was considered as successful, and no error was > returned > to ZFS. > > TS> So, I guess the question is, "Do I have to script a cronjob to check > for > TS> these kinds of errors and then change the hast resource to 'init' or > TS> something to handle this?" Or is there some kind of hastd config > setting > TS> that I need to set? What's the SOP for this? > > Currently the only way to know is monitoring logs. It is not difficult to > hook > event for these errors in the HAST code (like it is done for > connect/disconnect, syncstart/done etc) so one could script what to do on > an > error occurrence but I am not sure it is a good idea -- the errors may be > generated with high rate. > > TS> As something related too, when the zpool in FreeBSD does finally > notice that > TS> the drive is missing because I have manually changed the hast resource > to > TS> INIT (so the /dev/hast/ is gone), my zpool (raidz2) hot spare > doesn't > TS> engage, even with "autoreplace=on". The zpool status of the degraded > pool > TS> seems to indicate that I should manually replace the failed drive. If > that's > TS> the case, it's not really a "hot spare". Does this mean the "FMA > Agent" > TS> referred to in the ZFS manual is not implemented in FreeBSD? > > TS> thanks! > TS> _______________________________________________ > TS> freebsd-stable@freebsd.org mailing list > TS> http://lists.freebsd.org/mailman/listinfo/freebsd-stable > TS> To unsubscribe, send any mail to " > freebsd-stable-unsubscribe@freebsd.org" > > -- > Mikolaj Golub >