From owner-freebsd-fs@FreeBSD.ORG Wed Oct 1 13:29:05 2014 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 10C4253F for ; Wed, 1 Oct 2014 13:29:05 +0000 (UTC) Received: from mx1.internetx.com (mx1.internetx.com [62.116.129.39]) by mx1.freebsd.org (Postfix) with ESMTP id 8D6B669A for ; Wed, 1 Oct 2014 13:29:04 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mx1.internetx.com (Postfix) with ESMTP id 1B4561472007; Wed, 1 Oct 2014 15:29:02 +0200 (CEST) X-Virus-Scanned: InterNetX GmbH amavisd-new at ix-mailer.internetx.de Received: from mx1.internetx.com ([62.116.129.39]) by localhost (ix-mailer.internetx.de [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id j3G2TTR76mKL; Wed, 1 Oct 2014 15:28:59 +0200 (CEST) Received: from [192.168.100.26] (pizza.internetx.de [62.116.129.3]) (using TLSv1 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by mx1.internetx.com (Postfix) with ESMTPSA id 7B30F4C4C9B0; Wed, 1 Oct 2014 15:28:59 +0200 (CEST) Message-ID: <542C019E.2080702@internetx.com> Date: Wed, 01 Oct 2014 15:29:02 +0200 From: InterNetX - Juergen Gotteswinter Reply-To: jg@internetx.com User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.1.2 MIME-Version: 1.0 To: George Kontostanos Subject: Re: HAST with broken HDD References: <542BC135.1070906@Skynet.be> <542BDDB3.8080805@internetx.com> <542BF853.3040604@internetx.com> In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Cc: freebsd-fs@freebsd.org, JF-Bogaerts X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 01 Oct 2014 13:29:05 -0000 Am 01.10.2014 um 15:06 schrieb George Kontostanos: > > > On Wed, Oct 1, 2014 at 3:49 PM, InterNetX - Juergen Gotteswinter > > wrote: > > Am 01.10.2014 um 14:28 schrieb George Kontostanos: > > > > On Wed, Oct 1, 2014 at 1:55 PM, InterNetX - Juergen Gotteswinter > > > >> wrote: > > > > Am 01.10.2014 um 10:54 schrieb JF-Bogaerts: > > > Hello, > > > I'm preparing a HA NAS solution using HAST. > > > I'm wondering what will happen if one of disks of the > primary node will > > > fail or become erratic. > > > > > > Thx, > > > Jean-François Bogaerts > > > > nothing. if you are using zfs on top of hast zfs wont even > take notice > > about the disk failure. > > > > as long as the write operation was sucessfull on one of the 2 > nodes, > > hast doesnt notify the ontop layers about io errors. > > > > interesting concept, took me some time to deal with this. > > > > > > Are you saying that the pool will appear to be optimal even with a bad > > drive? > > > > > > https://forums.freebsd.org/viewtopic.php?&t=24786 > > > > It appears that this is actually the case. And it is very disturbing, > meaning that a drive failure goes unnoticed. In my case I completely > removed the second disk on the primary node and a zpool status showed > absolutely no problem. Scrubbing the pool began resilvering which > indicates that there is actually something wrong! right. lets go further and think how zfs works regarding direct hardware / disk access. theres a layer between which always says ey, everthing is fine. no more need for pool scrubbing, since hastd wont tell if anything is wrong :D > > pool: tank > > state: ONLINE > > status: One or more devices has experienced an error resulting in data > > corruption. Applications may be affected. > > action: Restore the file in question if possible. Otherwise restore the > > entire pool from backup. > > see: http://illumos.org/msg/ZFS-8000-8A > > scan: scrub repaired 16K in 0h2m with 7 errors on Wed Oct 1 16:00:47 2014 > > config: > > > NAME STATE READ WRITE CKSUM > > tank ONLINE 0 0 7 > > mirror-0 ONLINE 0 0 40 > > hast/disk1 ONLINE 0 0 40 > > hast/disk2 ONLINE 0 0 40 > > > Unfortunately, in this case there was data loss and hastctl status does > not report the missing disk! > > NameStatusRoleComponents > > disk1complete primary /dev/ada1hast2 > > disk2complete primary /dev/ada2hast2 > > > -- > George Kontostanos > --- > http://www.aisecure.net