From owner-freebsd-fs@FreeBSD.ORG Mon Jun 22 01:50:25 2015 Return-Path: Delivered-To: freebsd-fs@nevdull.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id A0B8A267 for ; Mon, 22 Jun 2015 01:50:25 +0000 (UTC) (envelope-from wjw@digiware.nl) Received: from smtp.digiware.nl (smtp.digiware.nl [31.223.170.169]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 30C85AFF for ; Mon, 22 Jun 2015 01:50:24 +0000 (UTC) (envelope-from wjw@digiware.nl) Received: from rack1.digiware.nl (unknown [127.0.0.1]) by smtp.digiware.nl (Postfix) with ESMTP id 1DF1416A408; Mon, 22 Jun 2015 01:23:40 +0200 (CEST) X-Virus-Scanned: amavisd-new at digiware.nl Received: from smtp.digiware.nl ([127.0.0.1]) by rack1.digiware.nl (rack1.digiware.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id lZryYmhesiXL; Mon, 22 Jun 2015 01:23:28 +0200 (CEST) Received: from [IPv6:2001:4cb8:3:1:a079:ce8f:c2bf:e69] (unknown [IPv6:2001:4cb8:3:1:a079:ce8f:c2bf:e69]) by smtp.digiware.nl (Postfix) with ESMTPA id BE2CF16A407; Mon, 22 Jun 2015 01:23:28 +0200 (CEST) Message-ID: <55874772.4090607@digiware.nl> Date: Mon, 22 Jun 2015 01:23:30 +0200 From: Willem Jan Withagen User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: Quartz CC: freebsd-fs@freebsd.org Subject: Re: This diskfailure should not panic a system, but just disconnect disk from ZFS References: <5585767B.4000206@digiware.nl> <558590BD.40603@isletech.net> <5586C396.9010100@digiware.nl> <55871F4C.5010103@sneakertech.com> In-Reply-To: <55871F4C.5010103@sneakertech.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Jun 2015 01:50:25 -0000 On 21/06/2015 22:32, Quartz wrote: >> Or do I have to high hopes of ZFS? >> And is a hung disk a 'catastrophic pool failure'? > > Yes to both. > > I encountered this exact same issue a couple years ago (and complained > about it to this list as well, although I didn't get a complete answer > at the time. I can provide links to the conversation if interested). > > Basically, the heart of the issue is the way the kernel/drivers/ZFS > deals with IO and DMA. There's currently no way to tell what's going on > with the disks and what outstanding IO to the pool can be dropped or > ignored. As-currently-designed there's no safe way to just kick out the > pool and keep going, so the only options are to wait, panic, or wait and > then panic. Fixing this would require a major rewrite of a lot of code, > which isn't going to happen any time soon. The failmode setting and > deadman timer were implemented as a bandage to prevent the system from > hanging forever. > > See this page for more info: > http://comments.gmane.org/gmane.os.illumos.zfs/61 > Yes, I know of this discussion already as long as I'm working with ZFS. But reading it like this does make it much more sense. Perhaps the text should suggest some thing more painfull :), since now it suggest that chopping the disk would help... (Hence my reaction) But especially the hung disk during reading is sort of a ticking timebomb... On the other hand, if it is already outstanding for 100's of seconds, then it is never going to arrive. So the chance of running into corrupted memory is going to be 0.00000...... But I do agree that in this case a panic might be next best solution. >From another response I conclude that htere could be something in the driver/hardware combo that could run me into trouble... >> All failmode settings result in a seriously handicapped system... > > Yes. Again, this is a design issue/flaw with how DMA works. There's no > real way to continue on gracefully when a pool completely dies due to > hung IO. > > We're all pretty much stuck with this problem, at least for quite a while. We'll the pool did not die, (at least not IMHO) just one disk stopt working.... The pool is still resilient, so it could continue alert the operator have the operator fix/reboot/..... But it the fact that a stalled DMA action could "corrupt" memory after the fact. Just because a command outstanding for way too long, does all of a sudden completes. >> Is waiting only meant to wait a limited time? And then panic anyways? > > By default yes. However, if you know that on your system the issue will > eventually resolve itself given several hours (and you want to wait that > long) you can change the deadman timeout or disable it completely. Look > at "vfs.zfs.deadman_enabled" and "vfs.zfs.deadman_synctime". I see: vfs.zfs.deadman_enabled: 1 vfs.zfs.deadman_checktime_ms: 5000 vfs.zfs.deadman_synctime_ms: 1000000 So the "hung" on this I/O action has taken 1000 secs, and did not complete... I guess that if I like to live dangerously, I could set enabled to 0, and run the risk... ?? Probably the better solution is to see if it occurs more often, and in that case upgrade to modern hardware with newer HD controllers. Current config has worked for me for already quite some time. For the time being I'll offload a few disks to a mvs controller that is the only PCIe slot in the MB. Thanx for all the info, --WjW