From owner-freebsd-fs@FreeBSD.ORG Mon Jun 22 01:36:47 2015 Return-Path: Delivered-To: freebsd-fs@nevdull.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 95E8C73 for ; Mon, 22 Jun 2015 01:36:47 +0000 (UTC) (envelope-from quartz@sneakertech.com) Received: from hub.freebsd.org (hub.freebsd.org [8.8.178.136]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "hub.freebsd.org", Issuer "hub.freebsd.org" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 798DE1E3E for ; Mon, 22 Jun 2015 01:36:47 +0000 (UTC) (envelope-from quartz@sneakertech.com) Received: by hub.freebsd.org (Postfix) id 6EFFD72; Mon, 22 Jun 2015 01:36:47 +0000 (UTC) Delivered-To: fs@nevdull.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 6D37071 for ; Mon, 22 Jun 2015 01:36:47 +0000 (UTC) (envelope-from quartz@sneakertech.com) Received: from douhisi.pair.com (douhisi.pair.com [209.68.5.179]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4B0B71E3D for ; Mon, 22 Jun 2015 01:36:47 +0000 (UTC) (envelope-from quartz@sneakertech.com) Received: from [10.2.2.1] (pool-173-48-121-235.bstnma.fios.verizon.net [173.48.121.235]) by douhisi.pair.com (Postfix) with ESMTPSA id CABD33F71F; Sun, 21 Jun 2015 16:49:46 -0400 (EDT) Message-ID: <5587236A.6020404@sneakertech.com> Date: Sun, 21 Jun 2015 16:49:46 -0400 From: Quartz User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:10.0.2) Gecko/20120216 Thunderbird/10.0.2 MIME-Version: 1.0 To: Willem Jan Withagen CC: fs@freebsd.org Subject: Re: This diskfailure should not panic a system, but just disconnect disk from ZFS References: <5585767B.4000206@digiware.nl> In-Reply-To: <5585767B.4000206@digiware.nl> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Jun 2015 01:36:47 -0000 Also: > And thus I'd would have expected that ZFS would disconnect /dev/da0 and > then switch to DEGRADED state and continue, letting the operator fix the > broken disk. > Next question to answer is why this WD RED on: > got hung, and nothing for this shows in SMART.... You have a raidz2, which means THREE disks need to go down before the pool is unwritable. The problem is most likely your controller or power supply, not your disks. Also2: don't rely too much on SMART for determining drive health. Google released a paper a few years ago revealing that half of all drives die without reporting SMART errors. http://research.google.com/archive/disk_failures.pdf