Message-ID: <52A8C5AF.1000800@norma.perm.ru>
Date: Thu, 12 Dec 2013 02:06:07 +0600
From: "Eugene M. Zheganin"
To: "freebsd-stable@freebsd.org Stable"
Subject: zfs i/o error - all block copies unavailable

Hi. I have a server, it was running 8.2-STABLE/i386 with zfs v28.
All of a sudden, on the last reboot, I got a big bunch of "zfs i/o error - all block copies unavailable" messages and the server was unable to boot. I decided that one disk was dying, detached it, and booted successfully. On the next boot I got the error again. I booted from a CD, replaced zpool.cache, and booted successfully once again. On the next reboot I got it again, and this time was unable to fix it. However (as you may already know) the disks were fine, and none of the data was corrupted.

I read a couple of mailing list posts mentioning that this could be an i386 issue, and decided to deal with it in a radical way: I installed 10.0-BETA1/amd64 (booted from a LiveCD, mounted an NFS share with /usr/src and obj, and did the upgrade). Now everything is almost fine, except that I still get this message, but only once, and it seems to be harmless, as the server still boots (I experimented and tried this about a dozen times).

So, questions:
- is it really harmless?
- can I run with this?
- is there any way to get rid of it? (preferably without recreating the pool, because it holds several TBytes of user data)
- and, finally - what does it mean?

Thanks.
Eugene.