From owner-freebsd-fs@FreeBSD.ORG Thu Nov 22 09:27:23 2007 Return-Path: Delivered-To: freebsd-fs@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8ADCA16A41A for ; Thu, 22 Nov 2007 09:27:23 +0000 (UTC) (envelope-from bra@fsn.hu) Received: from people.fsn.hu (people.fsn.hu [195.228.252.137]) by mx1.freebsd.org (Postfix) with ESMTP id 0CF6713C461 for ; Thu, 22 Nov 2007 09:27:00 +0000 (UTC) (envelope-from bra@fsn.hu) Received: from japan.t-online.private (people [192.168.2.4]) by people.fsn.hu (Postfix) with ESMTP id CE8ED7664 for ; Thu, 22 Nov 2007 10:08:05 +0100 (CET) Message-ID: <474546F5.2000007@fsn.hu> Date: Thu, 22 Nov 2007 10:08:05 +0100 From: Attila Nagy User-Agent: Thunderbird 2.0.0.0 (X11/20070421) MIME-Version: 1.0 To: freebsd-fs@FreeBSD.org Content-Type: text/plain; charset=ISO-8859-2; format=flowed Content-Transfer-Encoding: 7bit Cc: Subject: ZFS and FAULTED devices (corrupted data), can't make the pool ONLINE again X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 22 Nov 2007 09:27:23 -0000 Hello, FreeBSD RELENG_7, x86, a terrible disk array, called Promise RM-8000 with 8 disks on an ahc. The pool is a RAIDZ2. Tomorrow the array went crazy (its firmware is a total crap), so I had to reboot both the machine and the disk array. The effect: pool: people state: DEGRADED status: One or more devices could not be used because the label is missing or invalid. Sufficient replicas exist for the pool to continue functioning in a degraded state. action: Replace the device using 'zpool replace'. see: http://www.sun.com/msg/ZFS-8000-4J scrub: resilver completed with 0 errors on Thu Nov 22 10:45:27 2007 config: NAME STATE READ WRITE CKSUM people DEGRADED 0 0 0 raidz2 DEGRADED 0 0 0 da8 ONLINE 0 0 0 da3 FAULTED 0 0 0 corrupted data da5 ONLINE 0 0 0 da6 ONLINE 0 0 0 da7 ONLINE 0 0 0 da9 ONLINE 0 0 0 da10 ONLINE 0 0 0 da4 FAULTED 0 0 0 corrupted data errors: No known data errors I've tried everything I could think of: zpool replace people da3 invalid vdev specification use '-f' to override the following errors: da3 is in use (r1w1e1) zpool replace -f people da3 invalid vdev specification the following errors must be manually repaired: da3 is in use (r1w1e1) zpool offline people da3 cannot offline da3: no valid replicas zpool online people da3 Bringing device da3 online (nothing has changed, the device is still FAULTED) Hmm. Is this this bug? http://www.opensolaris.org/jive/thread.jspa?messageID=161812 -- Attila Nagy e-mail: Attila.Nagy@fsn.hu Free Software Network (FSN.HU) phone: +3630 306 6758 http://www.fsn.hu/