From owner-freebsd-fs@FreeBSD.ORG  Thu Dec 24 00:57:22 2009
From: Steven Schlansker <stevenschlansker@gmail.com>
Date: Wed, 23 Dec 2009 16:57:12 -0800
To: Rich
Cc: freebsd-fs@freebsd.org
Subject: Re: ZFS: Can't repair raidz2 (Cannot replace a replacing device)
Message-Id: <409922ED-42D5-4892-B74D-D2E696846AFB@gmail.com>
In-Reply-To: <5da0588e0912231644w2a7afb9dg41ceffbafc8c2df6@mail.gmail.com>

On Dec 23, 2009, at 4:44 PM, Rich wrote:

> Export then import, perhaps?

Sadly, exporting seems to write the state of this funky "replacing"
device out along with everything else, so importing just brings it
right back.  Granted, I haven't tried this in the new state with the
newly-resilvered drive, but I don't hold out much hope.  I'll try it
next time I'm physically with my server, as I now have to reboot it
anyway (sadness!)
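For reference, the round trip in question is just the stock
export/import cycle, roughly as below; I haven't re-run it since the
resilver finished, so treat it as a sketch rather than a known fix:

    # export the pool, then bring it back in and see whether the stale
    # "replacing" vdev survives the trip
    zpool export universe
    zpool import               # with no arguments, lists pools available for import
    zpool import universe      # re-import by name (or by the numeric pool id)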
> I don't honestly know what to suggest - there are horrid workarounds
> you could do involving manually diddling the metadata state, but I
> feel like the correct solution is to open up a bug report and get a
> fix put in.

I'd be more willing to try hackish workarounds if I either didn't care
about the data or had proper backups... but I don't, so I'm rather
worried about trashing the pool.  It's so hard to back up 6TB of data
on a college kid's budget! :(

I'd file a PR, but the last three times I've filed things on the
FreeBSD bug tracker they've gone largely ignored.  One from two years
ago is still open, and one from last summer hasn't even been replied
to... so I've rather given up on it.

> - Rich
>
> On Wed, Dec 23, 2009 at 7:36 PM, Steven Schlansker wrote:
>>
>> On Dec 22, 2009, at 5:41 PM, Rich wrote:
>>
>>> http://kerneltrap.org/mailarchive/freebsd-fs/2009/9/30/6457763 may be
>>> useful to you - it's what we did when we got stuck in a resilver loop.
>>> I recall being in the same state you're in right now at one point, and
>>> getting out of it from there.
>>>
>>> I think if you apply that patch, you'll be able to cancel the
>>> resilver, and then resilver again with the device you'd like to
>>> resilver with.
>>
>> Thanks for the suggestion, but the problem isn't that it's stuck
>> in a resilver loop (which is what the patch seems to try to avoid)
>> but that I can't detach a drive.
>>
>> Now I got clever and fudged a label onto the new drive (copied the
>> first 50MB of one of the dying drives), ran a scrub, and have this
>> layout -
>>
>>   pool: universe
>>  state: DEGRADED
>> status: One or more devices has experienced an unrecoverable error.  An
>>         attempt was made to correct the error.  Applications are unaffected.
>> action: Determine if the device needs to be replaced, and clear the errors
>>         using 'zpool clear' or replace the device with 'zpool replace'.
>>    see: http://www.sun.com/msg/ZFS-8000-9P
>>  scrub: scrub completed after 20h58m with 0 errors on Wed Dec 23 11:36:43 2009
>> config:
>>
>>         NAME                       STATE     READ WRITE CKSUM
>>         universe                   DEGRADED     0     0     0
>>           raidz2                   DEGRADED     0     0     0
>>             ad16                   ONLINE       0     0     0
>>             replacing              DEGRADED     0     0 40.7M
>>               ad26                 ONLINE       0     0     0  506G repaired
>>               6170688083648327969  UNAVAIL      0 88.7M     0  was /dev/ad12
>>             ad8                    ONLINE       0     0     0
>>             concat/back2           ONLINE       0     0     0
>>             ad10                   ONLINE       0     0     0
>>             concat/ad4ex           ONLINE       0     0     0
>>             ad24                   ONLINE       0     0     0
>>             concat/ad6ex           ONLINE      48     0     0  28.5K repaired
>>
>> Why has the replacing vdev not gone away?  I still can't detach -
>>
>> [steven@universe:~]% sudo zpool detach universe 6170688083648327969
>> cannot detach 6170688083648327969: no valid replicas
>>
>> even though now there actually is a valid replica (ad26).
>>
>> Additionally, running zpool clear hangs permanently and in fact
>> freezes all IO to the pool.  Since I've mounted /usr from the pool,
>> this is effectively death to the system.  Any other zfs commands seem
>> to work okay (zpool scrub, zfs mount, etc.).  Just clear is
>> insta-death.  I can't help but suspect that this is caused by the now
>> nonsensical vdev configuration (replacing with one good drive and one
>> nonexistent one)...
>>
>> Any further thoughts?  Thanks,
>> Steven
>>
>>> - Rich
>>>
>>> On Tue, Dec 22, 2009 at 6:15 PM, Miroslav Lachman <000.fbsd@quip.cz> wrote:
>>>> Steven Schlansker wrote:
>>>>>
>>>>> As a corollary, you may notice some funky concat business going on.
>>>>> This is because I have drives which are very slightly different in
>>>>> size (< 1MB), and whenever one of them goes down and I bring the
>>>>> pool up, it helpfully (?) expands the pool by a whole megabyte and
>>>>> then won't let the drive back in.  This is extremely frustrating...
>>>>> is there any way to fix that?  I'm eventually going to keep
>>>>> expanding each of my drives one megabyte at a time using gconcat
>>>>> and space on another drive!  Very frustrating...
>>>>
>>>> You can avoid it by partitioning the drives to the well-known
>>>> 'minimal' size (the size of the smallest disk) and using the
>>>> partition instead of the raw disk.  For example, ad12s1 instead of
>>>> ad12 (if you create slices with fdisk) or ad12p1 (if you create
>>>> partitions with gpart).
>>>>
>>>> You can also use labels instead of device names.
>>>>
>>>> Miroslav Lachman
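As an aside, for anyone who finds this thread in the archives, here is
a minimal sketch of the partition-plus-label approach Miroslav
describes above, assuming gpart with GPT labels.  The disk name,
partition size, and label are made-up placeholders rather than values
from my pool, and this is about avoiding the size mismatch on future
replacements, not a fix for the stuck replacing vdev:

    # put a GPT on the new disk and add one fixed-size freebsd-zfs
    # partition, sized to the smallest disk in the pool; "ad12", "930G",
    # and "universe-disk12" are illustrative placeholders only
    gpart create -s gpt ad12
    gpart add -t freebsd-zfs -s 930G -l universe-disk12 ad12

    # hand ZFS the labeled partition rather than the raw device
    zpool replace universe <failed-device> gpt/universe-disk12

With every member created at the same fixed size, a replacement disk
that comes up a megabyte short no longer gets rejected.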
>>>> _______________________________________________
>>>> freebsd-fs@freebsd.org mailing list
>>>> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
>>>> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"
>>>
>>> --
>>> If you are over 80 years old and accompanied by your parents, we will
>>> cash your check.
>>
>> _______________________________________________
>> freebsd-fs@freebsd.org mailing list
>> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
>> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"
>
> --
> Forest fires cause Smokey Bears.