From owner-freebsd-stable@FreeBSD.ORG Mon Jul 19 22:04:13 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 698311065674 for ; Mon, 19 Jul 2010 22:04:13 +0000 (UTC) (envelope-from garrettmoore@gmail.com) Received: from mail-iw0-f182.google.com (mail-iw0-f182.google.com [209.85.214.182]) by mx1.freebsd.org (Postfix) with ESMTP id 2A6728FC16 for ; Mon, 19 Jul 2010 22:04:12 +0000 (UTC) Received: by iwn35 with SMTP id 35so6366766iwn.13 for ; Mon, 19 Jul 2010 15:04:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:cc:content-type; bh=jspWGSe7JkncMATLGmJDHbvMOalqNU5VowGR9PmVObE=; b=HiDE+f+hNQP6lO22J6XJXCV2PaLmvnw1LWV5CVYiozkuAFqhT9RD3ijK9inEOx7Hgs ze05kTL8vSM4S9VUzYyzAzbxvOtIZWs1BPVR9cq+HqjeZZtgH3EIQ04VpyQ8WQSg4cPq ql15eKrAasy8h0vLKA9f9rVPUS5uJj3c18EuE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=c6Q7pWVWDM3ll7b0TSXwrFVlVdBc2MQB/Etl2KsojciDy9YiwdC2bKUnVjmPcuwfyc NSccCZCSBvnpwJXVJdrC9PYnFH47VS9qxJyBddngsXdBjtdGyW+NGd2poapXFECTq0DF vDQTXGcf6JLXKeHOh44Mgf2pqmtoPskCrsllk= MIME-Version: 1.0 Received: by 10.231.14.194 with SMTP id h2mr6124764iba.67.1279577052316; Mon, 19 Jul 2010 15:04:12 -0700 (PDT) Received: by 10.231.117.72 with HTTP; Mon, 19 Jul 2010 15:04:12 -0700 (PDT) In-Reply-To: <4C44B104.2050000@libeljournal.com> References: <4C44B104.2050000@libeljournal.com> Date: Mon, 19 Jul 2010 18:04:12 -0400 Message-ID: From: Garrett Moore To: John Hawkes-Reed Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-stable Subject: Re: Problems replacing failing drive in ZFS pool X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 19 Jul 2010 22:04:13 -0000 Well, hotswapping worked, but now I have a totally different problem. Just for reference: # zpool offline tank da3 # camcontrol stop da3 # camcontrol rescan all <'da3 lost device, removing device entry'> # camcontrol rescan all <'da3 at mpt0 ...', so new drive was found! yay> # zpool replace tank da3 *cannot replace da3 with da3: device is too small* So I looked at the smartctl output for the old and new drive. Old: Device Model: WDC WD15EADS-00P8B0 Serial Number: WD-WMAVU0087717 Firmware Version: 01.00A01 User Capacity: 1,500,301,910,016 bytes New: Device Model: WDC WD15EADS-00R6B0 Serial Number: WD-WCAVY4770428 Firmware Version: 01.00A01 User Capacity: 1,500,300,828,160 bytes God damnit, Western Digital. What can I do now? It's such a small difference, is there a way I can work around this? My other replacement drive is the "00R6B0" drive model as well, with the slightly smaller capacity. On Mon, Jul 19, 2010 at 4:09 PM, John Hawkes-Reed wrote: > On 19/07/2010 17:52, Garrett Moore wrote: > >> I'm nervous to trust the hotswap features and camcontrol to set things up >> properly, but I guess I could try it. When I first set the system up >> before >> I put data on the array I tried the hotswap functionality and drives >> wouldn't always re-attach when reinserted, even if I fiddled with >> camcontrol, but I can't remember exactly what I did then. >> > > We've a pair of medium-sized ZFS boxes with Supermicro boards (X8DTi, IIRC) > in hotswap chassis. They've both got one hot-spare drive. Well, I say 'hot > spare'. I mean 'Ought to be a hot-spare if my shoddy Perl works when > triggered by devd'. What we've found to work is this: > > Drive fails (thus far simulated by pulling the drive from the backplane) > ZFS error reported. Pool in degraded state. > 'zpool replace pool da9 da23' (Where da23 is the hot spare and where this > *should* happen automagically.) > Wait for resilvering. > Go on and swap the failed drive (da9 in this case) > 'camcontrol rescan all' (new drive shows up in /var/log/messages) > 'zpool replace da9' > Wait while resilvering happens. > Hot-swap drive returns to 'avail' status. > > [ ... ] > > > -- > JH-R > > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" >