From owner-freebsd-stable@FreeBSD.ORG Wed Dec 15 00:54:10 2004 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 42E4A16A4CE for ; Wed, 15 Dec 2004 00:54:10 +0000 (GMT) Received: from outbound0.sv.meer.net (outbound0.sv.meer.net [205.217.152.13]) by mx1.FreeBSD.org (Postfix) with ESMTP id 224AC43D41 for ; Wed, 15 Dec 2004 00:54:10 +0000 (GMT) (envelope-from jrhett@mail.meer.net) Received: from mail.meer.net (mail.meer.net [209.157.152.14]) iBF0s1wN074756; Tue, 14 Dec 2004 16:54:01 -0800 (PST) (envelope-from jrhett@mail.meer.net) Received: from mail.meer.net (localhost [127.0.0.1]) by mail.meer.net (8.12.10/8.12.10/meer) with ESMTP id iBF0rxr8062842; Tue, 14 Dec 2004 16:53:59 -0800 (PST) (envelope-from jrhett@mail.meer.net) Received: (from jrhett@localhost) by mail.meer.net (8.12.1/8.12.10) id iBF0rxC0062840; Tue, 14 Dec 2004 16:53:59 -0800 (PST) (envelope-from jrhett) Date: Tue, 14 Dec 2004 16:53:59 -0800 From: Joe Rhett To: =?iso-8859-1?Q?S=F8ren?= Schmidt Message-ID: <20041215005359.GK27283@meer.net> Mail-Followup-To: =?iso-8859-1?Q?S=F8ren?= Schmidt , freebsd-stable@freebsd.org References: <20041213052628.GB78120@meer.net> <20041213054159.GC78120@meer.net> <20041212215841.X83257@carver.gumbysoft.com> <20041213060549.GE78120@meer.net> <20041213102333.V92964@carver.gumbysoft.com> <20041213192119.GB4781@meer.net> <20041213183336.T97507@carver.gumbysoft.com> <41BE8F2D.8000407@DeepCore.dk> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <41BE8F2D.8000407@DeepCore.dk> User-Agent: Mutt/1.4i Organization: Meer.net LLC cc: freebsd-stable@freebsd.org Subject: Re: drive failure during rebuild causes page fault X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 15 Dec 2004 00:54:10 -0000 Soren, do you have any thoughts on what I could do to alleviate or better debug this page fault? I've found three ways to cause this: in all cases "pull" is either physical pull or "atacontrol detach " 1. Pull a drive and rebuild onto hot spare. Pull hot spare *boom* 2. Pull a drive and rebuild onto hot spare. Pull good disk *boom* ...should cause filesystem failure, but not page fault when it's not / 3. Pull a drive and then put it back. The system suddenly has a new array with just that drive in it. "atacontrol delete " *boom* In particular, what's the story with the new array appearing when you insert a drive with array meta-data on it? That array appears to be half-there (no devices, etc) which is probably what causes #2... On Tue, Dec 14, 2004 at 07:58:53AM +0100, Søren Schmidt wrote: > Actually I'm in the process of rewriting the ATA RAID code, so things > are rolling, albeit slowly, time is a precious resource. I belive that > it can be made pretty robust, but the rest of the kernel still have > issues with disappearing devices etc thats out of ATA's realm. > > Anyhow. I can only test with the HW I have here in the lab, which by far > covers all possible permutations, so testing etc by the community is > very much needed here to get things sorted out... -- Joe Rhett Senior Geek Meer.net