From owner-freebsd-fs@FreeBSD.ORG Thu Apr 19 19:17:25 2007 Return-Path: X-Original-To: fs@freebsd.org Delivered-To: freebsd-fs@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 1E76416A414 for ; Thu, 19 Apr 2007 19:17:25 +0000 (UTC) (envelope-from rick@kiwi-computer.com) Received: from kiwi-computer.com (keira.kiwi-computer.com [63.224.10.3]) by mx1.freebsd.org (Postfix) with SMTP id 99C0D13C4AE for ; Thu, 19 Apr 2007 19:17:22 +0000 (UTC) (envelope-from rick@kiwi-computer.com) Received: (qmail 59963 invoked by uid 2001); 19 Apr 2007 19:17:21 -0000 Date: Thu, 19 Apr 2007 14:17:21 -0500 From: "Rick C. Petty" To: Dag-Erling Sm?rgrav Message-ID: <20070419191721.GA59824@keira.kiwi-computer.com> References: <20070418104155.GA31727@eschew.pusen.org> <86hcrdlqak.fsf@dwp.des.no> <20070418144103.GB31727@eschew.pusen.org> <20070418155156.GB20441@keira.kiwi-computer.com> <20070418180200.GA32061@eschew.pusen.org> <86odlku5xg.fsf@dwp.des.no> <20070419173316.GA57227@keira.kiwi-computer.com> <863b2w41tz.fsf@dwp.des.no> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <863b2w41tz.fsf@dwp.des.no> User-Agent: Mutt/1.4.2.1i Cc: piso@freebsd.org, fs@freebsd.org Subject: Re: ZFS + replacing failing hard-drive. X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: rick-freebsd@kiwi-computer.com List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 19 Apr 2007 19:17:25 -0000 On Thu, Apr 19, 2007 at 09:10:48PM +0200, Dag-Erling Sm?rgrav wrote: > "Rick C. Petty" writes: > > What do you mean by recently? I've seen this problem which started around > > 5.4-RELEASE (perhaps earlier) and on, including 6.0-R thru 6.2-stable as of > > a few weeks ago. Would the interrupt filtering be present on these > > systems? > > No, and in fact it's not used by the ata driver in -CURRENT, so it's > probably not relevant. > > I said recently because I didn't start having these problems until > around early march. However, this could be due to changes in usage > patterns. Oh. :( Okay. I had hoped otherwise... I'm in the process of looking at the commands sent to the PDC chips from linux's sata_promise driver and comparing what's different about ours. Linux properly reinitialized the card (tested with rmmod & modprobe), but the command which is causing the drives to be reprobed may affect the whole card vs. a single channel. I'll have time to play around with the FreeBSD kernel this weekend. Unfortunately, I couldn't figure out a way in linux to drop just a channel and reattach it, and I didn't see anything in their code which handles this case. However, I'd rather have the whole Promise card pause while reinitializing all channels than not be able to bring up a single channel. Still, it would be useful to figure out the cause of the DMA failures and avoid this altogether. Anyone volunteering to help? sos@ ? -- Rick C. Petty