From: Adam Pordzik
Date: Thu, 20 Mar 2008 01:58:37 +0100
To: freebsd-geom@freebsd.org
Subject: geom mirror in error-prone situations
Message-ID: <47E1B6BD.3080002@d-dt.de>
List-Id: GEOM-specific discussions and implementations

Hello,

recently I was faced with a strange situation in which ata(4) irregularly
reports interrupt storms followed by WRITE_DMA48 timeouts:

Mar 20 01:00:30 brain kernel: interrupt storm detected on "irq14:"; throttling interrupt source
Mar 20 01:00:30 brain last message repeated 4 times
Mar 20 01:00:30 brain kernel: ata5: reiniting channel ..
Mar 20 01:00:30 brain kernel: ata5: SATA connect time=0ms
Mar 20 01:00:30 brain kernel: ata5: reset tp1 mask=01 ostat0=51 ostat1=00
Mar 20 01:00:30 brain kernel: ata5: stat0=0x50 err=0x01 lsb=0x00 msb=0x00
Mar 20 01:00:30 brain kernel: ata5: reset tp2 stat0=50 stat1=00 devices=0x1
Mar 20 01:00:30 brain kernel: ata5: reinit done ..
Mar 20 01:00:30 brain kernel: ad10: TIMEOUT - WRITE_DMA48 retrying (0 retries left) LBA=871709696

The disk was (and still is) physically faultless.

How would gmirror handle such a disk? Would the disk be removed from the
mirror at the first error? And would it be re-inserted at the next boot?
If so, does it first have to be rebuilt?

A