From owner-freebsd-stable@FreeBSD.ORG Fri Mar 2 11:25:01 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id B515B106566B for ; Fri, 2 Mar 2012 11:25:01 +0000 (UTC) (envelope-from prvs=0408e3a959=ob@gruft.de) Received: from main.mx.e-gitt.net (service.rules.org [IPv6:2001:1560:2342::2]) by mx1.freebsd.org (Postfix) with ESMTP id 407698FC0A for ; Fri, 2 Mar 2012 11:25:01 +0000 (UTC) Received: from ob by main.mx.e-gitt.net with local (Exim 4.77 (FreeBSD)) (envelope-from ) id 1S3Qba-0002xm-1V for freebsd-stable@freebsd.org; Fri, 02 Mar 2012 12:24:58 +0100 Date: Fri, 2 Mar 2012 12:24:57 +0100 From: Oliver Brandmueller To: freebsd-stable@freebsd.org Message-ID: <20120302112457.GB6032@e-Gitt.NET> Mail-Followup-To: freebsd-stable@freebsd.org References: <4F5089B7.2070103@withagen.nl> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4F5089B7.2070103@withagen.nl> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: Oliver Brandmueller Subject: Re: Disk disconnects, with immediate reconnect X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 02 Mar 2012 11:25:01 -0000 Hi, On Fri, Mar 02, 2012 at 09:49:59AM +0100, Willem Jan Withagen wrote: > On my ZFS server: > (info on: http://www.tegenbosch28.nl/FreeBSD/systems/ZFS/ ) > > +ahcich4: Timeout on slot 23 port 0 > +ahcich4: is 00000000 cs 00800000 ss 00000000 rs 00800000 tfd c0 serr > 00000000 cmd 0004d717 > +(ada3:ahcich4:0:0:0): lost device > +(ada3:ahcich4:0:0:0): removing device entry > +ada3 at ahcich4 bus 0 scbus5 target 0 lun 0 > +ada3: ATA-8 SATA 2.x device > +ada3: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) > +ada3: Command Queueing enabled > +ada3: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C) > > The reconnect occurs immediately after the disconnect. > > I had some discussions with Jeremy Chadwick, so below are the smartctl > stats. > > The system was not particularly busy at that moment. > Is this disk failure, or why other did it disconnect. I suggest changing the disk: > 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail > Always - 13 I guess, this growing soon and fast ... > 197 Current_Pending_Sector 0x0012 100 100 000 Old_age > Always - 3 > 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age > Offline - 3 Doesn't look too promising. - Oliver -- | Oliver Brandmueller http://sysadm.in/ ob@sysadm.in | | Ich bin das Internet. Sowahr ich Gott helfe. |