From owner-freebsd-stable@FreeBSD.ORG Fri May 14 19:53:51 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1D9D9106566C for ; Fri, 14 May 2010 19:53:51 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from qmta04.westchester.pa.mail.comcast.net (qmta04.westchester.pa.mail.comcast.net [76.96.62.40]) by mx1.freebsd.org (Postfix) with ESMTP id C19CC8FC0A for ; Fri, 14 May 2010 19:53:50 +0000 (UTC) Received: from omta18.westchester.pa.mail.comcast.net ([76.96.62.90]) by qmta04.westchester.pa.mail.comcast.net with comcast id HUHx1e0061wpRvQ54Xtqzt; Fri, 14 May 2010 19:53:50 +0000 Received: from koitsu.dyndns.org ([98.248.46.159]) by omta18.westchester.pa.mail.comcast.net with comcast id HXtp1e0033S48mS3eXtpkS; Fri, 14 May 2010 19:53:50 +0000 Received: by icarus.home.lan (Postfix, from userid 1000) id D55889B419; Fri, 14 May 2010 12:53:46 -0700 (PDT) Date: Fri, 14 May 2010 12:53:46 -0700 From: Jeremy Chadwick To: Pieter de Boer Message-ID: <20100514195346.GA8977@icarus.home.lan> References: <4BED8B89.6010901@os3.nl> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4BED8B89.6010901@os3.nl> User-Agent: Mutt/1.5.20 (2009-06-14) Cc: freebsd-stable@freebsd.org Subject: Re: Read / write timeouts on SATA disks connected to ICH9 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 14 May 2010 19:53:51 -0000 On Fri, May 14, 2010 at 07:42:33PM +0200, Pieter de Boer wrote: > Hi list, > > I'm running FreeBSD 8.0-RELEASE-p1 on a Dell R300 which has a ICH9 > SATA controller on-board (do not have the RAID controller). > > The system has 2 disks in a gmirror setup. Every now and then, > probably under some load, one of the disks gets read or write > timeouts like: > May 5 03:01:37 aberdeen kernel: ad4: timeout waiting to issue command > May 5 03:01:37 aberdeen kernel: ad4: error issuing WRITE_DMA48 command > May 5 03:01:37 aberdeen kernel: GEOM_MIRROR: Request failed > (error=5). ad4[WRITE(offset=200404975104, length=16384)] > May 5 03:01:37 aberdeen kernel: GEOM_MIRROR: Device gm0: provider > ad4 disconnected. > > or: > > May 13 14:41:26 aberdeen kernel: ad6: TIMEOUT - READ_DMA48 retrying > (1 retry left) LBA=975513887 > > Sometimes the read/write succeeds after a few retries, but sometimes > it does not, so geom_mirror throws the disk out of the mirror. > > Tonight ad6 was thrown out of the mirror and ad4 then gave actual > read errors, resulting in a big mess :( > > My question: does anyone have experience with FreeBSD on a Dell R300 > or can anyone give me some help in trying to fix the timeouts? Could you please do the following: - Provide output from "vmstat -i" - Provide output from "dmesg | grep -i ata" - Install ports/sysutils/smartmontools (5.40 or later) and provide full output from commands "smartctl -a /dev/ad4" and "smartctl -a /dev/ad6" -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |