From owner-freebsd-stable@FreeBSD.ORG Fri Jan 25 20:56:24 2008 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 13B9816A41B for ; Fri, 25 Jan 2008 20:56:24 +0000 (UTC) (envelope-from joe@skyrush.com) Received: from shadow.wildlava.net (shadow.wildlava.net [67.40.138.81]) by mx1.freebsd.org (Postfix) with ESMTP id DC29D13C442 for ; Fri, 25 Jan 2008 20:56:23 +0000 (UTC) (envelope-from joe@skyrush.com) Received: from [192.168.1.112] (wireless.boulder.swri.edu [65.241.78.7]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by shadow.wildlava.net (Postfix) with ESMTP id B97348F43C; Fri, 25 Jan 2008 13:56:22 -0700 (MST) Message-ID: <479A4CB0.5080206@skyrush.com> Date: Fri, 25 Jan 2008 13:55:12 -0700 From: Joe Peterson User-Agent: Thunderbird 2.0.0.9 (Windows/20071031) MIME-Version: 1.0 To: Chuck Swiger References: <479A0731.6020405@skyrush.com> <20080125162940.GA38494@eos.sc1.parodius.com> <479A3764.6050800@skyrush.com> <3803988D-8D18-4E89-92EA-19BF62FD2395@mac.com> In-Reply-To: <3803988D-8D18-4E89-92EA-19BF62FD2395@mac.com> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: "ad0: TIMEOUT - WRITE_DMA" type errors with 7.0-RC1 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 25 Jan 2008 20:56:24 -0000 Chuck Swiger wrote: > On Jan 25, 2008, at 11:24 AM, Joe Peterson wrote: >> ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE >> UPDATED WHEN_FAILED RAW_VALUE >> 1 Raw_Read_Error_Rate 0x000f 114 071 006 Pre-fail >> Always - 82422948 > [ ... ] >> 7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail >> Always - 286126605 > [ ... ] >> 195 Hardware_ECC_Recovered 0x001a 063 046 000 Old_age >> Always - 166181300 > > These numbers are quite worrysome-- they should be zero or nearly so > in a healthy drive. It seems to depend on the drive manufacturer. E.g. this is a Seagate. Every Seagate I've ever had (or heard about on the web via smartctl dumps) reports very large numbers for these values. I've heard it described that Seagate shows you the raw numbers (and correctable errors do happen all the time in all drives). In Western Digital drives (IIRC), the numbers shown are the ones that *should* be zero, thereby hiding the low-level errors. Hard to say if my numbers are "too high", but these "corrected" error counts are always frighteningly high in Seagates. -Joe