From owner-freebsd-questions@FreeBSD.ORG Fri Oct 13 19:04:07 2006 Return-Path: X-Original-To: FreeBSD-Questions@freebsd.org Delivered-To: FreeBSD-Questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 3F44616A407 for ; Fri, 13 Oct 2006 19:04:07 +0000 (UTC) (envelope-from rem@tco2.thecompanyonline.com) Received: from tco2.thecompanyonline.com (dsl017-004-081.ser1.dsl.speakeasy.net [69.17.4.81]) by mx1.FreeBSD.org (Postfix) with ESMTP id F164443D90 for ; Fri, 13 Oct 2006 19:03:47 +0000 (GMT) (envelope-from rem@tco2.thecompanyonline.com) Received: from [10.50.30.149] ([216.109.255.7]) (authenticated bits=0) by tco2.thecompanyonline.com (8.13.6/8.13.6) with ESMTP id k9DJ2vp7012406 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Fri, 13 Oct 2006 15:03:02 -0400 (EDT) (envelope-from rem@tco2.thecompanyonline.com) Message-ID: <452FE303.90002@tco2.thecompanyonline.com> Date: Fri, 13 Oct 2006 15:03:31 -0400 From: Richard McIntyre User-Agent: Mozilla Thunderbird 1.0.7 (Windows/20050923) X-Accept-Language: en-us, en MIME-Version: 1.0 To: FreeBSD-Questions@freebsd.org References: <003a01c6ee0a$841e74f0$6908a8c0@pcmoperations> <20061012182206.GA81008@Grumpy.DynDNS.org> In-Reply-To: <20061012182206.GA81008@Grumpy.DynDNS.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=0.0 required=5.0 tests=none autolearn=failed X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on tco2.thecompanyonline.com at Fri, 13 Oct 2006 15:03:05 -0400 X-Virus-Scanned: ClamAV 0.88.3/2030/Fri Oct 13 09:34:34 2006 on tco2.thecompanyonline.com X-Virus-Status: Clean Cc: Subject: Re: Hard Drive Issues X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 13 Oct 2006 19:04:07 -0000 David Kelly wrote: >On Thu, Oct 12, 2006 at 06:54:53PM +0100, Spiros Papadopoulos wrote: > > >>Since as you say everything is working, maybe it is a good idea to >>take a look and run the fsck command at least it may give you some >>more information, which you can post in order to get better answers >> >> > >That too, but first I'd start with sysutils/smartmontools and see what >the drive and its built-in log says. > > > I'm having a similar problem, Oct 13 03:01:31 tco1 kernel: ad2: FAILURE - READ_DMA status=51 error=40 LBA=181778119 Oct 13 07:11:15 tco1 kernel: ad2: FAILURE - READ_DMA status=51 error=40 LBA=181778119 I'm assuming that particular sector on the drive is dying, I have backed everything up on the drive, can anyone give me more information, should the drive simply be replaced or is it possible that this is simply a TOC error and could be corrected by newfs to the drive? I'm guessing it will need to be replaced, output of smartctl is below.... Thanks ~Richard uname -a >>FreeBSD 5.3-RELEASE FreeBSD 5.3-RELEASE #0: Mon May 2 22:32:50 EDT 2005 >>root@tco1:/usr/src/sys/i386/compile/TCO1.2005.05.02.001 i386 My output of smartmontools is: smartctl -a -s on /dev/ad2 smartctl version 5.36 [i386-portbld-freebsd5.3] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family Device Model: ST3200822A Serial Number: 5LJ0LW2T Firmware Version: 3.01 User Capacity: 200,049,647,616 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 6 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2 Local Time is: Fri Oct 13 14:56:23 2006 EDT SMART support is: Available - device has SMART capability. SMART support is: Disabled === START OF ENABLE/DISABLE COMMANDS SECTION === SMART Enabled. === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 430) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. No General Purpose Logging support. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 111) minutes. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 051 048 006 Pre-fail Always - 22488920 3 Spin_Up_Time 0x0003 097 097 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 21 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 1 7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail Always - 328020832 9 Power_On_Hours 0x0032 082 082 000 Old_age Always - 16043 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 22 194 Temperature_Celsius 0x0022 030 040 000 Old_age Always - 30 195 Hardware_ECC_Recovered 0x001a 051 048 000 Old_age Always - 22488920 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0 202 TA_Increase_Count 0x0032 051 204 000 Old_age Always - 49 SMART Error Log Version: 1 ATA Error Count: 7742 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 7742 occurred at disk power-on lifetime: 16036 hours (668 days + 4 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 15:22:37.737 READ DMA c8 00 04 9b b4 e1 ea 00 15:22:37.493 READ DMA c8 00 04 97 b4 e1 ea 00 15:22:37.251 READ DMA c8 00 04 a7 b4 e1 ea 00 15:22:37.002 READ DMA c8 00 04 a3 b4 e1 ea 00 15:22:36.761 READ DMA Error 7741 occurred at disk power-on lifetime: 16032 hours (668 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 11:08:40.154 READ DMA 35 00 20 df ff 2b 40 00 11:08:40.145 WRITE DMA EXT 35 00 20 1f d5 16 40 00 11:08:44.953 WRITE DMA EXT ca 00 20 3f c0 92 ef 00 11:08:40.258 WRITE DMA ca 00 20 df 85 81 ef 00 11:08:40.250 WRITE DMA Error 7740 occurred at disk power-on lifetime: 16012 hours (667 days + 4 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 15:49:49.473 READ DMA c8 00 04 9b b4 e1 ea 00 15:49:49.220 READ DMA c8 00 04 97 b4 e1 ea 00 15:49:52.420 READ DMA c8 00 04 a7 b4 e1 ea 00 15:49:52.175 READ DMA c8 00 04 a3 b4 e1 ea 00 15:49:51.929 READ DMA Error 7739 occurred at disk power-on lifetime: 16008 hours (667 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 11:35:56.771 READ DMA 35 00 20 bf e7 39 40 00 11:35:56.765 WRITE DMA EXT 35 00 20 7f 6b 2e 40 00 11:35:56.749 WRITE DMA EXT 35 00 20 3f 0d c7 40 00 11:35:56.740 WRITE DMA EXT 35 00 20 1f 4f c1 40 00 11:35:56.732 WRITE DMA EXT Error 7738 occurred at disk power-on lifetime: 15989 hours (666 days + 5 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 16:16:27.719 READ DMA c8 00 04 9b b4 e1 ea 00 16:16:27.468 READ DMA c8 00 04 97 b4 e1 ea 00 16:16:30.682 READ DMA c8 00 04 a7 b4 e1 ea 00 16:16:30.440 READ DMA c8 00 04 a3 b4 e1 ea 00 16:16:30.174 READ DMA SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.