Date: Fri, 13 Oct 2006 15:03:31 -0400 From: Richard McIntyre <rem@tco2.thecompanyonline.com> To: FreeBSD-Questions@freebsd.org Subject: Re: Hard Drive Issues Message-ID: <452FE303.90002@tco2.thecompanyonline.com> In-Reply-To: <20061012182206.GA81008@Grumpy.DynDNS.org> References: <003a01c6ee0a$841e74f0$6908a8c0@pcmoperations> <dab71e150610121054s2c4fd6bdh88372c1143e29cd7@mail.gmail.com> <20061012182206.GA81008@Grumpy.DynDNS.org>
next in thread | previous in thread | raw e-mail | index | archive | help
David Kelly wrote: >On Thu, Oct 12, 2006 at 06:54:53PM +0100, Spiros Papadopoulos wrote: > > >>Since as you say everything is working, maybe it is a good idea to >>take a look and run the fsck command at least it may give you some >>more information, which you can post in order to get better answers >> >> > >That too, but first I'd start with sysutils/smartmontools and see what >the drive and its built-in log says. > > > I'm having a similar problem, Oct 13 03:01:31 tco1 kernel: ad2: FAILURE - READ_DMA status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=181778119 Oct 13 07:11:15 tco1 kernel: ad2: FAILURE - READ_DMA status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=181778119 I'm assuming that particular sector on the drive is dying, I have backed everything up on the drive, can anyone give me more information, should the drive simply be replaced or is it possible that this is simply a TOC error and could be corrected by newfs to the drive? I'm guessing it will need to be replaced, output of smartctl is below.... Thanks ~Richard uname -a >>FreeBSD 5.3-RELEASE FreeBSD 5.3-RELEASE #0: Mon May 2 22:32:50 EDT 2005 >>root@tco1:/usr/src/sys/i386/compile/TCO1.2005.05.02.001 i386 My output of smartmontools is: smartctl -a -s on /dev/ad2 smartctl version 5.36 [i386-portbld-freebsd5.3] Copyright (C) 2002-6 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family Device Model: ST3200822A Serial Number: 5LJ0LW2T Firmware Version: 3.01 User Capacity: 200,049,647,616 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 6 ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2 Local Time is: Fri Oct 13 14:56:23 2006 EDT SMART support is: Available - device has SMART capability. SMART support is: Disabled === START OF ENABLE/DISABLE COMMANDS SECTION === SMART Enabled. === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 430) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. No General Purpose Logging support. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 111) minutes. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 051 048 006 Pre-fail Always - 22488920 3 Spin_Up_Time 0x0003 097 097 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 21 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 1 7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail Always - 328020832 9 Power_On_Hours 0x0032 082 082 000 Old_age Always - 16043 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 22 194 Temperature_Celsius 0x0022 030 040 000 Old_age Always - 30 195 Hardware_ECC_Recovered 0x001a 051 048 000 Old_age Always - 22488920 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 1 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 1 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0 202 TA_Increase_Count 0x0032 051 204 000 Old_age Always - 49 SMART Error Log Version: 1 ATA Error Count: 7742 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 7742 occurred at disk power-on lifetime: 16036 hours (668 days + 4 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 15:22:37.737 READ DMA c8 00 04 9b b4 e1 ea 00 15:22:37.493 READ DMA c8 00 04 97 b4 e1 ea 00 15:22:37.251 READ DMA c8 00 04 a7 b4 e1 ea 00 15:22:37.002 READ DMA c8 00 04 a3 b4 e1 ea 00 15:22:36.761 READ DMA Error 7741 occurred at disk power-on lifetime: 16032 hours (668 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 11:08:40.154 READ DMA 35 00 20 df ff 2b 40 00 11:08:40.145 WRITE DMA EXT 35 00 20 1f d5 16 40 00 11:08:44.953 WRITE DMA EXT ca 00 20 3f c0 92 ef 00 11:08:40.258 WRITE DMA ca 00 20 df 85 81 ef 00 11:08:40.250 WRITE DMA Error 7740 occurred at disk power-on lifetime: 16012 hours (667 days + 4 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 15:49:49.473 READ DMA c8 00 04 9b b4 e1 ea 00 15:49:49.220 READ DMA c8 00 04 97 b4 e1 ea 00 15:49:52.420 READ DMA c8 00 04 a7 b4 e1 ea 00 15:49:52.175 READ DMA c8 00 04 a3 b4 e1 ea 00 15:49:51.929 READ DMA Error 7739 occurred at disk power-on lifetime: 16008 hours (667 days + 0 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 11:35:56.771 READ DMA 35 00 20 bf e7 39 40 00 11:35:56.765 WRITE DMA EXT 35 00 20 7f 6b 2e 40 00 11:35:56.749 WRITE DMA EXT 35 00 20 3f 0d c7 40 00 11:35:56.740 WRITE DMA EXT 35 00 20 1f 4f c1 40 00 11:35:56.732 WRITE DMA EXT Error 7738 occurred at disk power-on lifetime: 15989 hours (666 days + 5 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 04 c7 b6 d5 ea 00 16:16:27.719 READ DMA c8 00 04 9b b4 e1 ea 00 16:16:27.468 READ DMA c8 00 04 97 b4 e1 ea 00 16:16:30.682 READ DMA c8 00 04 a7 b4 e1 ea 00 16:16:30.440 READ DMA c8 00 04 a3 b4 e1 ea 00 16:16:30.174 READ DMA SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?452FE303.90002>