Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 13 Dec 2017 09:54:43 +0100
From:      "O. Hartmann" <o.hartmann@walstatt.org>
To:        "Rodney W. Grimes" <freebsd-rwg@pdx.rh.CN85.dnsmgr.net>
Cc:        "O. Hartmann" <ohartmann@walstatt.org>, FreeBSD CURRENT <freebsd-current@freebsd.org>, Freddie Cash <fjwcash@gmail.com>, Alan Somers <asomers@freebsd.org>
Subject:   Re: SMART: disk problems on RAIDZ1 pool: (ada6:ahcich6:0:0:0): CAM status: ATA Status Error
Message-ID:  <20171213095510.4f025922@thor.intern.walstatt.dynvpn.de>
In-Reply-To: <201712122255.vBCMtnfZ088889@pdx.rh.CN85.dnsmgr.net>
References:  <20171212231858.294a2cb5@thor.intern.walstatt.dynvpn.de> <201712122255.vBCMtnfZ088889@pdx.rh.CN85.dnsmgr.net>

next in thread | previous in thread | raw e-mail | index | archive | help
--Sig_/mvP06IJ4=8CV0Ve8YnW_1hQ
Content-Type: multipart/mixed; boundary="MP_/Dd35P6LvSzBZKm5JLE2mpNQ"

--MP_/Dd35P6LvSzBZKm5JLE2mpNQ
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

Am Tue, 12 Dec 2017 14:55:49 -0800 (PST)
"Rodney W. Grimes" <freebsd-rwg@pdx.rh.CN85.dnsmgr.net> schrieb:

> > Am Tue, 12 Dec 2017 10:52:27 -0800 (PST)
> > "Rodney W. Grimes" <freebsd-rwg@pdx.rh.CN85.dnsmgr.net> schrieb:
> >=20
> >=20
> > Thank you for answering that fast!
> >  =20
> > > > Hello,
> > > >=20
> > > > running CURRENT (recent r326769), I realised that smartmond sends o=
ut some console
> > > > messages when booting the box:
> > > >=20
> > > > [...]
> > > > Dec 12 14:14:33 <3.2> box1 smartd[68426]: Device: /dev/ada6, 1 Curr=
ently
> > > > unreadable (pending) sectors Dec 12 14:14:33 <3.2> box1 smartd[6842=
6]:
> > > > Device: /dev/ada6, 1 Offline uncorrectable sectors
> > > > [...]
> > > >=20
> > > > Checking the drive's SMART log with smartctl (it is one of four 3TB=
 disk drives),
> > > > I gather these informations:
> > > >=20
> > > > [... smartctl -x /dev/ada6 ...]
> > > > Error 42 [17] occurred at disk power-on lifetime: 25335 hours (1055=
 days + 15
> > > > hours) When the command that caused the error occurred, the device =
was active or
> > > > idle.
> > > >=20
> > > >   After command completion occurred, registers were:
> > > >   ER -- ST COUNT  LBA_48  LH LM LL DV DC
> > > >   -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
> > > >   40 -- 51 00 00 00 00 c2 7a 72 98 40 00  Error: UNC at LBA =3D 0xc=
27a7298 =3D
> > > > 3262804632
> > > >=20
> > > >   Commands leading to the command that caused the error were:
> > > >   CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/=
Feature_Name
> > > >   -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -----=
----------  --------------------
> > > >   60 00 b0 00 88 00 00 c2 7a 73 20 40 08     23:38:12.195  READ FPD=
MA QUEUED
> > > >   60 00 b0 00 80 00 00 c2 7a 72 70 40 08     23:38:12.195  READ FPD=
MA QUEUED
> > > >   2f 00 00 00 01 00 00 00 00 00 10 40 08     23:38:12.195  READ LOG=
 EXT
> > > >   60 00 b0 00 70 00 00 c2 7a 73 20 40 08     23:38:09.343  READ FPD=
MA QUEUED
> > > >   60 00 b0 00 68 00 00 c2 7a 72 70 40 08     23:38:09.343  READ FPD=
MA QUEUED
> > > > [...]
> > > >=20
> > > > and
> > > >=20
> > > > [...]
> > > > SMART Attributes Data Structure revision number: 16
> > > > Vendor Specific SMART Attributes with Thresholds:
> > > > ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VA=
LUE
> > > >   1 Raw_Read_Error_Rate     POSR-K   200   200   051    -    64
> > > >   3 Spin_Up_Time            POS--K   178   170   021    -    6075
> > > >   4 Start_Stop_Count        -O--CK   098   098   000    -    2406
> > > >   5 Reallocated_Sector_Ct   PO--CK   200   200   140    -    0
> > > >   7 Seek_Error_Rate         -OSR-K   200   200   000    -    0
> > > >   9 Power_On_Hours          -O--CK   066   066   000    -    25339
> > > >  10 Spin_Retry_Count        -O--CK   100   100   000    -    0
> > > >  11 Calibration_Retry_Count -O--CK   100   100   000    -    0
> > > >  12 Power_Cycle_Count       -O--CK   098   098   000    -    2404
> > > > 192 Power-Off_Retract_Count -O--CK   200   200   000    -    154
> > > > 193 Load_Cycle_Count        -O--CK   001   001   000    -    2055746
> > > > 194 Temperature_Celsius     -O---K   122   109   000    -    28
> > > > 196 Reallocated_Event_Count -O--CK   200   200   000    -    0
> > > > 197 Current_Pending_Sector  -O--CK   200   200   000    -    1
> > > > 198 Offline_Uncorrectable   ----CK   200   200   000    -    1
> > > > 199 UDMA_CRC_Error_Count    -O--CK   200   200   000    -    0
> > > > 200 Multi_Zone_Error_Rate   ---R--   200   200   000    -    5
> > > >                             ||||||_ K auto-keep
> > > >                             |||||__ C event count
> > > >                             ||||___ R error rate
> > > >                             |||____ S speed/performance
> > > >                             ||_____ O updated online
> > > >                             |______ P prefailure warning
> > > >=20
> > > > [...]   =20
> > >=20
> > > The data up to this point informs us that you have 1 bad sector
> > > on a 3TB drive, that is actually an expected event given the data
> > > error rate on this stuff is such that your gona have these now
> > > and again.
> > >=20
> > > Given you have 1 single event I would not suspect that this drive
> > > is dying, but it would be prudent to prepare for that possibility. =20
> >=20
> > Hello.
> >=20
> > Well, I copied simply "one single event" that has been logged so far.
> >=20
> > As you (and I) can see, it is error #42. After I posted here, a reboot =
has taken place
> > because the "repair" process on the Pool suddenly increased time and no=
w I'm with
> > error #47, but interestingly, it is a new block that is damaged, but th=
e SMART
> > attribute fields show this for now: =20
>=20
> Can you send the complete output of smartctl -a /dev/foo, I somehow missed
> that 40+ other errors had occured.


Yes, here it is, but please do not beat me due to its size ;-). It is "smar=
tctl -x", that
shows me the errors. See file attached named "smart_ada.txt". It is everyth=
ing of
interest about the drive, I guess.


>=20
> > [...]
> > SMART Attributes Data Structure revision number: 16
> > Vendor Specific SMART Attributes with Thresholds:
> > ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
> >   1 Raw_Read_Error_Rate     POSR-K   200   200   051    -    69
> >   3 Spin_Up_Time            POS--K   178   170   021    -    6075
> >   4 Start_Stop_Count        -O--CK   098   098   000    -    2406
> >   5 Reallocated_Sector_Ct   PO--CK   200   200   140    -    0 =20
>=20
> Interesting, no reallocation has occured....
>=20
> >   7 Seek_Error_Rate         -OSR-K   200   200   000    -    0
> >   9 Power_On_Hours          -O--CK   066   066   000    -    25343
> >  10 Spin_Retry_Count        -O--CK   100   100   000    -    0
> >  11 Calibration_Retry_Count -O--CK   100   100   000    -    0
> >  12 Power_Cycle_Count       -O--CK   098   098   000    -    2404
> > 192 Power-Off_Retract_Count -O--CK   200   200   000    -    154
> > 193 Load_Cycle_Count        -O--CK   001   001   000    -    2055746 =20
>=20
> Hum, just noticed this.  25k hours power on, 2M load cycles, this is
> very hard on a hard drive.  Your drive is going into power save mode
> and unloading the heads.  Infact at a rate of 81 times per hour?
> Oh, I can not believe that.  Either way we need to get this stopped,
> it shall wear your drives out.
>=20
> > 194 Temperature_Celsius     -O---K   122   109   000    -    28
> > 196 Reallocated_Event_Count -O--CK   200   200   000    -    0
> > 197 Current_Pending_Sector  -O--CK   200   200   000    -    0
> > 198 Offline_Uncorrectable   ----CK   200   200   000    -    1
> > 199 UDMA_CRC_Error_Count    -O--CK   200   200   000    -    0
> > 200 Multi_Zone_Error_Rate   ---R--   200   200   000    -    5
> >                             ||||||_ K auto-keep
> >                             |||||__ C event count
> >                             ||||___ R error rate
> >                             |||____ S speed/performance
> >                             ||_____ O updated online
> >                             |______ P prefailure warning
> > [...]
> >=20
> >=20
> > 197 Current_Pending_Sector decreased to zero so far, but with every reb=
oot, the error
> > count seems to increase: =20
>=20
> Ok, some drive firmware well at the power on even try to test the
> pending sector list and clear it if it can actually read the sector.
>=20
> >=20
> > [...]
> > Error 47 [22] occurred at disk power-on lifetime: 25343 hours (1055 day=
s + 23 hours)
> >   When the command that caused the error occurred, the device was activ=
e or idle.
> >=20
> >   After command completion occurred, registers were:
> >   ER -- ST COUNT  LBA_48  LH LM LL DV DC
> >   -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
> >   40 -- 51 00 00 00 00 c2 19 d9 88 40 00  Error: UNC at LBA =3D 0xc219d=
988 =3D 3256473992
> >=20
> >   Commands leading to the command that caused the error were:
> >   CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feat=
ure_Name
> >   -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  ---------=
------  --------------------
> >   60 00 b0 00 d0 00 00 c2 19 da 28 40 08  1d+07:12:34.336  READ FPDMA Q=
UEUED
> >   60 00 b0 00 c8 00 00 c2 19 d9 78 40 08  1d+07:12:34.336  READ FPDMA Q=
UEUED
> >   2f 00 00 00 01 00 00 00 00 00 10 40 08  1d+07:12:34.336  READ LOG EXT
> >   60 00 b0 00 b8 00 00 c2 19 da 28 40 08  1d+07:12:31.484  READ FPDMA Q=
UEUED
> >   60 00 b0 00 b0 00 00 c2 19 d9 78 40 08  1d+07:12:31.483  READ FPDMA Q=
UEUED
> >=20
> >=20
> > I think this is watching a HDD dying, isn't it? =20
>=20
> It could be, need to see as many of the other 46 errors as we can to make
> a decision on that.   Probably only 5 in the log though.

As said above, see attached file ;-)

>=20
> > I'd say, a broken cabling would produce different errors, wouldn't it? =
=20
> Yes, there is a CRC error that would occur on cabling error.
>=20
> > The Western Digital Green series HDD is a useful fellow when the HDD is=
 used as a
> > single drive. I think there might be an issue with paring 4 HDDs, 3 of =
them "GREEN",
> > in a RAIDZ and physically sitting next to each other. Maybe it is time =
to replace
> > them one by one ... =20
>=20
> I am more suspecioius of them loading and unloading the head at a rate of
> more than once per minute!
>=20
[ ... schnipp ... ]


--=20
O. Hartmann

Ich widerspreche der Nutzung oder =C3=9Cbermittlung meiner Daten f=C3=BCr
Werbezwecke oder f=C3=BCr die Markt- oder Meinungsforschung (=C2=A7 28 Abs.=
 4 BDSG).

--MP_/Dd35P6LvSzBZKm5JLE2mpNQ
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable
Content-Disposition: attachment; filename=smart_ada.txt

smartctl 6.6 2017-11-05 r4594 [FreeBSD 12.0-CURRENT amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D
Model Family:     Western Digital Green
Device Model:     WDC WD30EZRX-00DC0B0
Serial Number:    WD-SERIALNUMBER
LU WWN Device Id: 5 0014ee 0ae168a02
Firmware Version: 80.00A80
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Wed Dec 13 09:42:51 2017 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
DSN feature is:   Unavailable
ATA Security is:  Disabled, frozen [SEC2]
Wt Cache Reorder: Enabled

=3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84)	Offline data collection activity
					was suspended by an interrupting command from host.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)	The previous self-test routine comp=
leted
					without error or no self-test has ever=20
					been run.
Total time to complete Offline=20
data collection: 		(38160) seconds.
Offline data collection
capabilities: 			 (0x7b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine=20
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 ( 383) minutes.
Conveyance self-test routine
recommended polling time: 	 (   5) minutes.
SCT capabilities: 	       (0x70b5)	SCT Status supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR-K   200   200   051    -    69
  3 Spin_Up_Time            POS--K   178   170   021    -    6058
  4 Start_Stop_Count        -O--CK   098   098   000    -    2407
  5 Reallocated_Sector_Ct   PO--CK   200   200   140    -    0
  7 Seek_Error_Rate         -OSR-K   200   200   000    -    0
  9 Power_On_Hours          -O--CK   066   066   000    -    25344
 10 Spin_Retry_Count        -O--CK   100   100   000    -    0
 11 Calibration_Retry_Count -O--CK   100   100   000    -    0
 12 Power_Cycle_Count       -O--CK   098   098   000    -    2405
192 Power-Off_Retract_Count -O--CK   200   200   000    -    154
193 Load_Cycle_Count        -O--CK   001   001   000    -    2055747
194 Temperature_Celsius     -O---K   130   109   000    -    20
196 Reallocated_Event_Count -O--CK   200   200   000    -    0
197 Current_Pending_Sector  -O--CK   200   200   000    -    0
198 Offline_Uncorrectable   ----CK   200   200   000    -    1
199 UDMA_CRC_Error_Count    -O--CK   200   200   000    -    0
200 Multi_Zone_Error_Rate   ---R--   200   200   000    -    5
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x02           SL  R/O      5  Comprehensive SMART error log
0x03       GPL     R/O      6  Ext. Comprehensive SMART error log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters log
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS      16  Device vendor specific log
0xa8-0xb7  GPL,SL  VS       1  Device vendor specific log
0xbd       GPL,SL  VS       1  Device vendor specific log
0xc0       GPL,SL  VS       1  Device vendor specific log
0xc1       GPL     VS      93  Device vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
Device Error Count: 47 (device log contains only the most recent 24 errors)
	CR     =3D Command Register
	FEATR  =3D Features Register
	COUNT  =3D Count (was: Sector Count) Register
	LBA_48 =3D Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
	LH     =3D LBA High (was: Cylinder High) Register    ]   LBA
	LM     =3D LBA Mid (was: Cylinder Low) Register      ] Register
	LL     =3D LBA Low (was: Sector Number) Register     ]
	DV     =3D Device (was: Device/Head) Register
	DC     =3D Device Control Register
	ER     =3D Error register
	ST     =3D Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=3Ddays, hh=3Dhours, mm=3Dminutes,
SS=3Dsec, and sss=3Dmillisec. It "wraps" after 49.710 days.

Error 47 [22] occurred at disk power-on lifetime: 25343 hours (1055 days + =
23 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 19 d9 88 40 00  Error: UNC at LBA =3D 0xc219d988 =
=3D 3256473992

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 d0 00 00 c2 19 da 28 40 08  1d+07:12:34.336  READ FPDMA QUEUED
  60 00 b0 00 c8 00 00 c2 19 d9 78 40 08  1d+07:12:34.336  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08  1d+07:12:34.336  READ LOG EXT
  60 00 b0 00 b8 00 00 c2 19 da 28 40 08  1d+07:12:31.484  READ FPDMA QUEUED
  60 00 b0 00 b0 00 00 c2 19 d9 78 40 08  1d+07:12:31.483  READ FPDMA QUEUED

Error 46 [21] occurred at disk power-on lifetime: 25343 hours (1055 days + =
23 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 19 d9 88 40 00  Error: UNC at LBA =3D 0xc219d988 =
=3D 3256473992

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 b8 00 00 c2 19 da 28 40 08  1d+07:12:31.484  READ FPDMA QUEUED
  60 00 b0 00 b0 00 00 c2 19 d9 78 40 08  1d+07:12:31.483  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08  1d+07:12:31.483  READ LOG EXT
  60 00 b0 00 a0 00 00 c2 19 da 28 40 08  1d+07:12:28.631  READ FPDMA QUEUED
  60 00 b0 00 98 00 00 c2 19 d9 78 40 08  1d+07:12:28.631  READ FPDMA QUEUED

Error 45 [20] occurred at disk power-on lifetime: 25343 hours (1055 days + =
23 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 19 d9 88 40 00  Error: UNC at LBA =3D 0xc219d988 =
=3D 3256473992

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 a0 00 00 c2 19 da 28 40 08  1d+07:12:28.631  READ FPDMA QUEUED
  60 00 b0 00 98 00 00 c2 19 d9 78 40 08  1d+07:12:28.631  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08  1d+07:12:28.630  READ LOG EXT
  60 00 b0 00 88 00 00 c2 19 da 28 40 08  1d+07:12:25.767  READ FPDMA QUEUED
  60 00 b0 00 80 00 00 c2 19 d9 78 40 08  1d+07:12:25.767  READ FPDMA QUEUED

Error 44 [19] occurred at disk power-on lifetime: 25343 hours (1055 days + =
23 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 19 d9 88 40 00  Error: UNC at LBA =3D 0xc219d988 =
=3D 3256473992

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 88 00 00 c2 19 da 28 40 08  1d+07:12:25.767  READ FPDMA QUEUED
  60 00 b0 00 80 00 00 c2 19 d9 78 40 08  1d+07:12:25.767  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08  1d+07:12:25.767  READ LOG EXT
  60 00 b0 00 70 00 00 c2 19 da 28 40 08  1d+07:12:22.936  READ FPDMA QUEUED
  60 00 b0 00 68 00 00 c2 19 d9 78 40 08  1d+07:12:22.936  READ FPDMA QUEUED

Error 43 [18] occurred at disk power-on lifetime: 25343 hours (1055 days + =
23 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 19 d9 88 40 00  Error: UNC at LBA =3D 0xc219d988 =
=3D 3256473992

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 70 00 00 c2 19 da 28 40 08  1d+07:12:22.936  READ FPDMA QUEUED
  60 00 b0 00 68 00 00 c2 19 d9 78 40 08  1d+07:12:22.936  READ FPDMA QUEUED
  60 00 a8 00 60 00 00 c2 19 d8 b8 40 08  1d+07:12:22.934  READ FPDMA QUEUED
  60 01 00 00 58 00 00 c2 19 d7 58 40 08  1d+07:12:22.934  READ FPDMA QUEUED
  60 01 00 00 50 00 00 c2 19 d6 50 40 08  1d+07:12:22.926  READ FPDMA QUEUED

Error 42 [17] occurred at disk power-on lifetime: 25335 hours (1055 days + =
15 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 7a 72 98 40 00  Error: UNC at LBA =3D 0xc27a7298 =
=3D 3262804632

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 88 00 00 c2 7a 73 20 40 08     23:38:12.195  READ FPDMA QUEUED
  60 00 b0 00 80 00 00 c2 7a 72 70 40 08     23:38:12.195  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08     23:38:12.195  READ LOG EXT
  60 00 b0 00 70 00 00 c2 7a 73 20 40 08     23:38:09.343  READ FPDMA QUEUED
  60 00 b0 00 68 00 00 c2 7a 72 70 40 08     23:38:09.343  READ FPDMA QUEUED

Error 41 [16] occurred at disk power-on lifetime: 25335 hours (1055 days + =
15 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 7a 72 98 40 00  Error: UNC at LBA =3D 0xc27a7298 =
=3D 3262804632

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 70 00 00 c2 7a 73 20 40 08     23:38:09.343  READ FPDMA QUEUED
  60 00 b0 00 68 00 00 c2 7a 72 70 40 08     23:38:09.343  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08     23:38:09.342  READ LOG EXT
  60 00 b0 00 58 00 00 c2 7a 73 20 40 08     23:38:06.490  READ FPDMA QUEUED
  60 00 b0 00 50 00 00 c2 7a 72 70 40 08     23:38:06.490  READ FPDMA QUEUED

Error 40 [15] occurred at disk power-on lifetime: 25335 hours (1055 days + =
15 hours)
  When the command that caused the error occurred, the device was active or=
 idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --
  40 -- 51 00 00 00 00 c2 7a 72 98 40 00  Error: UNC at LBA =3D 0xc27a7298 =
=3D 3262804632

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_=
Name
  -- =3D=3D -- =3D=3D -- =3D=3D =3D=3D =3D=3D -- -- -- -- --  -------------=
--  --------------------
  60 00 b0 00 58 00 00 c2 7a 73 20 40 08     23:38:06.490  READ FPDMA QUEUED
  60 00 b0 00 50 00 00 c2 7a 72 70 40 08     23:38:06.490  READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08     23:38:06.489  READ LOG EXT
  60 00 b0 00 40 00 00 c2 7a 73 20 40 08     23:38:03.637  READ FPDMA QUEUED
  60 00 b0 00 38 00 00 c2 7a 72 70 40 08     23:38:03.637  READ FPDMA QUEUED

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)=
  LBA_of_first_error
# 1  Short offline       Completed without error       00%     22725       =
  -
# 2  Short offline       Completed without error       00%      7313       =
  -
# 3  Extended offline    Completed without error       00%      4465       =
  -
# 4  Extended offline    Interrupted (host reset)      10%      4045       =
  -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       258 (0x0102)
SCT Support Level:                   1
Device State:                        Active (0)
Current Temperature:                    20 Celsius
Power Cycle Min/Max Temperature:     13/20 Celsius
Lifetime    Min/Max Temperature:     10/41 Celsius
Under/Over Temperature Limit Count:   0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -41/85 Celsius
Temperature History Size (Index):    478 (311)

Index    Estimated Time   Temperature Celsius
 312    2017-12-13 01:45    27  ********
 ...    ..( 98 skipped).    ..  ********
 411    2017-12-13 03:24    27  ********
 412    2017-12-13 03:25    28  *********
 ...    ..( 67 skipped).    ..  *********
   2    2017-12-13 04:33    28  *********
   3    2017-12-13 04:34     ?  -
   4    2017-12-13 04:35    13  -
   5    2017-12-13 04:36    13  -
   6    2017-12-13 04:37    14  -
   7    2017-12-13 04:38    15  -
   8    2017-12-13 04:39    16  -
   9    2017-12-13 04:40    16  -
  10    2017-12-13 04:41    16  -
  11    2017-12-13 04:42    17  -
  12    2017-12-13 04:43    17  -
  13    2017-12-13 04:44    17  -
  14    2017-12-13 04:45    18  -
  15    2017-12-13 04:46    18  -
  16    2017-12-13 04:47    19  -
  17    2017-12-13 04:48    19  -
  18    2017-12-13 04:49    19  -
  19    2017-12-13 04:50    20  *
 ...    ..(  7 skipped).    ..  *
  27    2017-12-13 04:58    20  *
  28    2017-12-13 04:59    27  ********
 ...    ..( 13 skipped).    ..  ********
  42    2017-12-13 05:13    27  ********
  43    2017-12-13 05:14    28  *********
 ...    ..(163 skipped).    ..  *********
 207    2017-12-13 07:58    28  *********
 208    2017-12-13 07:59    27  ********
 ...    ..(102 skipped).    ..  ********
 311    2017-12-13 09:42    27  ********

SCT Error Recovery Control command not supported

Device Statistics (GP/SMART Log 0x04) not supported

Pending Defects log (GP Log 0x0c) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0008  2            0  Device-to-host non-data FIS retries
0x0009  2            3  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2            3  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000f  2            0  R_ERR response for host-to-device data FIS, CRC
0x0012  2            0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4         1448  Vendor specific


--MP_/Dd35P6LvSzBZKm5JLE2mpNQ--

--Sig_/mvP06IJ4=8CV0Ve8YnW_1hQ
Content-Type: application/pgp-signature
Content-Description: OpenPGP digital signature

-----BEGIN PGP SIGNATURE-----

iLUEARMKAB0WIQQZVZMzAtwC2T/86TrS528fyFhYlAUCWjDq7gAKCRDS528fyFhY
lOt0AgCqmkPxxiiFp2RkDZmV2t90nbKYwRnVnmz83MaldwiNyETz+55wofCczcns
MYyhctct6YaelwcFIdObYop8FjkIAfoD01lTN8no2E6SSlOqSsV34S1lALB9sEPP
/6GirQ6nDF5jBO78vreiWlgB9Dl/fd5H/9EawF1GbP+CaMHHZt8I
=GUQW
-----END PGP SIGNATURE-----

--Sig_/mvP06IJ4=8CV0Ve8YnW_1hQ--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20171213095510.4f025922>