Date:      Tue, 29 Nov 2022 15:00:46 -0700
From:      Warner Losh <imp@bsdimp.com>
To:        Maxim Sobolev <sobomax@freebsd.org>
Cc:        FreeBSD User <freebsd@walstatt-de.de>, FreeBSD CURRENT <freebsd-current@freebsd.org>
Subject:   Re: CAM: extract HDD informations about failure/to fail?
Message-ID:  <CANCZdfr1aAraApZqHrThkNXB1mr%2B0BUAsdj9Lu-QjepDgL6FBA@mail.gmail.com>
In-Reply-To: <CAH7qZfuJNgLfG8k43AgZgw7oGgY4X%2BMrry7dKfQvesvQKffjzA@mail.gmail.com>
References:  <20221127161544.7dd1207c@thor.intern.walstatt.dynvpn.de> <CAH7qZfuJNgLfG8k43AgZgw7oGgY4X%2BMrry7dKfQvesvQKffjzA@mail.gmail.com>


[-- Attachment #1 --]
Average Latency would also do the trick.

Warner
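
One way to follow up on the gstat(8) queue-length and average-latency suggestions in this thread is to capture batch-mode output and average it per drive. The sketch below is only an illustration, not something posted in the thread. It assumes the default column layout of `gstat -b` (L(q), ops/s, r/s, kBps, ms/r, w/s, kBps, ms/w, %busy, then the provider name after a `|`); the function names and the sample numbers are made up.

```python
from collections import defaultdict
from statistics import mean

# Made-up sample in the assumed `gstat -b` layout; real logs will differ.
SAMPLE = """\
dT: 1.002s  w: 1.000s
 L(q)  ops/s    r/s   kBps   ms/r    w/s   kBps   ms/w   %busy Name
    1    120     60   7680    4.0     60   7680    6.0   40.0| da0
    9    118     58   7424   40.0     60   7680   60.0   99.0| da1
dT: 1.001s  w: 1.000s
 L(q)  ops/s    r/s   kBps   ms/r    w/s   kBps   ms/w   %busy Name
    3    122     62   7936    6.0     60   7680    8.0   45.0| da0
   11    110     50   6400   60.0     60   7680   80.0  100.0| da1
"""


def parse_gstat(text):
    """Return {drive: [(queue_len, ms_per_read, ms_per_write), ...]}."""
    samples = defaultdict(list)
    for line in text.splitlines():
        if "|" not in line:
            continue  # skips the "dT:" lines and the column header
        stats, name = line.split("|", 1)
        fields = stats.split()
        if len(fields) != 9:
            continue  # not the column layout this sketch assumes
        try:
            values = [float(f) for f in fields]
        except ValueError:
            continue  # non-numeric row, ignore it
        # Column 0 is L(q), column 4 is ms/r, column 7 is ms/w.
        samples[name.strip()].append((values[0], values[4], values[7]))
    return dict(samples)


def summarize(samples):
    """Return {drive: (mean queue_len, mean ms/r, mean ms/w)}."""
    return {
        name: tuple(mean(column) for column in zip(*rows))
        for name, rows in samples.items()
    }


if __name__ == "__main__":
    for drive, (qlen, ms_r, ms_w) in sorted(summarize(parse_gstat(SAMPLE)).items()):
        print(f"{drive}: L(q)={qlen:.1f} ms/r={ms_r:.1f} ms/w={ms_w:.1f}")
```

A log for it could be collected with something like `gstat -p -b -I 1s >> gstat.log` run in a loop while the pool is under load (file name and interval are placeholders). A drive whose mean queue length or latency sits well above its siblings would be the first candidate to inspect; the per-sample tuples can also be plotted against time, as suggested in the quoted message.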

On Tue, Nov 29, 2022 at 2:20 PM Maxim Sobolev <sobomax@freebsd.org> wrote:

> Perhaps if you log the r/w queue length for all 4 drives at a reasonable
> interval (say 1 second) under load using gstat(8) and plot all 4 as a
> function of time on the same graph, you should have no problem visually
> identifying the culprit(s). At least that's how I would do it.
>
> -Maksym
>
> On Sun, Nov 27, 2022, 7:15 AM FreeBSD User <freebsd@walstatt-de.de> wrote:
>
>> Hello,
>>
>> well, the aim of my post sounds strange, but I'm serious.
>> Background: I run at home a 14-CURRENT based server with a ZFS volume
>> (RAIDZ) composed of
>> 4x 4 TB HDD. A couple of days ago I had to exchange the HGST NAS drives since
>> one got a permanent
>> SMART error. So all HDDs have now been replaced with four Seagate
>> IronWolf Pro 4TB
>> drives. So far, so good.
>> Now I face a weird sound coming from one of the new HDDs. The box is
>> supposed to be a heavy
>> duty poudriere build facility, so the drives are up 24/7. It seems that
>> one (or even more)
>> drives emit a weird sound, as if the spindle motor were losing power for a
>> fraction of a second
>> and then spinning the drive up again. Searching the net reveals that at
>> least one Seagate
>> customer had the same issue, and he provided an audio file of that
>> very weird sound, to be
>> found here:
>>
>> Post at reddit:
>>
>> https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/
>>
>> and, within that post, the link to the audio file:
>>
>>  https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file
>>
>> I checked S.M.A.R.T for any unusual data, but everything is fine. The
>> values for
>>
>> Power_Cycle_Count
>> Power-Off_Retract_Count
>> Start_Stop_Count
>>
>> all seem within a reasonable range compared to the lifetime in hours
>> (did some simple
>> statistics); nothing looks unusual.
>>
>> Also, the advanced view onto each drive via
>>
>> smartctl -x
>>
>> doesn't give me any hint of a power failure as a source for the noise.
>>
>> So, the big question here is: the drives are attached to an HBA, an LSI3008 based
>> SAS9300-8i. Is it
>> possible to retrieve via CAM more health parameters than those gathered
>> by SMART/smartmontools
>> and if the answer is yes, how can this be achieved?
>> It is close to impossible to isolate the drive making the noise. My gut
>> tells me to RMA the
>> supposedly faulty drive and not to wait until it dies from "spindle
>> motor disease" or
>> whatever is the source of the noises.
>>
>> Thanks in advance,
>>
>> oh
>>
>>
>> --
>> O. Hartmann
>>
>>


Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CANCZdfr1aAraApZqHrThkNXB1mr%2B0BUAsdj9Lu-QjepDgL6FBA>