Date: Tue, 29 Nov 2022 13:20:13 -0800 From: Maxim Sobolev <sobomax@freebsd.org> To: FreeBSD User <freebsd@walstatt-de.de> Cc: FreeBSD CURRENT <freebsd-current@freebsd.org> Subject: Re: CAM: extract HDD informations about failure/to fail? Message-ID: <CAH7qZfuJNgLfG8k43AgZgw7oGgY4X%2BMrry7dKfQvesvQKffjzA@mail.gmail.com> In-Reply-To: <20221127161544.7dd1207c@thor.intern.walstatt.dynvpn.de>
index | next in thread | previous in thread | raw e-mail
[-- Attachment #1 --] Perhaps if you log r/w queue length for all 4 drives with a reasonable interval (say 1 second) under the load using gstat(8) and plot all 4 as function of time on the same graph you should have no problem to visually identify the culprit(s). At least that's how I would do it. -Maksym On Sun, Nov 27, 2022, 7:15 AM FreeBSD User <freebsd@walstatt-de.de> wrote: > Hello, > > well, the aim of my post sounds strange, but I'm serious. > Background: I run at home a 14-CURRENT based server with a ZFS volume > (RAIDZ) comprised from > 4x 4 TB HDD. A couple of days I had to exchange the HGST NAS drives since > one got a permanent > SMART error. So all HDDs have been replaced by now with four times Seagte > IronWolfe Pro 4TB > drives. So far, so good. > Now I face a weird sound sourcing at one of the new HDDs. The box is > supposed to be a heavy > duty poudriere build facility, so the drives are up 24/7. It seems that > one (or even more) > drives emitt a weird sound like the spindle motor is loosing for a > fraction of a second power > and spiining up the the drive again. Searching the net reveals that at > least one Seagate > customer did have the same issue and he provided an audio file of that > very weird sound, to be > found here: > > Post at reddit: > > https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/ > > and herin the post of the audio file: > > https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file > > I checked S.M.A.R.T for any unusual data, but everything is fine. The > values for > > Power_Cycle_Count > Power-Off_Retract_Count > Start_Stop_Count > > seem all within a reasonable range compared to the life time in hours (did > some simple > statistsics ), nothing looks unusual. > > Also, the advanced view onto each drive via > > smartctl -x > > doesn't give me any hint of a power failure as a source for the noise. > > So, big question here is: the drives are attached to a HBA, LSI3008 based > SAS9300-8i. Is it > possible to retrieve via CAM more health paramteres than those gathered by > SMART/smartmontools > and if the answer is yes, how can this be achieved? > It close to impossible to isolate the drive making the noise. My guts tell > me to RMA the > supposed to be faulty drive and not to wait until it dies from "spindle > motor desease" or > something that is the source for the noises. > > Thanks in advance, > > oh > > > -- > O. Hartmann > > [-- Attachment #2 --] <div dir="auto"><div style="font-size:12.8px" dir="auto">Perhaps if you log r/w queue length for all 4 drives with a reasonable interval (say 1 second) under the load using gstat(8) and plot all 4 as function of time on the same graph you should have no problem to visually identify the culprit(s). At least that's how I would do it.</div><div dir="auto" style="font-size:12.8px"><br></div><div dir="auto" style="font-size:12.8px">-Maksym</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sun, Nov 27, 2022, 7:15 AM FreeBSD User <<a href="mailto:freebsd@walstatt-de.de">freebsd@walstatt-de.de</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hello,<br> <br> well, the aim of my post sounds strange, but I'm serious.<br> Background: I run at home a 14-CURRENT based server with a ZFS volume (RAIDZ) comprised from<br> 4x 4 TB HDD. A couple of days I had to exchange the HGST NAS drives since one got a permanent<br> SMART error. So all HDDs have been replaced by now with four times Seagte IronWolfe Pro 4TB<br> drives. So far, so good.<br> Now I face a weird sound sourcing at one of the new HDDs. The box is supposed to be a heavy<br> duty poudriere build facility, so the drives are up 24/7. It seems that one (or even more)<br> drives emitt a weird sound like the spindle motor is loosing for a fraction of a second power<br> and spiining up the the drive again. Searching the net reveals that at least one Seagate<br> customer did have the same issue and he provided an audio file of that very weird sound, to be<br> found here:<br> <br> Post at reddit:<br> <a href="https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/" rel="noreferrer noreferrer" target="_blank">https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/</a><br> <br> and herin the post of the audio file:<br> <br> <a href="https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file" rel="noreferrer noreferrer" target="_blank">https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file</a><br> <br> I checked S.M.A.R.T for any unusual data, but everything is fine. The values for <br> <br> Power_Cycle_Count<br> Power-Off_Retract_Count<br> Start_Stop_Count<br> <br> seem all within a reasonable range compared to the life time in hours (did some simple<br> statistsics ), nothing looks unusual.<br> <br> Also, the advanced view onto each drive via <br> <br> smartctl -x<br> <br> doesn't give me any hint of a power failure as a source for the noise.<br> <br> So, big question here is: the drives are attached to a HBA, LSI3008 based SAS9300-8i. Is it<br> possible to retrieve via CAM more health paramteres than those gathered by SMART/smartmontools<br> and if the answer is yes, how can this be achieved?<br> It close to impossible to isolate the drive making the noise. My guts tell me to RMA the<br> supposed to be faulty drive and not to wait until it dies from "spindle motor desease" or<br> something that is the source for the noises.<br> <br> Thanks in advance,<br> <br> oh<br> <br> <br> -- <br> O. Hartmann<br> <br> </blockquote></div>home | help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAH7qZfuJNgLfG8k43AgZgw7oGgY4X%2BMrry7dKfQvesvQKffjzA>
