Date: Tue, 29 Nov 2022 15:00:46 -0700 From: Warner Losh <imp@bsdimp.com> To: Maxim Sobolev <sobomax@freebsd.org> Cc: FreeBSD User <freebsd@walstatt-de.de>, FreeBSD CURRENT <freebsd-current@freebsd.org> Subject: Re: CAM: extract HDD informations about failure/to fail? Message-ID: <CANCZdfr1aAraApZqHrThkNXB1mr%2B0BUAsdj9Lu-QjepDgL6FBA@mail.gmail.com> In-Reply-To: <CAH7qZfuJNgLfG8k43AgZgw7oGgY4X%2BMrry7dKfQvesvQKffjzA@mail.gmail.com> References: <20221127161544.7dd1207c@thor.intern.walstatt.dynvpn.de> <CAH7qZfuJNgLfG8k43AgZgw7oGgY4X%2BMrry7dKfQvesvQKffjzA@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
[-- Attachment #1 --] Average Latency would also do the trick. Warner On Tue, Nov 29, 2022 at 2:20 PM Maxim Sobolev <sobomax@freebsd.org> wrote: > Perhaps if you log r/w queue length for all 4 drives with a reasonable > interval (say 1 second) under the load using gstat(8) and plot all 4 as > function of time on the same graph you should have no problem to visually > identify the culprit(s). At least that's how I would do it. > > -Maksym > > On Sun, Nov 27, 2022, 7:15 AM FreeBSD User <freebsd@walstatt-de.de> wrote: > >> Hello, >> >> well, the aim of my post sounds strange, but I'm serious. >> Background: I run at home a 14-CURRENT based server with a ZFS volume >> (RAIDZ) comprised from >> 4x 4 TB HDD. A couple of days I had to exchange the HGST NAS drives since >> one got a permanent >> SMART error. So all HDDs have been replaced by now with four times Seagte >> IronWolfe Pro 4TB >> drives. So far, so good. >> Now I face a weird sound sourcing at one of the new HDDs. The box is >> supposed to be a heavy >> duty poudriere build facility, so the drives are up 24/7. It seems that >> one (or even more) >> drives emitt a weird sound like the spindle motor is loosing for a >> fraction of a second power >> and spiining up the the drive again. Searching the net reveals that at >> least one Seagate >> customer did have the same issue and he provided an audio file of that >> very weird sound, to be >> found here: >> >> Post at reddit: >> >> https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/ >> >> and herin the post of the audio file: >> >> https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file >> >> I checked S.M.A.R.T for any unusual data, but everything is fine. The >> values for >> >> Power_Cycle_Count >> Power-Off_Retract_Count >> Start_Stop_Count >> >> seem all within a reasonable range compared to the life time in hours >> (did some simple >> statistsics ), nothing looks unusual. >> >> Also, the advanced view onto each drive via >> >> smartctl -x >> >> doesn't give me any hint of a power failure as a source for the noise. >> >> So, big question here is: the drives are attached to a HBA, LSI3008 based >> SAS9300-8i. Is it >> possible to retrieve via CAM more health paramteres than those gathered >> by SMART/smartmontools >> and if the answer is yes, how can this be achieved? >> It close to impossible to isolate the drive making the noise. My guts >> tell me to RMA the >> supposed to be faulty drive and not to wait until it dies from "spindle >> motor desease" or >> something that is the source for the noises. >> >> Thanks in advance, >> >> oh >> >> >> -- >> O. Hartmann >> >> [-- Attachment #2 --] <div dir="ltr"><div>Average Latency would also do the trick.</div><div><br></div><div>Warner</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Nov 29, 2022 at 2:20 PM Maxim Sobolev <<a href="mailto:sobomax@freebsd.org">sobomax@freebsd.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"><div style="font-size:12.8px" dir="auto">Perhaps if you log r/w queue length for all 4 drives with a reasonable interval (say 1 second) under the load using gstat(8) and plot all 4 as function of time on the same graph you should have no problem to visually identify the culprit(s). At least that's how I would do it.</div><div dir="auto" style="font-size:12.8px"><br></div><div dir="auto" style="font-size:12.8px">-Maksym</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sun, Nov 27, 2022, 7:15 AM FreeBSD User <<a href="mailto:freebsd@walstatt-de.de" target="_blank">freebsd@walstatt-de.de</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hello,<br> <br> well, the aim of my post sounds strange, but I'm serious.<br> Background: I run at home a 14-CURRENT based server with a ZFS volume (RAIDZ) comprised from<br> 4x 4 TB HDD. A couple of days I had to exchange the HGST NAS drives since one got a permanent<br> SMART error. So all HDDs have been replaced by now with four times Seagte IronWolfe Pro 4TB<br> drives. So far, so good.<br> Now I face a weird sound sourcing at one of the new HDDs. The box is supposed to be a heavy<br> duty poudriere build facility, so the drives are up 24/7. It seems that one (or even more)<br> drives emitt a weird sound like the spindle motor is loosing for a fraction of a second power<br> and spiining up the the drive again. Searching the net reveals that at least one Seagate<br> customer did have the same issue and he provided an audio file of that very weird sound, to be<br> found here:<br> <br> Post at reddit:<br> <a href="https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/" rel="noreferrer noreferrer" target="_blank">https://www.reddit.com/r/techsupport/comments/sca6al/seagate_ironwolf_pro_making_weird_noise/</a><br> <br> and herin the post of the audio file:<br> <br> <a href="https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file" rel="noreferrer noreferrer" target="_blank">https://www.mediafire.com/file/x3le816qsakiff9/Hdd.mp4/file</a><br> <br> I checked S.M.A.R.T for any unusual data, but everything is fine. The values for <br> <br> Power_Cycle_Count<br> Power-Off_Retract_Count<br> Start_Stop_Count<br> <br> seem all within a reasonable range compared to the life time in hours (did some simple<br> statistsics ), nothing looks unusual.<br> <br> Also, the advanced view onto each drive via <br> <br> smartctl -x<br> <br> doesn't give me any hint of a power failure as a source for the noise.<br> <br> So, big question here is: the drives are attached to a HBA, LSI3008 based SAS9300-8i. Is it<br> possible to retrieve via CAM more health paramteres than those gathered by SMART/smartmontools<br> and if the answer is yes, how can this be achieved?<br> It close to impossible to isolate the drive making the noise. My guts tell me to RMA the<br> supposed to be faulty drive and not to wait until it dies from "spindle motor desease" or<br> something that is the source for the noises.<br> <br> Thanks in advance,<br> <br> oh<br> <br> <br> -- <br> O. Hartmann<br> <br> </blockquote></div> </blockquote></div></div>
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CANCZdfr1aAraApZqHrThkNXB1mr%2B0BUAsdj9Lu-QjepDgL6FBA>
