Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 13 Feb 2024 11:56:52 -0800
From:      Pete Wright <pete@nomadlogic.org>
To:        Don Lewis <truckman@FreeBSD.org>, Warner Losh <imp@bsdimp.com>
Cc:        Maxim Sobolev <sobomax@freebsd.org>, FreeBSD current <freebsd-current@freebsd.org>, John Baldwin <jhb@freebsd.org>
Subject:   Re: nvme controller reset failures on recent -CURRENT
Message-ID:  <65cddfff-84ab-45e4-bcc5-84fc8f5784cb@nomadlogic.org>
In-Reply-To: <tkrat.9717b2cdbbab83de@FreeBSD.org>
References:  <tkrat.edddc2469f43baf6@FreeBSD.org> <CAH7qZfunD154VYPD1vh_GNtOMM-quX=S00iQGvrpbhaegpXRnw@mail.gmail.com> <tkrat.76b39844cd6da514@FreeBSD.org> <CANCZdfrKeHJg5Tt-3cUq9hBgwwNqF4qnOWyFpF=TUjMdANOMfg@mail.gmail.com> <tkrat.9717b2cdbbab83de@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
>> There's a tiny chance that this could be something more exotic,
>> but my money is on hardware gone bad after 2 years of service. I don't think
>> this is 'wear out' of the NAND (it's only 15TB written, but it could be if
>> this
>> drive is really really crappy nand: first generation QLC maybe, but it seems
>> too new). It might also be a connector problem that's developed over time.
>> There might be a few other things too, but I don't think this is a U.2 drive
>> with funky cables.
> The system was probably idle the majority of those two years of power on
> time.
>
> It's one of these:
> https://www.techpowerup.com/ssd-specs/intel-660p-512-gb.d437
> I've seen comments that these generally don't need cooling.
>
> I just ordered a heatsink with some nice big fins, but it will take a
> week or more to arrive.


just wanted to add another data-point to this discussion.  i had a 
crucial NVME drive on my workstation that recently was showing similar 
problems.  after much debugging i came to the same conclusion that it 
was getting too hot.  i went ahead an purchased a Sabrent NVME drive 
that came with a heat sink.  i've also starting making much more use of 
my workstation (and the disk subsystem) and have had zero issues.

so lessons learnt:

1. M.2 nvme really does need proper cooling, much more so than 
traditional SATA/SAS/SCSI drives.

2. not all vendors do a great job reporting the health of devices

-pete

-- 
Pete Wright
pete@nomadlogic.org




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?65cddfff-84ab-45e4-bcc5-84fc8f5784cb>