Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 5 Nov 2024 12:10:35 +0100
From:      Tomek CEDRO <tomek@cedro.info>
To:        Dave Cottlehuber <dch@freebsd.org>
Cc:        Warner Losh <imp@bsdimp.com>, freebsd-fs <freebsd-fs@freebsd.org>
Subject:   Re: nvme device errors & zfs
Message-ID:  <CAFYkXjkdvq29aFvNfkmFjb%2BZN8gPJgZFMr942iju=KVcwieDYw@mail.gmail.com>
In-Reply-To: <ad8551cc-a595-454b-8645-89a16f60ab0f@app.fastmail.com>
References:  <3293802b-3785-4715-8a6b-0802afb6f908@app.fastmail.com> <CANCZdfpPmVtt0wMWAYzhq4R0nkt39dg3S2-zVCCQcw%2BTSugkEg@mail.gmail.com> <ad8551cc-a595-454b-8645-89a16f60ab0f@app.fastmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Nov 5, 2024 at 10:15=E2=80=AFAM Dave Cottlehuber <dch@freebsd.org> =
wrote:
> these are samsung 990, mainly chosen for low price at the time:
> nda0: <Samsung SSD 990 PRO 2TB 0B2QJXG7 S7DNNJ0WC12665P>
> nda1: <Samsung SSD 990 PRO 2TB 0B2QJXG7 S7DNNJ0WC12664X>

These are pretty decent and not really cheap drives!

Magician software can upgrade firmware and perform other checks, works
on Windoze macOS and Android:

https://www.samsung.com/ca/support/model/MZ-V9P2T0B/AM/#downloads

> I forgot to mention dmesg prior:
> Oct 31 16:11:05 wintermute kernel[9406]: nvme1: Resetting controller due =
to a timeout.
> Oct 31 16:11:05 wintermute kernel[9406]: nvme1: event=3D"start"
> Oct 31 16:11:05 wintermute kernel[9406]: nvme1: Waiting for reset to comp=
lete
> Oct 31 16:11:05 wintermute kernel[9406]: nvme1: Waiting for reset to comp=
lete
> ... repeated x400

Another idea is maybe disk overheats and resets itself to cool down?

Lots of people in the reviews of various nvme drives asks about
temperature and suggests using heatsink ;-)


--=20
CeDeROM, SQ7MHZ, http://www.tomek.cedro.info



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAFYkXjkdvq29aFvNfkmFjb%2BZN8gPJgZFMr942iju=KVcwieDYw>