0 0 0 > diskid/DISK-WD-WX52D625R2TPp4 ONLINE 0 0 0 > diskid/DISK-WD-WCC4N1PYJ3F8p4 OFFLINE 0 0 0 > logs > diskid/DISK-S649NL0T819360Vp2 ONLINE 0 0 0 > cache > diskid/DISK-S649NL0T819360Vp3 ONLINE 0 0 0 > ---snip--- > > The offline marked partitions are on the same HD (the broken one). The > DISK-S649NL0T819360V device use as log and cache in the second pool > causes the issue on 16-current. > > On 16-current I get "uncorrectable parity/CRC error" messages on boot > from the broken disk. I used this to get rid of those errors: > ---snip--- > # grep kern.cam /tmp/be_mount.MhLw/boot/loader.conf > kern.cam.tur_timeout=3D"60" > kern.cam.inquiry_timeout=3D"60" > kern.cam.modesense_timeout=3D"60" > ---snip--- > > But the second pool ("space") fails to get imported. When I import it > via "zpool import -m space" it shows me that the log and cache devices > (different partitions on the same hardware) are not available. > This is the device in question as seen from 15-current: > ---snip--- > nda0: > nda0: Serial Number S649NL0T819360V > [1] nda0: nvme version 1.4 > nda0: 953869MB (1953525168 512 byte sectors) > [1] GEOM: new disk nda0 > ... > [1] pass6 at nvme0 bus 0 scbus6 target 0 lun 1 > pass6: > pass6: Serial Number S649NL0T819360V > [1] pass6: nvme version 1.4 > ---snip--- > > In case you need some info from the 15- or 16-current BE, which info do > you need? > > Bye, > Alexander. > > -- > http://www.Leidinger.net Alexander@Leidinger.net: PGP 0x8F31830F9F2772BF > http://www.FreeBSD.org netchild@FreeBSD.org : PGP 0x8F31830F9F2772BF > > > -- > http://www.Leidinger.net Alexander@Leidinger.net: PGP 0x8F31830F9F2772BF > http://www.FreeBSD.org netchild@FreeBSD.org : PGP 0x8F31830F9F2772BF > --000000000000d0827c06468d5538 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable


On Sun, Dec 21,= 2025 at 8:37=E2=80=AFAM Alexander Leidinger <Alexander@leidinger.net> wrote:

Am 2025-12-14 14:05, schrieb Wa= rner Losh:

Let's do one issue at=C2=A0a time. There's too mu= ch missing info. Top posting since there's=C2=A0 not a lot of context t= o this request=C2=A0
=C2=A0
The disk died now completely, so the CRC errors are out o= f reach now.
=C2=A0
First, let's start with pciconf -l of the nvme drive.= I have a strong idea, but need some data.
=C2=A0
While already provided privately with some other data, he= re for the public so that people are aware that currently there is an issue= with such drives:
nvme0@pci0:5:0:0: class=3D0x010802 rev=3D0x00 hdr=3D0x00 = vendor=3D0x144d device=3D0xa809 subvendor=3D0x144d subdevice=3D0xa801
Samsung SSD 980 1TB 2B4QFXO7 S649NL0T819360V
<= /div>

Yea, so far this is the only re= port I've received, and there's not enough data in it to reproduce = it with any of the dozen NVMe drives that I have, or to spot a difference w= ith what I know I check in the code. So if it's compiled into the kerne= l with cam also compiled into the kernel, I know it works.

Warner=C2=A0
=C2=A0
=
Bye,
Alexander.
=C2=A0
Also, the disk report needs full logs with and without th= e settings that have uncorrectable in them. I'd expect that a shorter t= imeout would lead to different behavior, but maybe that error syndrome isn&= #39;t one I've seen. It would also be helpful to know which of the time= s changes the behavior...
=C2=A0
Warner

On Sun, Dec 14, 2025, 5:06=E2=80=AFAM Alexander Leidinger = <Alexander@leidinger.net> wrote:
Hi Warner,

I try to update a 15-current = (as of 2025-11-27-110715) to a recent 16
(as of 2025-12-13-132815). It = fails to import a pool due to a missing
nvme. I also have a broken HD i= n this system... to be on the safe side I
mention it.

This is fr= om 15-current:
---snip---
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0NAME=C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0STATE=C2=A0 =C2=A0 =C2=A0READ WRITE CKSUM=
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0rpool=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 D= EGRADED=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
= =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0mirror-0=C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0DEGRADED= =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-WD-WCC4N4KLEZT7p3=C2= =A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 = =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-WD-W= CC4N1DF9DA2p3=C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00= =C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0di= skid/DISK-WD-WX52D625R0NTp3=C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 = =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0diskid/DISK-WD-WCC4N1PYJ3F8p3=C2=A0 OFFLINE=C2=A0 =C2=A0 =C2= =A0 0=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0logs
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-4935= 04058890547p1=C2=A0 =C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 = =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0cache
= =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-493504058890547p2=C2= =A0 =C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 = =C2=A0 =C2=A00

=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0NAME=C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0STATE=C2=A0 =C2=A0 =C2=A0READ WRITE CKSUM
=C2=A0= =C2=A0 =C2=A0 =C2=A0 =C2=A0space=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 DEGRADED=C2= =A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0raidz2-0=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0DEGRADED=C2=A0 =C2= =A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-WD-WCC4N4KLEZT7p4=C2=A0 ONLINE= =C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
= =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-WD-WCC4N1DF9DA2= p4=C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2= =A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-= WD-WX52D625R0NTp4=C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2= =A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0diskid/DISK-WD-WX52D625R2TPp4=C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00= =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0diskid/DISK-WD-WCC4N1PYJ3F8p4=C2=A0 OFFLINE=C2=A0 =C2= =A0 =C2=A0 0=C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0logs
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DI= SK-S649NL0T819360Vp2=C2=A0 =C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 = =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0ca= che
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0diskid/DISK-S649NL0T819360V= p3=C2=A0 =C2=A0 ONLINE=C2=A0 =C2=A0 =C2=A0 =C2=A00=C2=A0 =C2=A0 =C2=A00=C2= =A0 =C2=A0 =C2=A00
---snip---

The offline marked partitions are o= n the same HD (the broken one). The
DISK-S649NL0T819360V device use as = log and cache in the second pool
causes the issue on 16-current.
On 16-current I get "uncorrectable parity/CRC error" messages on= boot
from the broken disk. I used this to get rid of those errors:
= ---snip---
# grep kern.cam /tmp/be_mount.MhLw/boot/loader.conf
kern.c= am.tur_timeout=3D"60"
kern.cam.inquiry_timeout=3D"60"= ;
kern.cam.modesense_timeout=3D"60"
---snip---

But t= he second pool ("space") fails to get imported. When I import it =
via "zpool import -m space" it shows me that the log and cach= e devices
(different partitions on the same hardware) are not available= .
This is the device in question as seen from 15-current:
---snip---<= br>nda0: <Samsung SSD 980 1TB 2B4QFXO7 S649NL0T819360V>
nda0: Seri= al Number S649NL0T819360V
[1] nda0: nvme version 1.4
nda0: 953869MB (= 1953525168 512 byte sectors)
[1] GEOM: new disk nda0
...
[1] pass6= at nvme0 bus 0 scbus6 target 0 lun 1
pass6: <Samsung SSD 980 1TB 2B4= QFXO7 S649NL0T819360V>
pass6: Serial Number S649NL0T819360V
[1] pa= ss6: nvme version 1.4
---snip---

In case you need some info from = the 15- or 16-current BE, which info do
you need?

Bye,
Alexan= der.

--
http://www.Leidinger.net Alexander@Leidinger= .net: PGP 0x8F31830F9F2772BF
http://www.FreeBSD.org=C2=A0 =C2= =A0 netchild@FreeBSD.org=C2=A0 : PGP 0x8F31830F9F2772BF


--
http://= www.Leidinger.net Alexander@Leidinger.net: PGP 0x8F31830F9F2772BF
http:= //www.FreeBSD.org =C2=A0 =C2=A0netchild@FreeBSD.org =C2=A0: PGP 0x8F31830F9F2772BF
--000000000000d0827c06468d5538--