Date: Thu, 8 Jun 2023 04:48:22 -0700 From: Warner Losh <imp@bsdimp.com> To: Rebecca Cran <rebecca@bsdio.com> Cc: Tomek CEDRO <tomek@cedro.info>, FreeBSD CURRENT <freebsd-current@freebsd.org> Subject: Re: Seemingly random nvme (nda) write error on new drive (retries exhausted) Message-ID: <CANCZdfrCY7PfopRy27_wBLBzS%2BaCq5GUAGtpK2pra1ZjyUrdYA@mail.gmail.com> In-Reply-To: <b94e421a-60de-5dd7-dc51-3b7f8ed6d36e@bsdio.com> References: <5b52fc08-fb5a-900e-b98c-817a4ab79846@bsdio.com> <CAFYkXjn%2BhPdBrCwvwin-sus66Nem%2BvA3mHPuc7jJPrn6F_dP2Q@mail.gmail.com> <b94e421a-60de-5dd7-dc51-3b7f8ed6d36e@bsdio.com>
index | next in thread | previous in thread | raw e-mail
[-- Attachment #1 --] On Thu, Jun 8, 2023, 4:35 AM Rebecca Cran <rebecca@bsdio.com> wrote: > It's ZFS, using the default options when creating it via the FreeBSD > installer so I presume TRIM is enabled. Without a reliable way to > reproduce the error I'm not sure disabling TRIM will help at the moment. > > I don't think there's any newer firmware for it. > pci gen 4 has a highter error rate so that needs to be managed with retries. There's a whole protocol to do that which linux implements. I suspect the time has come for us to do so too. There's some code floating around I'll have to track down. Warner -- > > Rebecca Cran > > > On 6/8/23 04:25, Tomek CEDRO wrote: > > what filesystem? is TRIM enabled on that drive? have you tried > > disabling trim? i had similar ssd related problem on samsung's ssd > > long time ago that was related to trim. maybe drive firmware can be > > updated too? :-) > > > > -- > > CeDeROM, SQ7MHZ, http://www.tomek.cedro.info > > [-- Attachment #2 --] <div dir="auto"><div><br><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Jun 8, 2023, 4:35 AM Rebecca Cran <<a href="mailto:rebecca@bsdio.com">rebecca@bsdio.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">It's ZFS, using the default options when creating it via the FreeBSD <br> installer so I presume TRIM is enabled. Without a reliable way to <br> reproduce the error I'm not sure disabling TRIM will help at the moment.<br> <br> I don't think there's any newer firmware for it.<br></blockquote></div></div><div dir="auto"><br></div><div dir="auto">pci gen 4 has a highter error rate so that needs to be managed with retries. There's a whole protocol to do that which linux implements. I suspect the time has come for us to do so too. There's some code floating around I'll have to track down.</div><div dir="auto"><br></div><div dir="auto">Warner</div><div dir="auto"><br></div><div dir="auto"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> -- <br> <br> Rebecca Cran<br> <br> <br> On 6/8/23 04:25, Tomek CEDRO wrote:<br> > what filesystem? is TRIM enabled on that drive? have you tried <br> > disabling trim? i had similar ssd related problem on samsung's ssd <br> > long time ago that was related to trim. maybe drive firmware can be <br> > updated too? :-)<br> ><br> > --<br> > CeDeROM, SQ7MHZ, <a href="http://www.tomek.cedro.info" rel="noreferrer noreferrer" target="_blank">http://www.tomek.cedro.info</a><br> <br> </blockquote></div></div></div>help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CANCZdfrCY7PfopRy27_wBLBzS%2BaCq5GUAGtpK2pra1ZjyUrdYA>
