Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 14 Dec 2023 14:25:13 -0800
From:      Xin LI <delphij@gmail.com>
To:        Pete Wright <pete@nomadlogic.org>
Cc:        Lexi Winter <lexi@le-fay.org>, "freebsd-fs@freebsd.org" <freebsd-fs@freebsd.org>
Subject:   Re: unusual ZFS issue
Message-ID:  <CAGMYy3sp-6T0GYEBVG1WSkn4hzvQd15XuwD8bvx1qg1Qys%2BpCg@mail.gmail.com>
In-Reply-To: <ec82ce34-6463-4200-960a-c820605d0cfd@nomadlogic.org>
References:  <787CB64A-1687-49C3-9063-2CE3B6F957EF@le-fay.org> <twivq3fadiotyfpblbqzfulx2wvel5povfic7626hktylqr4gg@uxipr5j7igs5> <C210159C-4747-45A1-9FB1-1708D6A21DE8@le-fay.org> <ec82ce34-6463-4200-960a-c820605d0cfd@nomadlogic.org>

next in thread | previous in thread | raw e-mail | index | archive | help
--0000000000008bebbd060c7fc204
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Try "zpool status -x" and see if it would show something useful?

Cheers,

On Thu, Dec 14, 2023 at 2:10=E2=80=AFPM Pete Wright <pete@nomadlogic.org> w=
rote:

>
>
> On 12/14/23 2:05 PM, Lexi Winter wrote:
> > On 14 Dec 2023, at 22:02, Pete Wright <pete@nomadlogic.org> wrote:
> >> On Thu, Dec 14, 2023 at 09:17:06PM +0000, Lexi Winter wrote:
> >>> hi list,
> >>>
> >>> i=E2=80=99ve just hit this ZFS error:
> >>>
> >>> # zfs list -rt snapshot data/vm/media/disk1
> >>> cannot iterate filesystems: I/O error
> >>
> >> hrm, i wonder if you see any errors in dmesg or /var/log/messages abou=
t
> a
> >> device failing?
> >
> > nothing that looks relevant in the last few days (the problem appeared
> last night, Dec 13th):
> >
> > Dec 11 15:44:21 hemlock kernel: ix1: link state changed to DOWN
> > Dec 11 15:44:21 hemlock kernel: ix1.107: link state changed to DOWN
> > Dec 11 15:44:35 hemlock kernel: ix1: link state changed to UP
> > Dec 11 15:44:35 hemlock kernel: ix1.107: link state changed to UP
> > Dec 11 15:44:47 hemlock kernel: nfsrv_cache_session: no session
> IPaddr=3D2001:8b0:aab5:ffff::2, check NFS clients for unique /etc/hostid'=
s
> > Dec 11 15:44:47 hemlock syslogd: last message repeated 1 times
> > Dec 11 17:00:48 hemlock kernel: tcp_vnet_init: WARNING: unable to
> initialise TCP stats
> > Dec 11 17:00:48 hemlock kernel: lo0: link state changed to UP
> > Dec 12 06:17:23 hemlock ntpd[25836]: leapsecond file
> ('/var/db/ntpd.leap-seconds.list'): will expire in less than 16 days
> > Dec 13 06:17:23 hemlock ntpd[25836]: leapsecond file
> ('/var/db/ntpd.leap-seconds.list'): will expire in less than 15 days
> > Dec 14 06:17:23 hemlock ntpd[25836]: leapsecond file
> ('/var/db/ntpd.leap-seconds.list'): will expire in less than 14 days
> > Dec 14 16:30:12 hemlock smbd[98264]: [2023/12/14 16:30:12.404883,  0]
> ../../source3/smbd/server.c:1741(main)
> > Dec 14 16:30:12 hemlock smbd[98264]:   smbd version 4.16.11 started.
> > Dec 14 16:30:12 hemlock smbd[98264]:   Copyright Andrew Tridgell and th=
e
> Samba Team 1992-2022
> >
> > i=E2=80=99ve also checked the disks with smartctl and i didn=E2=80=99t =
see any errors
> there.  (a couple of devices have corrected read errors, but that=E2=80=
=99s
> expected given their age - and if it *was* a disk error i=E2=80=99d expec=
t it to
> show up as a checksum error).
> >
>
> dang, was hoping something obvious would pop up there or with smartctl.
> hopefully others here have some ideas about trying to find the root
> cause before a restart.
>
> -pete
>
> --
> Pete Wright
> pete@nomadlogic.org
>
>

--0000000000008bebbd060c7fc204
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_default" style=3D"font-family:monospac=
e,monospace">Try &quot;zpool status -x&quot; and see if it would show somet=
hing useful?</div><div class=3D"gmail_default" style=3D"font-family:monospa=
ce,monospace"><br></div><div class=3D"gmail_default" style=3D"font-family:m=
onospace,monospace">Cheers,<br></div></div><br><div class=3D"gmail_quote"><=
div dir=3D"ltr" class=3D"gmail_attr">On Thu, Dec 14, 2023 at 2:10=E2=80=AFP=
M Pete Wright &lt;<a href=3D"mailto:pete@nomadlogic.org">pete@nomadlogic.or=
g</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin=
:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"=
><br>
<br>
On 12/14/23 2:05 PM, Lexi Winter wrote:<br>
&gt; On 14 Dec 2023, at 22:02, Pete Wright &lt;<a href=3D"mailto:pete@nomad=
logic.org" target=3D"_blank">pete@nomadlogic.org</a>&gt; wrote:<br>
&gt;&gt; On Thu, Dec 14, 2023 at 09:17:06PM +0000, Lexi Winter wrote:<br>
&gt;&gt;&gt; hi list,<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; i=E2=80=99ve just hit this ZFS error:<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; # zfs list -rt snapshot data/vm/media/disk1<br>
&gt;&gt;&gt; cannot iterate filesystems: I/O error<br>
&gt;&gt;<br>
&gt;&gt; hrm, i wonder if you see any errors in dmesg or /var/log/messages =
about a<br>
&gt;&gt; device failing?<br>
&gt; <br>
&gt; nothing that looks relevant in the last few days (the problem appeared=
 last night, Dec 13th):<br>
&gt; <br>
&gt; Dec 11 15:44:21 hemlock kernel: ix1: link state changed to DOWN<br>
&gt; Dec 11 15:44:21 hemlock kernel: ix1.107: link state changed to DOWN<br=
>
&gt; Dec 11 15:44:35 hemlock kernel: ix1: link state changed to UP<br>
&gt; Dec 11 15:44:35 hemlock kernel: ix1.107: link state changed to UP<br>
&gt; Dec 11 15:44:47 hemlock kernel: nfsrv_cache_session: no session IPaddr=
=3D2001:8b0:aab5:ffff::2, check NFS clients for unique /etc/hostid&#39;s<br=
>
&gt; Dec 11 15:44:47 hemlock syslogd: last message repeated 1 times<br>
&gt; Dec 11 17:00:48 hemlock kernel: tcp_vnet_init: WARNING: unable to init=
ialise TCP stats<br>
&gt; Dec 11 17:00:48 hemlock kernel: lo0: link state changed to UP<br>
&gt; Dec 12 06:17:23 hemlock ntpd[25836]: leapsecond file (&#39;/var/db/ntp=
d.leap-seconds.list&#39;): will expire in less than 16 days<br>
&gt; Dec 13 06:17:23 hemlock ntpd[25836]: leapsecond file (&#39;/var/db/ntp=
d.leap-seconds.list&#39;): will expire in less than 15 days<br>
&gt; Dec 14 06:17:23 hemlock ntpd[25836]: leapsecond file (&#39;/var/db/ntp=
d.leap-seconds.list&#39;): will expire in less than 14 days<br>
&gt; Dec 14 16:30:12 hemlock smbd[98264]: [2023/12/14 16:30:12.404883,=C2=
=A0 0] ../../source3/smbd/server.c:1741(main)<br>
&gt; Dec 14 16:30:12 hemlock smbd[98264]:=C2=A0 =C2=A0smbd version 4.16.11 =
started.<br>
&gt; Dec 14 16:30:12 hemlock smbd[98264]:=C2=A0 =C2=A0Copyright Andrew Trid=
gell and the Samba Team 1992-2022<br>
&gt; <br>
&gt; i=E2=80=99ve also checked the disks with smartctl and i didn=E2=80=99t=
 see any errors there.=C2=A0 (a couple of devices have corrected read error=
s, but that=E2=80=99s expected given their age - and if it *was* a disk err=
or i=E2=80=99d expect it to show up as a checksum error).<br>
&gt; <br>
<br>
dang, was hoping something obvious would pop up there or with smartctl. <br=
>
hopefully others here have some ideas about trying to find the root <br>
cause before a restart.<br>
<br>
-pete<br>
<br>
-- <br>
Pete Wright<br>
<a href=3D"mailto:pete@nomadlogic.org" target=3D"_blank">pete@nomadlogic.or=
g</a><br>
<br>
</blockquote></div>

--0000000000008bebbd060c7fc204--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAGMYy3sp-6T0GYEBVG1WSkn4hzvQd15XuwD8bvx1qg1Qys%2BpCg>