From: Alexander Leidinger
To: Larry Rosenman
Cc: Rob Wing, Alexander Motin, Freebsd current
Subject: Re: ZFS PANIC: HELP.
Date: Sat, 26 Feb 2022 17:13:39 +0100
Message-ID: <20220226171339.Horde.l4P5gYzg-XT3ZRAFvL4KBNq@webmail.leidinger.net>
List-Id: Discussions about the use of FreeBSD-current
List-Archive: https://lists.freebsd.org/archives/freebsd-current

Quoting Larry Rosenman <ler@lerctr.org> (from Fri, 25 Feb 2022 20:03:51 -0600):

> On 02/25/2022 2:11 am, Alexander Leidinger wrote:
>
>> Quoting Larry Rosenman <ler@lerctr.org> (from Thu, 24 Feb 2022 20:19:45 -0600):
>>
>>> I tried a scrub -- it panic'd on a fatal double fault.
>>>
>>> Suggestions?
>>
>> The safest / cleanest (but not fastest) is data export and pool re-creation. If you export dataset by dataset (instead of recursively all), you can even see which dataset is causing the issue. In case this per dataset export narrows down the issue and it is a dataset you don't care about (as in: 1) no issue to recreate from scratch or 2) there is a backup available) you could delete this (or each such) dataset and re-create it in-place (= not re-creating the entire pool).
>>
>> Bye,
>> Alexander.
>>
>> http://www.Leidinger.net Alexander@Leidinger.net: PGP 0x8F31830F9F2772BF
>> http://www.FreeBSD.org netchild@FreeBSD.org : PGP 0x8F31830F9F2772BF
>
> I'm running this script:
> #!/bin/sh
> for i in $(zfs list -H | awk '{print $1}')
> do
>   FS=$1
>   FN=$(echo ${FS} | sed -e s@/@_@g)
>   sudo zfs send -vecLep ${FS}@REPAIR_SNAP | ssh ler@freenas.lerctr.org cat - \> $FN
> done
>
> How will I know a "Problem" dataset?

You said that a scrub panics the system. A scrub only touches occupied blocks. As such, sending a problem dataset should panic your system as well, because the send has to read the occupied blocks of that dataset. If nothing panics at all, the problem may be within a snapshot which contains data that has been deleted in later versions of the dataset.
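One detail in the quoted script: it assigns FS=$1 although the loop variable is i, so ${FS} is probably empty unless the script is called with an argument. Below is a rough sketch of how the loop could look, assuming $i was meant. The function names, the REMOTE placeholder and the progress.log file are my own illustrations and not part of your setup. The idea is to record each dataset name on the receiving host before the send starts, so that the name survives a panic of the sending machine.

```sh
#!/bin/sh
# Sketch only. SNAP, REMOTE, progress.log and the function names are
# illustrative assumptions, not taken from the original script.
SNAP="${SNAP:-REPAIR_SNAP}"
REMOTE="${REMOTE:-user@backuphost}"   # hypothetical receiving host

# Turn a dataset name such as "zroot/usr/home" into a flat file name.
dataset_to_filename() {
    printf '%s\n' "$1" | sed -e 's@/@_@g'
}

# Send one dataset snapshot to the remote host.
# The dataset name is appended to progress.log on the remote side BEFORE
# the send starts, so after a panic the last line of that log names the
# dataset that was being read.
send_dataset() {
    fs=$1
    fn=$(dataset_to_filename "$fs")
    ssh "$REMOTE" "echo '$fs' >> progress.log" || return 1
    # Note: the exit status of this pipeline is that of ssh, not of zfs send.
    zfs send -v "${fs}@${SNAP}" | ssh "$REMOTE" "cat > '$fn'"
}

# Loop over all datasets using the loop variable (not $1).
# Assumes dataset names contain no whitespace.
send_all() {
    for fs in $(zfs list -H -o name); do
        send_dataset "$fs"
    done
}
```

Calling send_all as a user who is allowed to run zfs send starts the export. After a panic and reboot, the last line of progress.log on the receiving host would be the first candidate for the problem dataset.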

Bye,
Alexander.
