From nobody Sat Apr 15 15:51:51 2023 X-Original-To: dev-commits-src-main@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4PzHrY3d2Qz45YRP; Sat, 15 Apr 2023 15:52:29 +0000 (UTC) (envelope-from freebsd@walstatt-de.de) Received: from smtp052.goneo.de (smtp052.goneo.de [85.220.129.60]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4PzHrY1Szzz3Nmk; Sat, 15 Apr 2023 15:52:29 +0000 (UTC) (envelope-from freebsd@walstatt-de.de) Authentication-Results: mx1.freebsd.org; none Received: from hub1.goneo.de (hub1.goneo.de [85.220.129.52]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits)) (No client certificate requested) by smtp5.goneo.de (Postfix) with ESMTPS id 2B68510A1E8E; Sat, 15 Apr 2023 17:52:21 +0200 (CEST) Received: from hub1.goneo.de (localhost [127.0.0.1]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits)) (No client certificate requested) by hub1.goneo.de (Postfix) with ESMTPS id 4B57110A32F4; Sat, 15 Apr 2023 17:52:19 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=walstatt-de.de; s=DKIM001; t=1681573939; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=uQ5Y1tWOxt2kTlzDq2zDNlVZOkYI/NI/BsFrnymjmwY=; b=Xel4bsbEYNEJMJMioBJ054+n2DZw5bYIt9pEcAFTK8lP4vhDAoqRPfwhCnf6LH84+3yxyp Eb0UVrgX8QM/PrSosoIz0wIcQOKaPhe7oUd8IcmO9Ppovrysyhw2Fg33nOlShsHLcSWmD8 2bPS73BKIusgZH/s7c2/wLGNalTwCN7rSe6QvdnT6mahjOwP9lml4/iaCdPptPfgGWOGGH kJujjvXwbMAo38mtD5AlO62VplFaVn8JFRBrCL9O4QxC0c2dacWHEP5Ve7WE2tyRy4KNSv PzRt3VY3ggfwUuAOcgZBg44a/u6RUKjw48bou3TmqcwZxW7+w9EkwluakYhKCQ== Received: from thor.intern.walstatt.dynvpn.de (dynamic-077-013-073-163.77.13.pool.telefonica.de [77.13.73.163]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by hub1.goneo.de (Postfix) with ESMTPSA id DB4B110A32E3; Sat, 15 Apr 2023 17:52:18 +0200 (CEST) Date: Sat, 15 Apr 2023 17:51:51 +0200 From: FreeBSD User To: Cy Schubert Cc: Mark Millard , Charlie Li , Pawel Jakub Dawidek , Mateusz Guzik , dev-commits-src-main@freebsd.org, Current FreeBSD Subject: Re: git: 2a58b312b62f - main - zfs: merge openzfs/zfs@431083f75 Message-ID: <20230415175218.777d0a97@thor.intern.walstatt.dynvpn.de> In-Reply-To: <20230415143625.99388387@slippy.cwsent.com> References: <20230413071032.18BFF31F@slippy.cwsent.com> <20230413063321.60344b1f@cschubert.com> <20230413135635.6B62F354@slippy.cwsent.com> <319a267e-3f76-3647-954a-02178c260cea@dawidek.net> <441db213-2abb-b37e-e5b3-481ed3e00f96@dawidek.net> <5ce72375-90db-6d30-9f3b-a741c320b1bf@freebsd.org> <99382FF7-765C-455F-A082-C47DB4D5E2C1@yahoo.com> <32cad878-726c-4562-0971-20d5049c28ad@freebsd.org> <20230415115452.08911bb7@thor.intern.walstatt.dynvpn.de> <20230415143625.99388387@slippy.cwsent.com> Organization: walstatt-de.de List-Id: Commit messages for the main branch of the src repository List-Archive: https://lists.freebsd.org/archives/dev-commits-src-main List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-dev-commits-src-main@freebsd.org X-BeenThere: dev-commits-src-main@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Rspamd-UID: ccd090 X-Rspamd-UID: dbcd85 X-Rspamd-Queue-Id: 4PzHrY1Szzz3Nmk X-Spamd-Bar: ---- X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]; ASN(0.00)[asn:25394, ipnet:85.220.128.0/17, country:DE] X-Rspamd-Pre-Result: action=no action; module=replies; Message is reply to one we originated X-ThisMailContainsUnwantedMimeParts: N Am Sat, 15 Apr 2023 07:36:25 -0700 Cy Schubert schrieb: > In message <20230415115452.08911bb7@thor.intern.walstatt.dynvpn.de>, > FreeBSD Us > er writes: > > Am Thu, 13 Apr 2023 22:18:04 -0700 > > Mark Millard schrieb: > > > > > On Apr 13, 2023, at 21:44, Charlie Li wrote: > > > > > > > Mark Millard wrote: > > > >> FYI: in my original report for a context that has never had > > > >> block_cloning enabled, I reported BOTH missing files and > > > >> file content corruption in the poudriere-devel bulk build > > > >> testing. This predates: > > > >> https://people.freebsd.org/~pjd/patches/brt_revert.patch > > > >> but had the changes from: > > > >> https://github.com/openzfs/zfs/pull/14739/files > > > >> The files were missing from packages installed to be used > > > >> during a port's build. No other types of examples of missing > > > >> files happened. (But only 11 ports failed.) > > > > I also don't have block_cloning enabled. "Missing files" prior to brt_rev > > ert may actually > > > > be present, but as the corruption also messes with the file(1) signature, > > some tools like > > > > ldconfig report them as missing. > > > > > > For reference, the specific messages that were not explicit > > > null-byte complaints were (some shown with a little context): > > > > > > > > > ===> py39-lxml-4.9.2 depends on shared library: libxml2.so - not found > > > ===> Installing existing package /packages/All/libxml2-2.10.3_1.pkg > > > [CA72_ZFS] Installing libxml2-2.10.3_1... > > > [CA72_ZFS] Extracting libxml2-2.10.3_1: .......... done > > > ===> py39-lxml-4.9.2 depends on shared library: libxml2.so - found > > > (/usr/local/lib/libxml2.so) . . . > > > [CA72_ZFS] Extracting libxslt-1.1.37: .......... done > > > ===> py39-lxml-4.9.2 depends on shared library: libxslt.so - found > > > (/usr/local/lib/libxslt.so) ===> Returning to build of py39-lxml-4.9.2 > > > . . . > > > ===> Configuring for py39-lxml-4.9.2 > > > Building lxml version 4.9.2. > > > Building with Cython 0.29.33. > > > Error: Please make sure the libxml2 and libxslt development packages are in > > stalled. > > > > > > > > > [CA72_ZFS] Extracting libunistring-1.1: .......... done > > > ===> libidn2-2.3.4 depends on shared library: libunistring.so - not found > > > > > > > > > > > [CA72_ZFS] Extracting gmp-6.2.1: .......... done > > > ===> mpfr-4.2.0,1 depends on shared library: libgmp.so - not found > > > > > > > > > ===> nettle-3.8.1 depends on shared library: libgmp.so - not found > > > ===> Installing existing package /packages/All/gmp-6.2.1.pkg > > > [CA72_ZFS] Installing gmp-6.2.1... > > > the most recent version of gmp-6.2.1 is already installed > > > ===> nettle-3.8.1 depends on shared library: libgmp.so - not found > > > *** Error code 1 > > > > > > > > > autom4te: error: need GNU m4 1.4 or later: /usr/local/bin/gm4 > > > > > > > > > checking for GNU > > > M4 that supports accurate traces... configure: error: no acceptable m4 coul > > d be found in > > > $PATH. GNU M4 1.4.6 or later is required; 1.4.16 or newer is recommended. > > > GNU M4 1.4.15 uses a buggy replacement strstr on some systems. > > > Glibc 2.9 - 2.12 and GNU M4 1.4.11 - 1.4.15 have another strstr bug. > > > > > > > > > ld: error: /usr/local/lib/libblkid.a: unknown file type > > > > > > > > > === > > > Mark Millard > > > marklmi at yahoo.com > > > > > > > > > > Hello > > > > whar is the recent status of fixing/mitigate this desatrous bug? Especially f > > or those with the > > new option enabled on ZFS pools. Any advice? > > > > In an act of precausion (or call it panic) I shutdown several servers to prev > > ent irreversible > > damages to databases and data storages. We face on one host with /usr/ports r > > esiding on ZFS > > always errors on the same files created while staging (using portmaster, leav > > es the system > > with noninstalled software, i.e. www/apache24 in our case). Deleting the work > > folder doesn't > > seem to change anything, even when starting a scrubbing of the entire pool (R > > AIDZ1 pool) - > > cause unknown, why it affects always the same files to be corrupted. Same wit > > h deve/ruby-gems. > > > > Poudriere has been shutdown for the time being to avoid further issues. > > > > Are there any advies to proceed apart from conserving the boxes via shutdown? > > > > Thank you ;-) > > oh > > > > > > > > -- > > O. Hartmann > > With an up-to-date tree + pjd@'s "Fix data corruption when cloning embedded > blocks. #14739" patch I didn't have any issues, except for email messages > with corruption in my sent directory, nowhere else. I'm still investigating > the email messages issue. IMO one is generally safe to run poudriere on the > latest ZFS with the additional patch. > > My tests of the additional patch concluded that it resolved my last > problems, except for the sent email problem I'm still investigating. I'm > sure there's a simple explanation for it, i.e. the email thread was > corrupted by the EXDEV regression which cannot be fixed by anything, even > reverting to the previous ZFS -- the data in those files will remain > damaged regardless. > > I cannot speak to the others who have had poudriere and other issues. I > never had any problems with poudriere on top of the new ZFS. > > WRT reverting block_cloning pools to without, your only option is to backup > your pool and recreate it without block_cloning. Then restore your data. > > All right, I interpret the answer that way, that I need a most recent source tree (and accordingly built and installed OS) AND a patch that isn't officially commited? On a box I'm with: FreeBSD 14.0-CURRENT #8 main-n262175-5ee1c90e50ce: Sat Apr 15 07:57:16 CEST 2023 amd64 The box is crashing while trying to update ports with the well known issue: Panic String: VERIFY(!zil_replaying(zilog, tx)) failed At the moment all alarm bells are ringing and I lost track about what has been patched and already commited and what is still as patch available but in the phase of evaluation or inofficially emmited here. According to the EXDEV issue: in cases of poudriere or ports trees on ZFS, what do I have to do to ensure that those datasets are clean? The OS should detect file corruption but in my case the box is crashing :-( I did several times scrubbing, but this seems to be the action of a helpless and desperate man ... ;-/ Greetings Oliver -- O. Hartmann