From nobody Sun Dec 31 23:16:48 2023 X-Original-To: freebsd-current@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4T3FRP6jhPz55Xmt for ; Sun, 31 Dec 2023 23:18:41 +0000 (UTC) (envelope-from warlock@phouka1.phouka.net) Received: from phouka1.phouka.net (phouka1.phouka.net [107.170.196.116]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (2048 bits) client-digest SHA256) (Client CN "phouka.net", Issuer "Go Daddy Secure Certificate Authority - G2" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 4T3FRP4BRBz4LwJ; Sun, 31 Dec 2023 23:18:41 +0000 (UTC) (envelope-from warlock@phouka1.phouka.net) Authentication-Results: mx1.freebsd.org; none Received: from phouka1.phouka.net (localhost [127.0.0.1]) by phouka1.phouka.net (8.17.1/8.17.1) with ESMTPS id 3BVNGn0K071241 (version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NO); Sun, 31 Dec 2023 15:16:49 -0800 (PST) (envelope-from warlock@phouka1.phouka.net) Received: (from warlock@localhost) by phouka1.phouka.net (8.17.1/8.17.1/Submit) id 3BVNGnfq071240; Sun, 31 Dec 2023 15:16:49 -0800 (PST) (envelope-from warlock) Date: Sun, 31 Dec 2023 15:16:48 -0800 From: John Kennedy To: Kurt Jaeger Cc: freebsd-current@freebsd.org Subject: Re: ZFS problems since recently ? Message-ID: References: List-Id: Discussions about the use of FreeBSD-current List-Archive: https://lists.freebsd.org/archives/freebsd-current List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-current@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Rspamd-Pre-Result: action=no action; module=replies; Message is reply to one we originated X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]; ASN(0.00)[asn:14061, ipnet:107.170.192.0/18, country:US] X-Spamd-Bar: ---- X-Rspamd-Queue-Id: 4T3FRP4BRBz4LwJ On Sun, Dec 31, 2023 at 07:34:45PM +0100, Kurt Jaeger wrote: > Hi! > > Short overview: > - Had CURRENT system from around September > - Upgrade on the 23th of December > - crashes in ZFS, see > https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=261538 > for details > - Reinstalled from scratch with new SSDs drives from > https://download.freebsd.org/snapshots/amd64/amd64/ISO-IMAGES/15.0/ > freebsd-openzfs-amd64-2020081900-memstick.img.xz > - Had one crash with > sysctl -a > https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=276039 > - Still see crashes with ZFS (and other) when using poudriere to > build ports. > > Problem: > > I happen to run in several cases of crashes in ZFS, some of > them fatal (zpool non-recoverable). I can crash mine with "sysctl -a" as well. I seeded my bhyve with: FreeBSD-15.0-CURRENT-amd64-20231228-fb03f7f8e30d-267242-disc1.iso Rebuilt the kernel (so now at main-n267320-4d08b569a01) and started crunching through poudriere package builds. Sorta stock install of encrypted ZFS. I didn't get it to crash with poudriere (yet). Mine lives in bhyve, so maybe less possible destruction via crashes. KDB: stack backtrace: db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame 0xfffffe00fa5f3960 vpanic() at vpanic+0x131/frame 0xfffffe00fa5f3a90 panic() at panic+0x43/frame 0xfffffe00fa5f3af0 sbuf_clear() at sbuf_clear+0xa8/frame 0xfffffe00fa5f3b00 sbuf_cpy() at sbuf_cpy+0x56/frame 0xfffffe00fa5f3b20 spa_taskq_write_param() at spa_taskq_write_param+0x85/frame 0xfffffe00fa5f3bd0 sysctl_root_handler_locked() at sysctl_root_handler_locked+0x9c/frame 0xfffffe00fa5f3c20 sysctl_root() at sysctl_root+0x21e/frame 0xfffffe00fa5f3ca0 userland_sysctl() at userland_sysctl+0x184/frame 0xfffffe00fa5f3d50 sys___sysctl() at sys___sysctl+0x60/frame 0xfffffe00fa5f3e00 amd64_syscall() at amd64_syscall+0x153/frame 0xfffffe00fa5f3f30 fast_syscall_common() at fast_syscall_common+0xf8/frame 0xfffffe00fa5f3f30 --- syscall (202, FreeBSD ELF64, __sysctl), rip = 0x22e42167019a, rsp = 0x22e41ee72518, rbp = 0x22e41ee72550 --- KDB: enter: panic The sysctl died at this point, but who knows if it had pending buffered output or anything... ... vfs.zfs.zio.deadman_log_all: 0 vfs.zfs.zio.dva_throttle_enabled: 1 vfs.zfs.zio.requeue_io_start_cut_in_line: 1 vfs.zfs.zio.slow_io_ms: 30000 vfs.zfs.zio.taskq_wr_iss_ncpus: 0