From owner-freebsd-stable@freebsd.org Sun Feb 5 16:43:16 2017 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 99313CD2096 for ; Sun, 5 Feb 2017 16:43:16 +0000 (UTC) (envelope-from freebsd-listen@fabiankeil.de) Received: from smtprelay06.ispgateway.de (smtprelay06.ispgateway.de [80.67.31.104]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 5EDFFB5F for ; Sun, 5 Feb 2017 16:43:16 +0000 (UTC) (envelope-from freebsd-listen@fabiankeil.de) Received: from [78.35.153.230] (helo=fabiankeil.de) by smtprelay06.ispgateway.de with esmtpsa (TLSv1.2:AES256-GCM-SHA384:256) (Exim 4.84) (envelope-from ) id 1caPu8-000512-0D; Sun, 05 Feb 2017 17:43:08 +0100 Date: Sun, 5 Feb 2017 17:41:13 +0100 From: Fabian Keil To: "Matthew X. Economou" Cc: Subject: Re: Swapping from a zvol results in a deadman panic Message-ID: <20170205174113.5858bc99@fabiankeil.de> In-Reply-To: References: Reply-To: MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; boundary="Sig_/yAExqy3EP3MeNaxCSNqrcQG"; protocol="application/pgp-signature" X-Df-Sender: Nzc1MDY3 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 05 Feb 2017 16:43:16 -0000 --Sig_/yAExqy3EP3MeNaxCSNqrcQG Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: quoted-printable "Matthew X. Economou" wrote: > My FreeBSD 10.3-RELEASE-p16 server crashes in the middle of a Poudriere > bulk run (see below). This crash happens even if I lower > vfs.zfs.arc_max or tweak vm.v_free_min/target/reserved/severe. I'm > looking for configuration advice in case I missed something obvious, > since this seems to work on Illumos- and Linux-derived O/Ses, but > failing that, I'd like to get some advice as to how to go about > debugging this. I doubt the deadman timer causes the system to stop > responding. It's more likely a race condition elsewhere. >=20 > The pool itself uses 4k sectors and is geli-encrypted. I configured the > swap zvol based on root-on-ZFS install instructions found in the FreeBSD > wiki: Paging on geli-encrypted devices is known to cause deadlocks on FreeBSD, even if ZFS isn't involved directly: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D209759 Adding ZFS to the mix is unlikely to help ... > zfs create -V 6G -o org.freebsd:swap=3Don -o checksum=3Doff -o > compression=3Doff -o dedup=3Doff -o sync=3Ddisabled -o primarycache=3Dnone > zroot/swap >=20 > The ZoL wiki recommends a slightly different zvol configuration: >=20 > zfs create -V 4G -b $(getconf PAGESIZE) -o logbias=3Dthroughput -o > sync=3Dalways -o primarycache=3Dmetadata -o com.sun:auto-snapshot=3Dfalse > rpool/swap >=20 > I'm not sure how much of this applies to FreeBSD due to differences in > kernel design/implementation. Does anyone have an idea of what might be > going on and how I might get this working? You could try the patch from the PR and enable the kern.geom.eli.use_uma_for_all_writes sysctl. If you have a core dump, you may want to confirm that the g_eli_worker is waiting for memory first. Fabian --Sig_/yAExqy3EP3MeNaxCSNqrcQG Content-Type: application/pgp-signature Content-Description: OpenPGP digital signature -----BEGIN PGP SIGNATURE----- iF0EARECAB0WIQTKUNd6H/m3+ByGULIFiohV/3dUnQUCWJdVqgAKCRAFiohV/3dU nf37AKCZzGs59aA9nNs9jnNwmgb9nxQHFwCeNHoGLwVakcB8DFKV53C46iHNGVY= =JzFx -----END PGP SIGNATURE----- --Sig_/yAExqy3EP3MeNaxCSNqrcQG--