Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 28 Nov 2016 17:41:40 +0100
From:      Fabian Keil <freebsd-listen@fabiankeil.de>
To:        Konstantin Belousov <kostikbel@gmail.com>
Cc:        freebsd-hackers@freebsd.org
Subject:   Re: FreeBSD 11 i386 disk deadlock (I think) (now with reproduction steps!)
Message-ID:  <20161128174140.6635a726@fabiankeil.de>
In-Reply-To: <20161128160311.GQ54029@kib.kiev.ua>
References:  <CAM9edeMYMhnkWid7Lig5D-FjhahniFm0VbFRm8ysyb85h29wXg@mail.gmail.com> <20161128041847.GA65249@charmander> <20161128120046.GP54029@kib.kiev.ua> <CAM9edeNDWcJ7R_%2B_Q%2BMksVcL_pcJVR%2BO7t98s5XyfmOpXgc-zw@mail.gmail.com> <20161128144135.10f93205@fabiankeil.de> <20161128160311.GQ54029@kib.kiev.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
--Sig_/R9=vfUUF6L/il=NEp96Z1ks
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: quoted-printable

Konstantin Belousov <kostikbel@gmail.com> wrote:

> On Mon, Nov 28, 2016 at 02:43:30PM +0100, Fabian Keil wrote:
> > David Cross <dcrosstech@gmail.com> wrote:
> >  =20
> > > This is certainly new behavior, or a new manifestation. =20
> >=20
> > Recently a couple of uma consumers were changed to share uma zones
> > instead of using a dedicated zone. As a result geli competes with
> > more uma consumers and is more likely to deadlock. The bug isn't
> > new, it's just triggered more often now. =20
> The problem happens on layer much lower than UMA, it is whole reusable
> page pool which is depleted and cannot be re-filled without allocating
> more memory.  If you think about it, the deadlock is obviously trivial:
> pagedaemon is the main source of the free pages, but if producing free
> page requires allocating one, low memory condition is equal to deadlock.
>=20
> It was always there, in the sense that for all versions of freebsd, if
> file/disk write path requires memory allocation, there is the trouble.
>=20
> For geom, some special unique measures were taken so that bio allocations
> do not cause the issue in typical situations.
>=20
> > geli isn't the only uma consumer that is affected:
> > https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D209680 =20

It's been a couple of months since I looked into this and apparently
I misremembered.

The commits I was thinking of didn't actually modify UMA consumers to
use shared zones instead of dedicated zones but removed the UMA_ZONE_NOFREE
flag which makes issues like the one reported in the PR above more likely.
However, this should not negatively affect UMA consumers that use different
zones and should be unrelated to the geli deadlocks.

On my systems the patch from #209759 reliably prevents the geli deadlocks
when paging, but I do not remember why the issue became more pressing
recently.

Fabian

--Sig_/R9=vfUUF6L/il=NEp96Z1ks
Content-Type: application/pgp-signature
Content-Description: OpenPGP digital signature

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iEYEARECAAYFAlg8XkYACgkQBYqIVf93VJ3Z3wCdHWmBsM5SXy1g7XQhU+4H0J8j
D7IAn0WsLYYAOAvHz4umN1WTcxOVIabF
=RmKu
-----END PGP SIGNATURE-----

--Sig_/R9=vfUUF6L/il=NEp96Z1ks--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20161128174140.6635a726>