Date: Tue, 6 Mar 2018 19:15:54 -0300
From: "Danilo G. Baio"
To: "Rodney W. Grimes", Trond Endrestøl, FreeBSD current
Cc: Kurt Jaeger
Subject: Re: Strange ARC/Swap/CPU on yesterday's -CURRENT

On Tue, Mar 06, 2018 at 01:36:45PM -0600, Larry Rosenman wrote:
> On Tue, Mar 06, 2018 at 10:16:36AM -0800, Rodney W. Grimes wrote:
> > > On Tue, Mar 06, 2018 at 08:40:10AM -0800, Rodney W. Grimes wrote:
> > > > > On Mon, 5 Mar 2018 14:39-0600, Larry Rosenman wrote:
> > > > >
> > > > > > Upgraded to:
> > > > > >
> > > > > > FreeBSD borg.lerctr.org 12.0-CURRENT FreeBSD 12.0-CURRENT #11 r330385: Sun Mar 4 12:48:52 CST 2018 root@borg.lerctr.org:/usr/obj/usr/src/amd64.amd64/sys/VT-LER amd64
> > > > > > +1200060 1200060
> > > > > >
> > > > > > Yesterday, and I'm seeing really strange slowness, ARC use, and SWAP use and swapping.
> > > > > >
> > > > > > See http://www.lerctr.org/~ler/FreeBSD/Swapuse.png
> > > > >
> > > > > I see these symptoms on stable/11. One of my servers has 32 GiB of
> > > > > RAM. After a reboot all is well.
> > > > > ARC starts to fill up, and I still
> > > > > have more than half of the memory available for user processes.
> > > > >
> > > > > After running the periodic jobs at night, the amount of wired memory
> > > > > goes sky high. /etc/periodic/weekly/310.locate is a particularly nasty
> > > > > one.
> > > >
> > > > I would like to find out if this is the same person I have
> > > > reporting this problem from another source, or if this is
> > > > a confirmation of a bug I was helping someone else with.
> > > >
> > > > Have you been in contact with Michael Dexter about this
> > > > issue, or any other forum/mailing list/etc?
> > > Just IRC/Slack, with no response.
> > > >
> > > > If not then we have at least 2 reports of this unbounded
> > > > wired memory growth, if so hopefully someone here can
> > > > take you further in the debug than we have been able
> > > > to get.
> > > What can I provide? The system is still in this state as the full backup is slow.
> >
> > One place to look is to see if this is the recently fixed:
> > https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=222288
> > g_bio leak.
> >
> > vmstat -z | egrep 'ITEM|g_bio|UMA'
> >
> > would be a good first look
> >
> borg.lerctr.org /home/ler $ vmstat -z | egrep 'ITEM|g_bio|UMA'
> ITEM                   SIZE  LIMIT     USED     FREE      REQ FAIL SLEEP
> UMA Kegs:               280,      0,     346,       5,      560,   0,   0
> UMA Zones:             1928,      0,     363,       1,      577,   0,   0
> UMA Slabs:              112,      0,25384098,  977762,102033225,   0,   0
> UMA Hash:               256,      0,      59,      16,      105,   0,   0
> g_bio:                  384,      0,      33,    1627,542482056,   0,   0
> borg.lerctr.org /home/ler $
> > > > > Limiting the ARC to, say, 16 GiB, has no effect on the high amount of
> > > > > wired memory. After a few more days, the kernel consumes virtually all
> > > > > memory, forcing processes in and out of the swap device.
> > > >
> > > > Our experience as well.
> > > >
> > > > ...
> > > >
> > > > Thanks,
> > > > Rod Grimes                                 rgrimes@freebsd.org
> > > Larry Rosenman                     http://www.lerctr.org/~ler
> >
> > --
> > Rod Grimes                                 rgrimes@freebsd.org
>
> --
> Larry Rosenman                     http://www.lerctr.org/~ler
> Phone: +1 214-642-9640                 E-Mail: ler@lerctr.org
> US Mail: 5708 Sabbia Drive, Round Rock, TX 78665-2106

Hi.

I noticed this behavior as well and lowered vfs.zfs.arc_max to a smaller size.

For me it started when I upgraded to 1200058; on this box I only use
poudriere for test builds.

Regards.
-- 
Danilo G. Baio (dbaio)
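
One possible way to read the vmstat -z output quoted above is to multiply
SIZE by USED for each zone, which estimates the bytes held by in-use items.
The sketch below is only an illustration and is not from the thread: the
function names parse_vmstat_z and in_use_bytes are hypothetical, and the
sample data is the output Larry posted. With those numbers, UMA Slabs comes
out at roughly 2.6 GiB in use, while g_bio holds only about 12 KiB.

```python
# Sample copied from the vmstat -z output quoted in this message.
SAMPLE = """\
ITEM                   SIZE  LIMIT     USED     FREE      REQ FAIL SLEEP
UMA Kegs:               280,      0,     346,       5,      560,   0,   0
UMA Zones:             1928,      0,     363,       1,      577,   0,   0
UMA Slabs:              112,      0,25384098,  977762,102033225,   0,   0
UMA Hash:               256,      0,      59,      16,      105,   0,   0
g_bio:                  384,      0,      33,    1627,542482056,   0,   0
"""


def parse_vmstat_z(text):
    """Return a list of (zone, size, used, free) tuples (hypothetical helper)."""
    zones = []
    for line in text.splitlines():
        # The zone name ends at the last colon; the header line has none.
        name, sep, rest = line.rpartition(":")
        if not sep:
            continue
        fields = [f.strip() for f in rest.split(",")]
        if len(fields) < 4:
            continue
        try:
            size, _limit, used, free = (int(f) for f in fields[:4])
        except ValueError:
            continue  # skip anything that is not a numeric zone row
        zones.append((name.strip(), size, used, free))
    return zones


def in_use_bytes(zones):
    """Return (zone, SIZE * USED) pairs, largest consumer first."""
    totals = [(name, size * used) for name, size, used, _free in zones]
    return sorted(totals, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, nbytes in in_use_bytes(parse_vmstat_z(SAMPLE)):
        print(f"{name:12s} {nbytes / 2**20:10.2f} MiB")
```

On a live system the same functions could be fed the full text of
`vmstat -z` instead of SAMPLE, assuming the output keeps this
name-colon-then-comma-separated layout.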