Date: Fri, 10 Jun 2022 21:32:10 -0700 From: Mark Millard <marklmi@yahoo.com> To: FreeBSD Hackers <freebsd-hackers@freebsd.org> Cc: Daniel Ebdrup Jensen <debdrup@freebsd.org>, David Cross <david@crossfamilyweb.com>, Robert Clausecker <fuz@fuz.su> Subject: Re: What can I learn about data that is staying paged out? (There is a more specific poudriere bulk related context given.) Message-ID: <6EA27152-3355-4356-B246-A083F31452F2@yahoo.com> In-Reply-To: <573B8B0C-5209-459D-98AD-EE92DDA4DF83@yahoo.com> References: <C337A09C-D546-46FC-A166-DBB3237D1AFC@yahoo.com> <A9C9AC24-62EC-43C1-B713-F2012CD1FD5B@yahoo.com> <573B8B0C-5209-459D-98AD-EE92DDA4DF83@yahoo.com>
next in thread | previous in thread | raw e-mail | index | archive | help
On 2022-Jun-10, at 21:27, Mark Millard <marklmi@yahoo.com> wrote: > On 2022-Jun-5, at 15:04, Mark Millard <marklmi@yahoo.com> wrote: >=20 >> On 2022-Jun-5, at 12:42, Mark Millard <marklmi@yahoo.com> wrote: >>=20 >>> I have a poudriere bulk -a -c going on a 8 Gibyte >>> aarch64 system. top has been showing an occasionally >>> increasing swap usage but never any sizable decreases. >>> Over 5800 ports have built so far. The context is UFS >>> only. The system is running a non-debug build of main. >>>=20 >>> Part of the context is ( in /etc/sysctl.conf ): >>>=20 >>> vm.swap_enabled=3D0 >>> vm.swap_idle_enabled=3D0 >>>=20 >>> Also ( in /usr/local/etc/poudriere.conf ): >>>=20 >>> USE_TMPFS=3D"data" >>>=20 >>> poudriere's TMPFS reports normally total under 128 >>> KiBytes across the 4 builders. >>>=20 >>> For reference, example figures . . . >>>=20 >>> A top variant shows: >>>=20 >>> Swap: 30720Mi Total, 306816Ki Used >>>=20 >>> vmstat -s shows: >>>=20 >>> 78152 swap pager pages paged out >>>=20 >>> Note: (78152*4096)/1024 =3D=3D 312608Ki >>>=20 >>> So nearly all of the "swap pager pages paged out" >>> pages are still sitting out in the used swap/paging >>> space. Thus, the usage is not held by user processes >>> or is held via very long running processes or is >>> not directly tied to user processes --or some mix. >>>=20 >>> The variant of top reports never having observed >>> more than: 6658Mi MaxObs(Act+Wir+Lndry). >>> ("MaxObs" is short for "Maximum Observed".) >>> Such high usage is for a bounded time, long past >>> at this point. (Until some combination of port >>> builds ends up active that uses such.) >>>=20 >>> So I'm curious: >>>=20 >>> What can I learn about the data that is staying >>> paged out (and is gradually growing)? How can I >>> learn it? >>>=20 >>>=20 >>> Other notes: >>>=20 >>> The poudriere jail being built is: >>>=20 >>> # poudriere jail -jmain-CA7-bulk_a -i >>> Jail name: main-CA7-bulk_a >>> Jail version: 14.0-CURRENT >>> Jail arch: arm.armv7 >>> Jail method: null >>> Jail mount: /usr/obj/DESTDIRs/main-CA7-poud-bulk_a >>> Jail fs: =20 >>> Jail updated: 2022-05-23 02:21:24 >>> Jail pkgbase: disabled >>>=20 >>> (Just in case the armv7 jail usage or the null method >>> or such is important to the issue.) >>=20 >> Hmm. systat -swap reports a toal for the Devices/Paths Used >> that is somewhat less than the total for what reports for the >> Pid . . . Total figures (not the Pid Swap figures!): >>=20 >> # systat -swap >> /0 /1 /2 /3 /4 /5 /6 /7 /8 /9 = /10 >> Load Average |||||||| =20 >>=20 >> Device/Path Size Used |0% /10 /20 /30 /40 / 60\ 70\ 80\ = 90\ 100| >> gpt/CA72USBswp14 14G 150M >> gpt/CA72USBswp16 16G 150M >> Total 30G 300M >>=20 >> Pid Username Command Swap/Total Per-Process Per-System >> 1453 root nfsd 1M / 15M 9% 0% >> 1451 root mountd 1M / 15M 7% 0% >> 1481 root sshd 912K / 20M 4% 0% >> 1406 root ntpd 740K / 27M 2% 0% >> 1513 root login 724K / 14M 5% 0% >> 1514 root sh 656K / 13M 4% 0% >> 342 _dhcp dhclient 516K / 13M 3% 0% >> 1363 root rpcbind 448K / 13M 3% 0% >> 1454 root nfsd 400K / 12M 3% 0% >> 341 root dhclient 380K / 13M 2% 0% >> 1341 root syslogd 324K / 12M 2% 0% >> 1505 root getty 292K / 12M 2% 0% >> 1510 root getty 292K / 12M 2% 0% >> 1511 root getty 292K / 12M 2% 0% >> 1512 root getty 292K / 12M 2% 0% >> 1509 root getty 292K / 12M 2% 0% >> 1508 root getty 292K / 12M 2% 0% >> 1507 root getty 292K / 12M 2% 0% >> 1506 root getty 288K / 12M 2% 0% >> 1135 root devd 272K / 11M 2% 0% >> 338 root dhclient 264K / 13M 2% 0% >> 1 root init 244K / 11M 2% 0% >> 1486 root cron 188K / 13M 1% 0% >>=20 >> I'm, Still looking for a clear indication of what >> most of the 300 MiBytes or so of swap/paging space >> is in use for. >=20 > I finally gave up and checked if a swapoff would > actually bring in all the pages from swap space > that were needed (if any) and then un-configure > the swap space. It did. (The bulk -a was still > ongoing. It was not doing memory-hog builder > activity at the time.) >=20 > So such an activity may be a workaround for long > running things like bulk -a to avoid a swap space > accumulation that seems to be happening. >=20 > I do not know how much was brought in to RAM vs. > simply deallocated from swap space (pages not > changed and still in RAM). If I do such a test > again, it would be good to figure out how to > monitor what the swapoff does for bringing in > pages vs. just discarding them --if possible. >=20 > After a while 12136Ki Used showed up after the > swapon that reconfigured the swap space, which is > about the size of the increments that I'd observed > for its sustained increases. >=20 An interesting point for "systat -swap" now: /0 /1 /2 /3 /4 /5 /6 /7 /8 /9 = /10 Load Average ||||||||||||||||||||||||||||| Device/Path Size Used |0% /10 /20 /30 /40 / 60\ 70\ 80\ = 90\ 100| gpt/CA72USBswp14 14G 6108K gpt/CA72USBswp16 16G 6028K Total 30G 12M Pid Username Command Swap/Total Per-Process Per-System No process is listed as using swap but the 12M shows as used! That should be a hint, not that it is directly useful for me figuring out what the usage is from/for. =3D=3D=3D Mark Millard marklmi at yahoo.com
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?6EA27152-3355-4356-B246-A083F31452F2>