Date: Wed, 5 Aug 2015 19:18:21 -0700 From: Doug Hardie <bc979@lafn.org> To: Doug Hardie <bc979@lafn.org> Cc: Peter Jeremy <peter@rulingia.com>, FreeBSD Stable ML <stable@freebsd.org> Subject: Re: Swap Usage Message-ID: <BCB80170-3E1E-4861-B159-CD52472FB475@lafn.org> In-Reply-To: <10F94D23-E58C-466E-ADCA-5E6670054BD7@lafn.org> References: <BCA67F7E-676A-4226-83A0-84229948895E@lafn.org> <20150730064444.GA88137@server.rulingia.com> <10F94D23-E58C-466E-ADCA-5E6670054BD7@lafn.org>
next in thread | previous in thread | raw e-mail | index | archive | help
> On 30 July 2015, at 01:39, Doug Hardie <bc979@lafn.org> wrote:
>=20
>=20
>> On 29 July 2015, at 23:44, Peter Jeremy <peter@rulingia.com> wrote:
>>=20
>> [reformatted]
>>=20
>> On 2015-Jul-29 17:41:33 -0700, Doug Hardie <bc979@lafn.org> wrote:
>>> I have several FreeBSD 9.3 systems that are using swap and I can=E2=80=
=99t
>>> figure out what is doing it. The key system has 6GB swap and
>>> currently it has over 2GB in use.
>>=20
>> Is the system currently paging (top(1) and "systat -v" will show
>> this)? If not, this just means that at some time in the past, the
>> system was under memory pressure and paged some process memory out.
>> Since then, that memory hasn't been touched so the system hasn't =
paged
>> it in.
>>=20
>>> ps shows only a kernel module
>>> [intr] with a W status.
>>=20
>> 'W' means the whole process is 'swapped' out - this will only occur
>> under severe RAM pressure. Normally, the system will just page out
>> inactive parts of a processes address space - and none of the ps =
flags
>> will show this.
>>=20
>>> How do I figure out what that swap space is being used for?
>>=20
>> I don't think this can be trivially done. "procstat -v" will show
>> the number of resident pages within each swap-backed region, any
>> pages in that region that have been touched but are not resident
>> are on the swap device but any pages that have never been touched
>> aren't counted at all.
>=20
> Bingo. procstat shows the problem. The process that I suspected has =
a large number of entries like:
>=20
> 650 0x834c00000 0x835800000 rw- 0 0 1 0 ---- =
sw=20
> 650 0x835800000 0x835c00000 rw- 0 0 1 0 ---- =
sw=20
> 650 0x835c00000 0x837c00000 rw- 1 0 1 0 ---- =
sw=20
>=20
> I don=E2=80=99t know whats in those areas yet. If I were to kill the =
process with SIGABRT would the core dump show those areas? I might be =
able to figure out what they are from that.
>=20
> Thanks for the pointer.
I believe I have found a bug in FreeBSD 9.3 mmap. The following test =
program (named test.c) does an mmap to itself and monitors the memory =
objects created with procstat. After the mmap, munamp is called and the =
objects are checked again. The objects created from the mmap do not get =
purged, but remain. Eventually, after enough have been created, they =
start becoming type sw and eventually the system runs out of swap.
The output:
zool# ./test
31459 0x8049000 0x804a000 rw- 1 0 1 0 CN-- df=20
31459 0x2805f000 0x28069000 rw- 10 0 1 0 C--- df=20
31459 0x28186000 0x281ad000 rw- 13 0 1 0 CN-- df=20
31459 0xbfbdf000 0xbfbff000 rwx 3 0 1 0 C--D df=20
----------------------------------
31459 0x8049000 0x804a000 rw- 1 0 1 0 CN-- df=20
31459 0x804a000 0x8400000 rw- 1 0 1 0 CN-- df=20
31459 0x2805f000 0x28069000 rw- 10 0 1 0 CN-- df=20
31459 0x28186000 0x281ad000 rw- 14 0 1 0 CN-- df=20
31459 0x28400000 0x28800000 rw- 6 0 1 0 CN-- df=20
31459 0xbfbdf000 0xbfbff000 rwx 3 0 1 0 C--D df=20
----------------------------------
31459 0x8049000 0x804a000 rw- 1 0 1 0 CN-- df=20
31459 0x804a000 0x8400000 rw- 1 0 1 0 CN-- df=20
31459 0x2805f000 0x28069000 rw- 10 0 1 0 CN-- df=20
31459 0x28186000 0x281ad000 rw- 14 0 1 0 CN-- df=20
31459 0x28400000 0x28800000 rw- 6 0 1 0 CN-- df=20
31459 0xbfbdf000 0xbfbff000 rwx 3 0 1 0 C--D df=20
----------------------------------
31459 0x8049000 0x804a000 rw- 1 0 1 0 CN-- df=20
31459 0x804a000 0x8400000 rw- 1 0 1 0 CN-- df=20
31459 0x2805f000 0x28069000 rw- 10 0 1 0 CN-- df=20
31459 0x28186000 0x281ad000 rw- 14 0 1 0 CN-- df=20
31459 0x28400000 0x28800000 rw- 6 0 1 0 CN-- df=20
31459 0xbfbdf000 0xbfbff000 rwx 3 0 1 0 C--D df=20
The program:
int main (int argc, char *argv[])
{
int rc, pid, fd;
char *cp;
char cmd[1024];
pid =3D getpid ();
sprintf (cmd, "procstat -v %d | grep df", pid);
fflush (stdout);
rc =3D system (cmd);
fflush (stdout);
printf ("----------------------------------\n");
fd =3D open ("./test.c", O_RDWR);
cp =3D mmap (0, 100, PROT_READ|PROT_WRITE, MAP_FILE, fd, 0);
if (cp =3D=3D MAP_FAILED)
{
printf ("mmap error %s\n", strerror(errno));
exit (1);
}
fflush (stdout);
rc =3D system (cmd);
fflush (stdout);
printf ("----------------------------------\n");
rc =3D munmap (cp, 100);
if (rc =3D=3D -1)
{
printf ("munmap error %s\n", strerror(errno));
exit (1);
}
fflush (stdout);
rc =3D system (cmd);
fflush (stdout);
printf ("----------------------------------\n");
close (fd);
fflush (stdout);
rc =3D system (cmd);
fflush (stdout);
}
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?BCB80170-3E1E-4861-B159-CD52472FB475>
