Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 8 Aug 2018 08:38:00 -0700
From:      bob prohaska <fbsd@www.zefox.net>
To:        Mark Johnston <markj@freebsd.org>
Cc:        Mark Millard <marklmi@yahoo.com>, freebsd-arm@freebsd.org, bob prohaska <fbsd@www.zefox.net>
Subject:   Re: RPI3 swap experiments ["was killed: out of swap space" with: "v_free_count: 5439, v_inactive_count: 1"]
Message-ID:  <20180808153800.GF26133@www.zefox.net>
In-Reply-To: <20180806155837.GA6277@raichu>
References:  <23793AAA-A339-4DEC-981F-21C7CC4FE440@yahoo.com> <20180731231912.GF94742@www.zefox.net> <2222ABBD-E689-4C3B-A7D3-50AECCC5E7B2@yahoo.com> <20180801034511.GA96616@www.zefox.net> <201808010405.w7145RS6086730@donotpassgo.dyslexicfish.net> <6BFE7B77-A0E2-4FAF-9C68-81951D2F6627@yahoo.com> <20180802002841.GB99523@www.zefox.net> <20180802015135.GC99523@www.zefox.net> <EC74A5A6-0DF4-48EB-88DA-543FD70FEA07@yahoo.com> <20180806155837.GA6277@raichu>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, Aug 06, 2018 at 11:58:37AM -0400, Mark Johnston wrote:
> On Wed, Aug 01, 2018 at 09:27:31PM -0700, Mark Millard wrote:
> > [I have a top-posted introduction here in reply
> > to a message listed at the bottom.]
> > 
> > Bob P. meet Mark J. Mark J. meet Bob P. I'm
> > hopinh you can help Bob P. use a patch that
> > you once published on the lists. This was from:
> > 
> > https://lists.freebsd.org/pipermail/freebsd-current/2018-June/069835.html
> > 
> > Bob P. has been having problems with an rpi3
> > based buildworld ending up with "was killed:
> > out of swap space" but when the swap partitions
> > do not seem to be heavily used (seen via swapinfo
> > or watching top).
> > 
> > > The patch to report OOMA information did its job, very tersely. The console reported
> > > v_free_count: 5439, v_inactive_count: 1
> > > Aug  1 18:08:25 www kernel: pid 93301 (c++), uid 0, was killed: out of swap space
> > > 
> > > The entire buildworld.log and gstat output are at
> > > http://www.zefox.net/~fbsd/rpi3/swaptests/r336877M/
> > > 
> > > It appears that at 18:08:21 a write to the USB swap device took 530.5 ms, 
> > > next top was killed and ten seconds later c++ was killed, _after_ da0b
> > > was no longer busy.
> 
> My suspicion, based on the high latency, is that this is a consequence
> of r329882, which lowered the period of time that the page daemon will
> sleep while waiting for dirty pages to be cleaned.  If a certain number
> of consecutive wakeups and queue scans occur without making progress,
> the OOM killer is triggered.  That number is vm.pageout_oom_seq - could
> you try increasing it by a factor of 10 and retry your test?
> 
> > > This buildworld stopped a quite a bit earlier than usual; most of the time
> > > the buildworld.log file is close to 20 MB at the time OOMA acts. In this case
> > > it was around 13 MB. Not clear if that's of significance.
> > > 
> > > If somebody would indicate whether this result is informative, and any possible
> > > improvements to the test, I'd be most grateful. 
> 
> If the above suggestion doesn't help, the next thing to try would be to
> revert the oom_seq value to the default, apply this patch, and see if
> the problem continues to occur.  If this doesn't help, please try
> applying both measures, i.e., set oom_seq to 120 _and_ apply the patch.
> 
> diff --git a/sys/vm/vm_pagequeue.h b/sys/vm/vm_pagequeue.h
> index fb56bdf2fdfc..29a16060253f 100644
> --- a/sys/vm/vm_pagequeue.h
> +++ b/sys/vm/vm_pagequeue.h
> @@ -74,7 +74,7 @@ struct vm_pagequeue {
>  } __aligned(CACHE_LINE_SIZE);
>  
>  #ifndef VM_BATCHQUEUE_SIZE
> -#define	VM_BATCHQUEUE_SIZE	7
> +#define	VM_BATCHQUEUE_SIZE	1
>  #endif
>  
>  struct vm_batchqueue {

The patched kernel ran longer than default but OOMA still halted buildworld around
13 MB. That's considerably farther than a default build world have run but less than
observed when setting vm.pageout_oom_seq=120 alone. Log files are at
http://www.zefox.net/~fbsd/rpi3/swaptests/r337226M/1gbsdflash_1gbusbflash/batchqueue/

Both changes are now in place and -j4 buildworld has been restarted. 

Thanks for reading!

bob prohaska




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20180808153800.GF26133>