From owner-freebsd-arm@freebsd.org  Wed Jun 27 05:40:35 2018
Return-Path: <owner-freebsd-arm@freebsd.org>
Delivered-To: freebsd-arm@mailman.ysv.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1])
 by mailman.ysv.freebsd.org (Postfix) with ESMTP id 1804C1001CF5
 for <freebsd-arm@mailman.ysv.freebsd.org>;
 Wed, 27 Jun 2018 05:40:35 +0000 (UTC)
 (envelope-from fbsd@www.zefox.net)
Received: from www.zefox.net (www.zefox.net [50.1.20.27])
 (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
 (Client CN "www.zefox.org", Issuer "www.zefox.org" (not verified))
 by mx1.freebsd.org (Postfix) with ESMTPS id 132569545C
 for <freebsd-arm@freebsd.org>; Wed, 27 Jun 2018 05:40:32 +0000 (UTC)
 (envelope-from fbsd@www.zefox.net)
Received: from www.zefox.net (localhost [127.0.0.1])
 by www.zefox.net (8.15.2/8.15.2) with ESMTPS id w5R5eTFr023080
 (version=TLSv1.2 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=NO);
 Tue, 26 Jun 2018 22:40:31 -0700 (PDT)
 (envelope-from fbsd@www.zefox.net)
Received: (from fbsd@localhost)
 by www.zefox.net (8.15.2/8.15.2/Submit) id w5R5eSWR023079;
 Tue, 26 Jun 2018 22:40:28 -0700 (PDT) (envelope-from fbsd)
Date: Tue, 26 Jun 2018 22:40:27 -0700
From: bob prohaska <fbsd@www.zefox.net>
To: Mark Millard <marklmi@yahoo.com>
Cc: freebsd-arm@freebsd.org, bob prohaska <fbsd@www.zefox.net>
Subject: Re: RPI3 swap experiments, was Re: GPT vs MBR for swap devices
Message-ID: <20180627054027.GA22144@www.zefox.net>
References: <20180624231020.GA11132@www.zefox.net>
 <C87C40CF-15B2-4137-892C-F2ADBAB32418@yahoo.com>
 <20180626052451.GA17293@www.zefox.net>
 <CANCZdfpXyzxzOZ8pqcRtuFsxYx5Jjs9oSL1ok2sGVPHdiB0qVQ@mail.gmail.com>
 <201806261040.w5QAeBKq035183@donotpassgo.dyslexicfish.net>
 <A6986B21-FF6E-48F5-9F3A-06B3D2A92C55@yahoo.com>
 <20180626151843.GD17293@www.zefox.net>
 <3525D7C7-F848-45A1-BD85-2DAC895DF48C@yahoo.com>
 <20180626222834.GA20270@www.zefox.net>
 <28012DFB-37A0-461A-BB62-CD3EE61E82F0@yahoo.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <28012DFB-37A0-461A-BB62-CD3EE61E82F0@yahoo.com>
User-Agent: Mutt/1.5.24 (2015-08-30)
X-BeenThere: freebsd-arm@freebsd.org
X-Mailman-Version: 2.1.26
Precedence: list
List-Id: "Porting FreeBSD to ARM processors." <freebsd-arm.freebsd.org>
List-Unsubscribe: <https://lists.freebsd.org/mailman/options/freebsd-arm>,
 <mailto:freebsd-arm-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-arm/>
List-Post: <mailto:freebsd-arm@freebsd.org>
List-Help: <mailto:freebsd-arm-request@freebsd.org?subject=help>
List-Subscribe: <https://lists.freebsd.org/mailman/listinfo/freebsd-arm>,
 <mailto:freebsd-arm-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Wed, 27 Jun 2018 05:40:35 -0000

On Tue, Jun 26, 2018 at 07:09:09PM -0700, Mark Millard wrote:
> 
> 
> On 2018-Jun-26, at 3:28 PM, bob prohaska <fbsd at www.zefox.net> wrote:
> 
> > On Tue, Jun 26, 2018 at 01:15:54PM -0700, Mark Millard wrote:
> >> On 2018-Jun-26, at 8:18 AM, bob prohaska <fbsd at www.zefox.net> wrote:
> >> 
> >>> On Tue, Jun 26, 2018 at 07:37:59AM -0700, Mark Millard wrote:
> >>>> 
> >>>> 
> >>>> . . .
> >>>> 
> >>>> As I remember, Bob P. Did reproduce drive errors even without
> >>>> the problem drive being used for swapping. This too suggests
> >>>> (A) as separate activity.
> >>>> 
> >>> Indeed, it is a requirement. If the suspect device is used for swapping
> >>> OOMA kills prevent the test from progressing to the point of failure.
> >>> 
> >> 
> >> Looking back at http://www.zefox.net/~fbsd/rpi3/swaptests/
> >> and information about /dev/da0 rive errors it does not
> >> appear that a combination with:
> >> 
> >> A) sufficient swap (> 1.5 GiByte total?) but no use of swap on
> >>   any partition on /dev/da0
> >> and:
> >> B) use of /dev/da0 for /usr/ and /var/
> >> and:
> >> C) Records from the console showing errors (or notes
> >>   indicating lack of such errors).
> >> 
> >> exists. So I was remembering incorrectly.
> >> 
> >> I'm not claiming such a combination is the best direction for
> >> the next tests, but absent such tests there is no
> >> compare/contrast to know if /dev/da0 would still get errors
> >> despite the system having sufficient swap present on other
> >> drives. Thus, I would not go so far as "is a requirement" on
> >> the evidence available.
> >> 
> > 
> > I just didn't bother to record successful runs. I'm logging one now.
> > 
> >> We do have evidence for the system having insufficient swap
> >> space: this context seems to have the current status "is
> >> sufficient but might not be necessary" for /dev/da0
> >> getting drive errors.
> >> 
> > Not sure I understand here. Basically there seem to be three cases:
> > Enough swap not on da0, -j4 buildworld completes.
> > Any swap on da0, -j4 buildworld is killed by OOMA
> > Not enough swap not on da0, -j4 buildworld crashes the machine eventually.
                    ^^^^^^^^^^
OK, here's my error. The third case should have been
"not enough swap on mmcsd0". 


> > 
> > Are there other combinations I've overlooked? The first two don't seem 
> > worth repeating, at least not often.
> 
> "buildworld completes with /dev/da0 errors" vs. "buildworld completes
> without /dev/da0 errors" (for: enough swap not on /dev/da0 with no
> swap on /dev/da0 ).
> 
> That is a little simplistic, as there can be multiple retries
> before FreeBSD gives up. Normal is no retries needed. Going
> from rare single retries to frequent multiple retries but no
> giving-up to it giving up sometimes is all abnormal as I
> understand. But there are degrees of abnormal.
> 
> And, yes, I have had past examples of significant drive reports
> during buildworld that let buildworld appear to complete. (Not
> that I trusted the result or the drive involved after such, at
> least as the drive was powered/connected at the time.)
> 
> For "any swap on da0" and "not enough swap not on da0" (with
> no swap on da0) I'd add to your descriptions: "with /dev/da0
> errors" (again simplistic).

The only case where I've seen crashes and /dev/da0 errors is with
insufficient swap on mmcsd0.  I've come to ignore OOMA kills as 
too familiar to be interesting. 
> 
> This goes along with my suggestion to split the /dev/da0
> error investigation from the investigations of OMMA behavior
> and crashing-the-machine: avoiding any confounding.
> 
>From what I've seen, OOMA isn't associated with da0 errors and crashes.
To see the latter, OOMA must be avoided.

Thanks for reading,

bob prohaska