From owner-freebsd-arm@freebsd.org Wed Jun 27 06:30:58 2018 Return-Path: Delivered-To: freebsd-arm@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 958A51004069 for ; Wed, 27 Jun 2018 06:30:58 +0000 (UTC) (envelope-from marklmi@yahoo.com) Received: from sonic310-13.consmr.mail.bf2.yahoo.com (sonic310-13.consmr.mail.bf2.yahoo.com [74.6.135.123]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 364A796AC4 for ; Wed, 27 Jun 2018 06:30:58 +0000 (UTC) (envelope-from marklmi@yahoo.com) X-YMail-OSG: m3tMVNkVM1lwBSRGx0Rz_NWYvuLDN7bSRiTJi9I3XFroLdgxOiu_gmFEn0.mq22 fj9_OM.b6l9Gf.bBMrBxvuD.CNpbUwSnOtxag8iCVHeQqoIrIMFu3G9q_zqI.Gq3zIvbSAOQF5PU 81riY9pgkO2JR4gS.AJM.mmNWYVbTUcCmo.VvoQ5u7IdCiNeYo1TNEoEKuivdiAZuojlwKNCZdXi j7wLFIWvWuxDqRSsRI1Zrozoo4cLsvFFDEnGYlt0Gts.I.01XMZ6myvSdIXLBbIrvoy5zCsH9Y27 90lsh_w0LvGte29wdglxXeAqtU9h1RjLHklTLU7kpBduFKDRcXQHqThOhSRZB4cPlFO43NkWO3JT RmdaaI9mwuFesV.GPnWMwXv3AVBeBuD6WKSySSd0FCVy443LkP.apA1Sdn2zka1xPvsfv9uxccw5 L0cs43_S3IkKiHjWnO9fyyj4eNs2Q6JpeaWqodZIgQ3mjmojXodxa5rmsliFYILgPeuKL8S9Mr.H 0evgAKH_G4.hGETMap7Y.lfpGrN1ac97YX9DZYiK3J7jq2gWgfOmrJrlLcWQpaoHzjNhrfVMcArt aS39LEoT2SQtzCWkgwoPGx27RAxsXWCN3PEg0t_0mlYMz5u6BzrxlieCDdR.bT7g8b1J5y4cJKb9 yr5ckMdbKj8TG6G31XLJ5F4fjwjWapLpKS0sjTAqgsXOCdRtWUXOG5XOGlK0t_8nCFUau1LNnq5x jKzWfn_US9HCafMV2swkIbCRaGQ91xmNJW_5Z18xGxaHhWTfXy._atdxdlZr_PFTbumiHby2owdR lLxg18VjJAx1seCD9GUDshoyO3VqDHfsUnNgi23lXaaHYjRr7hePsE7iExcJ3yMSRMSh59BBoaVN tN4HV3J0RtZ6gcPHa_.jVwtdj88ziLH0metqN3FxQoapWzy0VNq8ghrjG5vJo8ypoJ0xNA4MH36G gGty3jH.w3cF4a2sn5gtn8cMsjWIJnsEm Received: from sonic.gate.mail.ne1.yahoo.com by sonic310.consmr.mail.bf2.yahoo.com with HTTP; Wed, 27 Jun 2018 06:30:57 +0000 Received: from ip70-189-131-151.lv.lv.cox.net (EHLO [192.168.0.105]) ([70.189.131.151]) by smtp404.mail.bf1.yahoo.com (Oath Hermes SMTP Server) with ESMTPA ID c7840db5a7f361770c5fa542451b8bba; Wed, 27 Jun 2018 06:30:54 +0000 (UTC) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 11.4 \(3445.8.2\)) Subject: Re: RPI3 swap experiments, was Re: GPT vs MBR for swap devices From: Mark Millard In-Reply-To: <20180627054027.GA22144@www.zefox.net> Date: Tue, 26 Jun 2018 23:30:52 -0700 Cc: freebsd-arm@freebsd.org Content-Transfer-Encoding: quoted-printable Message-Id: References: <20180624231020.GA11132@www.zefox.net> <20180626052451.GA17293@www.zefox.net> <201806261040.w5QAeBKq035183@donotpassgo.dyslexicfish.net> <20180626151843.GD17293@www.zefox.net> <3525D7C7-F848-45A1-BD85-2DAC895DF48C@yahoo.com> <20180626222834.GA20270@www.zefox.net> <28012DFB-37A0-461A-BB62-CD3EE61E82F0@yahoo.com> <20180627054027.GA22144@www.zefox.net> To: bob prohaska X-Mailer: Apple Mail (2.3445.8.2) X-BeenThere: freebsd-arm@freebsd.org X-Mailman-Version: 2.1.26 Precedence: list List-Id: "Porting FreeBSD to ARM processors." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 27 Jun 2018 06:30:58 -0000 On 2018-Jun-26, at 10:40 PM, bob prohaska wrote: > On Tue, Jun 26, 2018 at 07:09:09PM -0700, Mark Millard wrote: >>=20 >>=20 >> On 2018-Jun-26, at 3:28 PM, bob prohaska = wrote: >>=20 >>> On Tue, Jun 26, 2018 at 01:15:54PM -0700, Mark Millard wrote: >>>> On 2018-Jun-26, at 8:18 AM, bob prohaska = wrote: >>>>=20 >>>>> On Tue, Jun 26, 2018 at 07:37:59AM -0700, Mark Millard wrote: >>>>>>=20 >>>>>>=20 >>>>>> . . . >>>>>>=20 >>>>>> As I remember, Bob P. Did reproduce drive errors even without >>>>>> the problem drive being used for swapping. This too suggests >>>>>> (A) as separate activity. >>>>>>=20 >>>>> Indeed, it is a requirement. If the suspect device is used for = swapping >>>>> OOMA kills prevent the test from progressing to the point of = failure. >>>>>=20 >>>>=20 >>>> Looking back at http://www.zefox.net/~fbsd/rpi3/swaptests/ >>>> and information about /dev/da0 rive errors it does not >>>> appear that a combination with: >>>>=20 >>>> A) sufficient swap (> 1.5 GiByte total?) but no use of swap on >>>> any partition on /dev/da0 >>>> and: >>>> B) use of /dev/da0 for /usr/ and /var/ >>>> and: >>>> C) Records from the console showing errors (or notes >>>> indicating lack of such errors). >>>>=20 >>>> exists. So I was remembering incorrectly. >>>>=20 >>>> I'm not claiming such a combination is the best direction for >>>> the next tests, but absent such tests there is no >>>> compare/contrast to know if /dev/da0 would still get errors >>>> despite the system having sufficient swap present on other >>>> drives. Thus, I would not go so far as "is a requirement" on >>>> the evidence available. >>>>=20 >>>=20 >>> I just didn't bother to record successful runs. I'm logging one now. >>>=20 >>>> We do have evidence for the system having insufficient swap >>>> space: this context seems to have the current status "is >>>> sufficient but might not be necessary" for /dev/da0 >>>> getting drive errors. >>>>=20 >>> Not sure I understand here. Basically there seem to be three cases: >>> Enough swap not on da0, -j4 buildworld completes. >>> Any swap on da0, -j4 buildworld is killed by OOMA >>> Not enough swap not on da0, -j4 buildworld crashes the machine = eventually. > ^^^^^^^^^^ > OK, here's my error. The third case should have been > "not enough swap on mmcsd0".=20 >=20 >=20 >>>=20 >>> Are there other combinations I've overlooked? The first two don't = seem=20 >>> worth repeating, at least not often. >>=20 >> "buildworld completes with /dev/da0 errors" vs. "buildworld completes >> without /dev/da0 errors" (for: enough swap not on /dev/da0 with no >> swap on /dev/da0 ). >>=20 >> That is a little simplistic, as there can be multiple retries >> before FreeBSD gives up. Normal is no retries needed. Going >> from rare single retries to frequent multiple retries but no >> giving-up to it giving up sometimes is all abnormal as I >> understand. But there are degrees of abnormal. >>=20 >> And, yes, I have had past examples of significant drive reports >> during buildworld that let buildworld appear to complete. (Not >> that I trusted the result or the drive involved after such, at >> least as the drive was powered/connected at the time.) >>=20 >> For "any swap on da0" and "not enough swap not on da0" (with >> no swap on da0) I'd add to your descriptions: "with /dev/da0 >> errors" (again simplistic). >=20 > The only case where I've seen crashes and /dev/da0 errors is with > insufficient swap on mmcsd0. I've come to ignore OOMA kills as=20 > too familiar to be interesting.=20 "crashes and /dev/da0 errors": A) Any examples of /dev/da0 errors without crashes? B) Any examples of crashes without /dev/da0 errors? C) All examples that do either also does the other (so both always go together)? (I've having trouble parsing a specific meaning for the reference. I did not go back trough all the logs again to identify the combinations recorded.) For (A), have you tried any examples of: sufficient swap on mmcsd0 (or other such) with no swap on da0 (but /usr and /var on /dev/da0)? If yes, did you check on if there were /dev/da0 errors logged? What, if any, /dev/da0 errors where logged? None? For (B), have you tried any examples of: insufficient swap on (say) mmcsd0 and no use of the /dev/da0 drive that has reported errors at all, /usr/ and /var not on mmcsd0 (or whatever was used for swap) either? Did some drive end up reporting errors? Which? Did the system still crash as well? Have such test-context combinations been tried? Without any logs to look at for such alternatives, I can not try to compare/contrast such to the others examples. >>=20 >> This goes along with my suggestion to split the /dev/da0 >> error investigation from the investigations of OMMA behavior >> and crashing-the-machine: avoiding any confounding. >>=20 > =46rom what I've seen, OOMA isn't associated with da0 errors and = crashes. > To see the latter, OOMA must be avoided. =3D=3D=3D Mark Millard marklmi at yahoo.com ( dsl-only.net went away in early 2018-Mar)