From owner-freebsd-bugs@freebsd.org Sat Aug 12 22:35:17 2017 Return-Path: Delivered-To: freebsd-bugs@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 897BBDCDC24 for ; Sat, 12 Aug 2017 22:35:17 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2001:1900:2254:206a::16:76]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 77FBB7490B for ; Sat, 12 Aug 2017 22:35:17 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from bugs.freebsd.org ([127.0.1.118]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id v7CMZHsA046241 for ; Sat, 12 Aug 2017 22:35:17 GMT (envelope-from bugzilla-noreply@freebsd.org) From: bugzilla-noreply@freebsd.org To: freebsd-bugs@FreeBSD.org Subject: [Bug 220971] Freebsd 11.0p11 - system freeze on intensive I/O Date: Sat, 12 Aug 2017 22:35:17 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: kern X-Bugzilla-Version: 11.0-RELEASE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Some People X-Bugzilla-Who: markmi@dsl-only.net X-Bugzilla-Status: New X-Bugzilla-Resolution: X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: freebsd-bugs@FreeBSD.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 12 Aug 2017 22:35:17 -0000 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D220971 --- Comment #8 from Mark Millard --- (In reply to execve from comment #7) I tried a couple of variations of the experiment that I suggested. Unfortunately the results are a little complicated to interpret. Context: under virtualbox (on Windows 10 Pro) with. . . (Bugzilla 206048 has pointed out reproducibility under virtual machines.) FreeBSDx64OPC11S# uname -apKU FreeBSD FreeBSDx64OPC11S 11.1-STABLE FreeBSD 11.1-STABLE r322433M amd64 a= md64 1101501 1101501 # svnlite diff /usr/src/ Index: /usr/src/sys/amd64/conf/GENERIC =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D --- /usr/src/sys/amd64/conf/GENERIC (revision 322433) +++ /usr/src/sys/amd64/conf/GENERIC (working copy) @@ -24,7 +24,8 @@ makeoptions DEBUG=3D-g # Build kernel with gdb(1) debug symbols makeoptions WITH_CTF=3D1 # Run ctfconvert(1) for DTrace su= pport -options SCHED_ULE # ULE scheduler +#options SCHED_ULE # ULE scheduler +options SCHED_4BSD # 4BSD scheduler options PREEMPTION # Enable kernel thread preemption options INET # InterNETworking options INET6 # IPv6 communications protocols I tried: 4 processors and 1 GiBYte of RAM assigned using: stress -d 2 -m 3 --vm-keep and separately: 8 processors and 1 GiByte of RAM assigned using: stress -d 6 -m 3 --vm-keep I had a top -Cawopid running in each case with its own ssh into the virtual machine. stress was via ssh as well. In the 2nd case I got to a lock-up: top stopped updating and input was ignored to both the ssh's (top and stress) and the console window, including input such as ^C and ^T . The console window did eventually show: swap_pager: I/O error - pageout failed; blkno 7367,size 4096, error 12 (After seeing that I waited a while longer but I gave up on waiting and eventually killed the virtual machine.) I later found a list message reporting about such "error 12" variants of the message: QUOTE > I think it might be ENOMEM from a geom when trying to g_clone_bio. . . . It shouldn't happen, but you should notice no ill effects (that is, the page isn't lost, it just wasn't paged out and there's a few bytes less that the pager could do at the moment). END QUOTE. As for the lock-up structure. . . Unfortunately top did not happen to update showing any of the lock up structure in other processes before it locked up. It does at least appear not as easy to get a lock-up (or get ENOMEM and failure to page out) with SCHED_4BSD (to the degree that just a couple of tests indicate anything about such). But getting stuck appears possible and pageout's can fail to happen for lack of memory, or so it appears. --=20 You are receiving this mail because: You are the assignee for the bug.=