From owner-freebsd-hackers@FreeBSD.ORG Thu Dec 10 18:34:16 2009 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8FDD7106576B for ; Thu, 10 Dec 2009 18:34:16 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 814058FC1B for ; Thu, 10 Dec 2009 18:34:15 +0000 (UTC) Received: from bigwig.baldwin.cx (66.111.2.69.static.nyinternet.net [66.111.2.69]) by cyrus.watson.org (Postfix) with ESMTPSA id 1462A46B38; Thu, 10 Dec 2009 13:34:15 -0500 (EST) Received: from jhbbsd.localnet (unknown [209.249.190.9]) by bigwig.baldwin.cx (Postfix) with ESMTPA id E11508A01B; Thu, 10 Dec 2009 13:34:13 -0500 (EST) From: John Baldwin To: ticso@cicely.de Date: Thu, 10 Dec 2009 11:13:56 -0500 User-Agent: KMail/1.12.1 (FreeBSD/7.2-CBSD-20091103; KDE/4.3.1; amd64; ; ) References: <237c27100911260714x2fcb194ew1e6ce11e764efd08@mail.gmail.com> <200912090907.33433.jhb@freebsd.org> <20091210145052.GX20668@cicely7.cicely.de> In-Reply-To: <20091210145052.GX20668@cicely7.cicely.de> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Message-Id: <200912101113.56570.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0.1 (bigwig.baldwin.cx); Thu, 10 Dec 2009 13:34:13 -0500 (EST) X-Virus-Scanned: clamav-milter 0.95.1 at bigwig.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-2.5 required=4.2 tests=AWL,BAYES_00,RDNS_NONE autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on bigwig.baldwin.cx Cc: freebsd-hackers@freebsd.org, Linda Messerschmidt Subject: Re: Superpages on amd64 FreeBSD 7.2-STABLE X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 10 Dec 2009 18:34:16 -0000 On Thursday 10 December 2009 9:50:52 am Bernd Walter wrote: > On Wed, Dec 09, 2009 at 09:07:33AM -0500, John Baldwin wrote: > > On Thursday 26 November 2009 10:14:20 am Linda Messerschmidt wrote: > > > It's not clear to me if this might be a problem with the superpages > > > implementation, or if squid does something particularly horrible to > > > its memory when it forks to cause this, but I wanted to ask about it > > > on the list in case somebody who understands it better might know > > > whats going on. :-) > > > > I talked with Alan Cox some about this off-list and there is a case that can > > cause this behavior if the parent squid process takes write faults on a > > superpage before the child process has called exec() then it can result in > > superpages being fragmented and never reassembled. Using vfork() should > > prevent this from happening. It is a known issue, but it will probably be > > some time before it is addressed. There is lower hanging fruit in other areas > > in the VM that will probably be worked on first. > > For me the whole threads puzzles me. > Especially because vfork is often called a solution. > > Scenario A > Parent with super page > fork/exec > This problem can happen because there is a race. > The parent now has it's super pages fragmented permanently!? > the child throws away his pages because of the exec!? > > Scenario B > Parent with super page > vfork/exec > This problem won't happen because the child has no pseudo copy of the > parents memory and then starts with a completely new map. Actually, the fact that vfork() doesn't let the parent execute until the child has called exec() also closes the race, as it were, and that was the primary reason in my mind for saying that vfork() would prevent it. > Scenario C > Parent with super page > fork/ no exec > The problem can happen because the child shares the same memory over > it's complete lifetime. > The parent can get it's super pages fragmented over time. > > I don't see a use case for scenario A, because vfork is there since > over 16 years. > I use fork myself, because it is easier sometimes, but people writing > big programms such as squid should know better. > If squid doesn't use vfork they likely have a reason. > With scenario C I don't see how vfork can help, since this is not a legal > case for vfork. > I use quid myself, but don't know how it handles it's childs. > But isn't the whole story about such slave childs that they share memory > with the master? - How can vfork be solution for this case? > How can fragmentation of super pages be avoided at all? In Linda's case it was a fork to run a log rotation binary, so it was A). For C) I think you would want to map the pages MAP_SHARED (or use minherit(2) with INHERIT_SHARE) in which case they would not be COW'd on fork() and you would keep the superpages. Assuming that you explicitly want to share the memory with your child processes you already have to do this to really do sharing anyway. -- John Baldwin