From owner-freebsd-current@FreeBSD.ORG  Mon Aug 27 07:42:30 2012
Return-Path: <owner-freebsd-current@FreeBSD.ORG>
Delivered-To: current@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id B0878106566C;
	Mon, 27 Aug 2012 07:42:30 +0000 (UTC) (envelope-from alc@rice.edu)
Received: from mh11.mail.rice.edu (mh11.mail.rice.edu [128.42.199.30])
	by mx1.freebsd.org (Postfix) with ESMTP id 7B7FB8FC12;
	Mon, 27 Aug 2012 07:42:30 +0000 (UTC)
Received: from mh11.mail.rice.edu (localhost.localdomain [127.0.0.1])
	by mh11.mail.rice.edu (Postfix) with ESMTP id E30314C02AD;
	Mon, 27 Aug 2012 02:42:29 -0500 (CDT)
Received: from mh11.mail.rice.edu (localhost.localdomain [127.0.0.1])
	by mh11.mail.rice.edu (Postfix) with ESMTP id E16834C02A5;
	Mon, 27 Aug 2012 02:42:29 -0500 (CDT)
X-Virus-Scanned: by amavis-2.7.0 at mh11.mail.rice.edu, auth channel
Received: from mh11.mail.rice.edu ([127.0.0.1])
	by mh11.mail.rice.edu (mh11.mail.rice.edu [127.0.0.1]) (amavis,
	port 10026)
	with ESMTP id CRI5M8imhf6D; Mon, 27 Aug 2012 02:42:29 -0500 (CDT)
Received: from adsl-216-63-78-18.dsl.hstntx.swbell.net
	(adsl-216-63-78-18.dsl.hstntx.swbell.net [216.63.78.18])
	(using TLSv1 with cipher RC4-MD5 (128/128 bits))
	(No client certificate requested) (Authenticated sender: alc)
	by mh11.mail.rice.edu (Postfix) with ESMTPSA id 75F024C0268;
	Mon, 27 Aug 2012 02:42:29 -0500 (CDT)
Message-ID: <503B24E4.6090701@rice.edu>
Date: Mon, 27 Aug 2012 02:42:28 -0500
From: Alan Cox <alc@rice.edu>
User-Agent: Mozilla/5.0 (X11; FreeBSD i386;
	rv:8.0) Gecko/20111113 Thunderbird/8.0
MIME-Version: 1.0
To: Luigi Rizzo <rizzo@iet.unipi.it>
References: <20120822120105.GA63763@onelab2.iet.unipi.it>
	<CAJUyCcPOte19TJXpCVAskhf+Dia_Zg5uj6J_idW67rGsOLaZXw@mail.gmail.com>
	<20120823163145.GA3999@onelab2.iet.unipi.it>
	<50366398.2070700@rice.edu>
	<20120823174504.GB4820@onelab2.iet.unipi.it>
	<50371485.1020409@rice.edu>
	<20120824145708.GA16557@onelab2.iet.unipi.it>
	<5037A803.6030100@rice.edu>
	<20120824165428.GA17495@onelab2.iet.unipi.it>
	<5037B226.3000103@rice.edu>
	<20120826171126.GA40672@onelab2.iet.unipi.it>
In-Reply-To: <20120826171126.GA40672@onelab2.iet.unipi.it>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Cc: alc@freebsd.org, current@freebsd.org
Subject: Re: less aggressive contigmalloc ?
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Discussions about the use of FreeBSD-current
	<freebsd-current.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>, 
	<mailto:freebsd-current-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-current>
List-Post: <mailto:freebsd-current@freebsd.org>
List-Help: <mailto:freebsd-current-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Mon, 27 Aug 2012 07:42:30 -0000

On 08/26/2012 12:11, Luigi Rizzo wrote:
> On Fri, Aug 24, 2012 at 11:56:06AM -0500, Alan Cox wrote:
>> On 08/24/2012 11:54, Luigi Rizzo wrote:
>>> On Fri, Aug 24, 2012 at 11:12:51AM -0500, Alan Cox wrote:
>>>> On 08/24/2012 09:57, Luigi Rizzo wrote:
>>>>> On Fri, Aug 24, 2012 at 12:43:33AM -0500, Alan Cox wrote:
>>>>>> On 08/23/2012 12:45, Luigi Rizzo wrote:
>>>>>>> On Thu, Aug 23, 2012 at 12:08:40PM -0500, Alan Cox wrote:
>>>>>>> ...
>>>>>>>>> yes i do see that.
>>>>>>>>>
>>>>>>>>> Maybe less aggressive with M_NOWAIT but still kills processes.
>>>>>>>> Are you compiling world with MALLOC_PRODUCTION?  The latest version of
>>>>>>> whatever the default is. But:
>>>>>>>
>>>>>>>> jemalloc uses significantly more memory when debugging options are
>>>>>>>> enabled.  This first came up in a thread titled "10-CURRENT and swap
>>>>>>>> usage" back in June.
>>>>>>>>
>>>>>>>> Even at its most aggressive, M_WAITOK, contigmalloc() does not directly
>>>>>>>> kill processes.  If process death coincides with the use of
>>>>>>>> contigmalloc(), then it is simply the result of earlier, successful
>>>>>>>> contigmalloc() calls, or for that matter any other physical memory
>>>>>>>> allocation calls, having depleted the pool of free pages to the point
>>>>>>>> that the page daemon runs and invokes vm_pageout_oom().
>>>>>>> does it mean that those previous allocations relied on memory
>>>>>>> overbooking ?
>>>>>> Yes.
>>>>>>
>>>>>>> Is there a way to avoid that, then ?
>>>>>> I believe that malloc()'s default minimum allocation size is 4MB.  You
>>>>>> could reduce that.
>>>>>>
>>>>>> Alternatively, you can enable MALLOC_PRODUCTION.
>>>>> i tried this, and as others mentioned it makes life
>>>>> better and reduces the problem but contigmalloc still triggers
>>>>> random process kills.
>>>> I would be curious to see a stack backtrace when vm_pageout_oom() is
>>>> called.
>>> you mean a backtrace of the process(es) that get killed ?
>> No, a backtrace showing who called vm_pageout_oom().  Simply add a
>> kdb_backtrace() call at the start of vm_pageout_oom().  There are two
>> possibilities.  I want to know which it is.
> this is dmesg when I add kdb_backtrace() at the start of vm_pageout_oom().
> The '... netmap_finalize_obj_allocator ...' lines are from my calls to
> contigmalloc, each one doing one-page allocations.

These calls are made with M_WAITOK?

> I get 7-8 'KDB: stack backtrace' blocks, then allocations
> restart successfully, then more failures...
> The reference to fork_exit() does not seem right, because i am
> in a block where i call contigmalloc, so the caller of
> vm_pageout_grow_cache() should be kmem_alloc_contig().

Try this instead.  At the start of vm_pageout_oom(), print the value of 
its parameter "shortage".  That will uniquely identify the caller.
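
A rough userland sketch of what that print amounts to, in case it helps.
The constants VM_OOM_MEM and VM_OOM_SWAPZ and their values are as I recall
them from sys/vm/vm_pageout.h, so verify them against your tree.  The helper
names oom_caller() and format_oom_trace() and the caller descriptions are
hypothetical, for illustration only.  In the kernel itself a single printf()
of "shortage" at the top of vm_pageout_oom() is all that is needed.

```c
#include <assert.h>
#include <stdio.h>

/* Assumed values; check sys/vm/vm_pageout.h in your source tree. */
#define VM_OOM_MEM	1
#define VM_OOM_SWAPZ	2

/*
 * Hypothetical helper: map the "shortage" argument to a description of
 * the call site.  The descriptions are my reading, not kernel text.
 */
static const char *
oom_caller(int shortage)
{
	switch (shortage) {
	case VM_OOM_MEM:
		return ("page daemon: out of memory and swap");
	case VM_OOM_SWAPZ:
		return ("swap pager: swap zone exhausted");
	default:
		return ("unknown caller");
	}
}

/*
 * Stand-in for the diagnostic line.  Formats the message into buf and
 * returns the snprintf() result (length the full message would need).
 */
static int
format_oom_trace(char *buf, size_t len, int shortage)
{
	assert(buf != NULL && len > 0);
	return (snprintf(buf, len, "vm_pageout_oom: shortage = %d (%s)",
	    shortage, oom_caller(shortage)));
}
```

Since each caller passes a different constant, the printed value alone tells
the two cases apart, which the backtrace above does not do reliably.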

> 630.004926 netmap_finalize_obj_allocator [593] cluster at 8910 ok
> 630.005563 netmap_finalize_obj_allocator [593] cluster at 8912 ok
> 630.006077 netmap_finalize_obj_allocator [593] cluster at 8914 ok
> KDB: stack backtrace:
> X_db_sym_numargs() at X_db_sym_numargs+0x1aa
> vm_pageout_oom() at vm_pageout_oom+0x19
> vm_pageout_grow_cache() at vm_pageout_grow_cache+0xd01
> fork_exit() at fork_exit+0x11c
> fork_trampoline() at fork_trampoline+0xe
> --- trap 0, rip = 0, rsp = 0xffffff8005f12cb0, rbp = 0 ---
> KDB: stack backtrace:
> X_db_sym_numargs() at X_db_sym_numargs+0x1aa
> vm_pageout_oom() at vm_pageout_oom+0x19
> vm_pageout_grow_cache() at vm_pageout_grow_cache+0xd01
> fork_exit() at fork_exit+0x11c
> fork_trampoline() at fork_trampoline+0xe
> --- trap 0, rip = 0, rsp = 0xffffff8005f12cb0, rbp = 0 ---
> ...
>
> Some of the processes must be 'getty' because i also find
> this line in dmesg:
>
> <118>Aug 26 16:47:11 init: getty repeating too quickly on port /dev/ttyv7, sleeping 30 secs
>
> cheers
> luigi
>