From owner-freebsd-stable@FreeBSD.ORG Thu Jul 11 12:47:10 2013 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id A0640F53 for ; Thu, 11 Jul 2013 12:47:10 +0000 (UTC) (envelope-from andre@freebsd.org) Received: from c00l3r.networx.ch (c00l3r.networx.ch [62.48.2.2]) by mx1.freebsd.org (Postfix) with ESMTP id 234201AB8 for ; Thu, 11 Jul 2013 12:47:09 +0000 (UTC) Received: (qmail 89015 invoked from network); 11 Jul 2013 13:37:44 -0000 Received: from c00l3r.networx.ch (HELO [127.0.0.1]) ([62.48.2.2]) (envelope-sender ) by c00l3r.networx.ch (qmail-ldap-1.03) with SMTP for ; 11 Jul 2013 13:37:44 -0000 Message-ID: <51DEA947.6060605@freebsd.org> Date: Thu, 11 Jul 2013 14:47:03 +0200 From: Andre Oppermann User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/20130509 Thunderbird/17.0.6 MIME-Version: 1.0 To: Steven Hartland Subject: Re: status of autotuning freebsd for 9.2 References: <51D90B9B.9080209@ixsystems.com> <51D92826.1070707@freebsd.org> <51D9B24B.8070303@ixsystems.com> <51DACE93.9050608@freebsd.org> <51DE6255.5000304@freebsd.org> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: stable@freebsd.org, re@freebsd.org, nonesuch@longcount.org, Alfred Perlstein X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 11 Jul 2013 12:47:10 -0000 On 11.07.2013 11:08, Steven Hartland wrote: > ----- Original Message ----- From: "Andre Oppermann" > >> On 08.07.2013 16:37, Andre Oppermann wrote: >>> On 07.07.2013 20:24, Alfred Perlstein wrote: >>>> On 7/7/13 1:34 AM, Andre Oppermann wrote: >>>>> Can you help me with with testing? >>>>> >>>> Yes. Please give me your proposed changes and I'll stand up a machine and give feedback. >>> >>> http://people.freebsd.org/~andre/mfc-autotuning-20130708.diff >> >> Any feedback from testers on this? The MFC window is closing soon. > > Few things I've noticed most of which look like issues against the original > patch and not the MFC but worth mentioning. > > 1. You've introduced a new tunable kern.maxmbufmem which is autosized but > doesnt seem to be exposed via a sysctl so it looks like there is no way > to determine what its actually set to? Good point. I've made it global and exposed as kern.ipc.maxmbufmem (RDTUN). > 2. There's a missmatch between the tuneable kern.ipc.nmbufs in tunable_mbinit > and the sysctl kern.ipc.nmbuf i.e. no 's'. That's a typo, fixed. > 3. Should kern.maxmbufmem be kern.ipc.maxmbufmem to sit along side all of > the other sysctls? Yes, see above. > 4. style issues: > * @@ -178,11 +202,13 @@ > ... > if (newnmbjumbo9 > nmbjumbo9&& Thanks. All fixed in r253204. > Finally out of interest what made us arrive at the various defaults for each > type as it looks like the ratios have changed? Before it was an arbitrary mess. Mbufs were not limited at all and the others to some random multiple of maxusers with the net limit ending up at some 25,000 mbuf clusters by default. Now default overall limit is set at 50% of all available min(physical, kmem_map) memory to prevent mbufs from monopolizing kernel memory and leave some space for other kernel structures and buffers as well as user-space programs. It can be raised to 3/4 of available memory by the tunable. 2K and 4K (page size) mbuf clusters can each go up to 25% of this mbuf memory. The former is dominantly used on the receive path and the latter in the send path. 9K and 16K jumbo mbuf clusters can each go up to 12.5% of mbuf memory. They are only used in the receive path if large jumbo MTUs on a network interface are active. Both are special in that their memory is contiguous in KVM and physical memory. This becomes problematic due to memory fragmentation after a short amount of heavy system use. I hope to deprecate them for 10.0. Network interfaces should use 4K clusters instead and chain them together for larger packets. All modern NICs support that. Only the early and limited DMA engines without scatter-gather capabilities required contiguous physical memory. They are long gone by now. The limit for mbufs itselfs is 12.5% of mbuf memory and should be at least as many as the sum of the cluster types. Each cluster requires an mbuf to which it is attached. Two examples on the revised mbuf sizing limits: 1GB KVM: 512MB limit for mbufs 419,430 mbufs 65,536 2K mbuf clusters 32,768 4K mbuf clusters 9,709 9K mbuf clusters 5,461 16K mbuf clusters 16GB RAM: 8GB limit for mbufs 33,554,432 mbufs 1,048,576 2K mbuf clusters 524,288 4K mbuf clusters 155,344 9K mbuf clusters 87,381 16K mbuf clusters These defaults should be sufficient for even the most demanding network loads. For additional information see: http://svnweb.freebsd.org/changeset/base/243631 -- Andre