From: Andrew Gallatin
Date: Fri, 5 Jul 2002 08:03:34 -0400 (EDT)
To: "Kenneth D. Merry"
Cc: Bosko Milekic, current@FreeBSD.ORG, net@FreeBSD.ORG
Subject: Re: virtually contig jumbo mbufs (was Re: new zero copy sockets snapshot)
In-Reply-To: <20020704231321.A42134@panzer.kdm.org>

Kenneth D. Merry writes:
 <...>
 > > Yes, it certainly confirms the virtual-based caching assumptions.  I
 > > would like to provide virtually contiguous large buffers and believe I
 > > can do that via mb_alloc... however, they would be several wired-down
 > > pages.  Would this be in line with the requirements that these buffers
 > > would have, in your mind?  (Wired-down means that your buffers will
 > > come out exactly as they would out of malloc(), so if you were using
 > > malloc() already, I'm assuming that wired-down is OK.)
 > >
 > > I think I can allocate the jumbo buffers via mb_alloc from the same map
 > > as I allocate clusters from - the clust_map - and keep them in
 > > buckets/slabs in per-CPU caches, like I do for mbufs and regular
 > > clusters right now.  Luigi is in the process of doing some optimisation
 > > work around mb_alloc and I'll probably be doing the SMP-specific stuff
 > > after he's done, so once that's taken care of we can take a stab at
 > > this if you think it's worth it.
 >
 > If you do implement this, it would also be nice to have some sort of
 > standardized page-walking function to extract the physical addresses.
 > (Otherwise every driver will end up implementing its own loop to do it.)

Good point.  But this sounds like it belongs as a part of the
(currently unimplemented) busdma infrastructure for mbufs.
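Roughly the sort of loop I have in mind, strictly as a sketch: the
sg_seg struct and m_extract_segs() are made-up names for illustration,
the buffer is assumed wired and virtually contiguous, and it leans on
vtophys(); whatever busdma support we grow for mbufs would presumably
replace it.

/*
 * Sketch only; assumes the usual <sys/param.h>, <vm/vm.h> and
 * <machine/pmap.h> includes for PAGE_SIZE, PAGE_MASK and vtophys().
 */
struct sg_seg {
        vm_offset_t     ss_paddr;       /* physical address of this piece */
        int             ss_len;         /* length of this piece */
};

static int
m_extract_segs(caddr_t buf, int len, struct sg_seg *segs, int maxsegs)
{
        int nsegs, seglen;

        for (nsegs = 0; len > 0; nsegs++) {
                if (nsegs == maxsegs)
                        return (-1);    /* too fragmented for this NIC */
                /* Each piece runs at most to the end of the current page. */
                seglen = PAGE_SIZE - ((vm_offset_t)buf & PAGE_MASK);
                if (seglen > len)
                        seglen = len;
                segs[nsegs].ss_paddr = vtophys((vm_offset_t)buf);
                segs[nsegs].ss_len = seglen;
                buf += seglen;
                len -= seglen;
        }
        return (nsegs);
}

A driver would call this once per buffer and hand the resulting list
to its S/G descriptors, falling back to a copy into a contiguous
buffer if it gets -1 back.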
 > We also may want to examine what sort of guarantees, if any, we can make
 > about the physical page alignment of the allocated mbuf.  I.e., if we can
 > guarantee that the mbuf data segment will start on a physical page boundary
 > (if it is at least a page in size), that would allow device drivers to
 > guarantee that they could fit a jumbo frame (9000 bytes) into 3
 > scatter/gather segments on an i386.
 >
 > The number of scatter/gather segments used is important for some boards,
 > like ti(4), because they have a limited number of scatter/gather segments
 > available.  In the case of ti(4), it is 4 S/G segments, which is enough to
 > handle the maximum number of physical data chunks it would take to compose
 > a 9K virtual buffer.  (You could start in the middle of a page, have two
 > complete pages, and then end with a partial page.)
 >
 > I suppose it would be good to see what NIC drivers in the tree can receive
 > into or send from multiple chunks of data, and what their requirements are
 > (how many scatter/gather segments they can handle, what the maximum MTU is,
 > etc.).

If you're just looking at the code, then this would be hard.  All the
current drivers (with the exception of em) are coded to take one
physically contiguous private buffer.  I'm pretty sure that most of
them are capable of doing scatter DMA, but I don't have the
programming docs.

Drew
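For the record, a quick back-of-the-envelope check of the 3-vs-4
segment counts quoted above, using the i386's 4096-byte pages.  The
sg_segs() helper here is a throwaway illustration, not a proposed
interface:

/*
 * Throwaway userland check of the segment counts above; build with
 * "cc -o segs segs.c".
 */
#include <stdio.h>

#define PAGE_SIZE       4096

/* Worst case starts one byte before a page boundary. */
static int
sg_segs(int len, int aligned)
{
        int first = aligned ? PAGE_SIZE : 1;    /* usable bytes in first page */

        if (len <= first)
                return (1);
        return (1 + (len - first + PAGE_SIZE - 1) / PAGE_SIZE);
}

int
main(void)
{
        printf("9000 bytes, page aligned:      %d segments\n", sg_segs(9000, 1));
        printf("9000 bytes, worst-case offset: %d segments\n", sg_segs(9000, 0));
        return (0);
}

That prints 3 and 4 respectively, which matches the ti(4) case above:
its 4 available segments cover the worst-case layout of a 9K virtually
contiguous buffer.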