From owner-freebsd-fs  Mon Aug  5 19:00:17 1996
Return-Path: owner-fs
Received: (from root@localhost) by freefall.freebsd.org (8.7.5/8.7.3)
	id TAA08217 for fs-outgoing; Mon, 5 Aug 1996 19:00:17 -0700 (PDT)
Received: from parkplace.cet.co.jp (parkplace.cet.co.jp [202.32.64.1])
	by freefall.freebsd.org (8.7.5/8.7.3) with ESMTP id TAA08209
	for ; Mon, 5 Aug 1996 19:00:11 -0700 (PDT)
Received: from localhost (michaelh@localhost) by parkplace.cet.co.jp
	(8.7.5/CET-v2.1) with SMTP id BAA24468; Tue, 6 Aug 1996 01:59:09 GMT
Date: Tue, 6 Aug 1996 10:59:09 +0900 (JST)
From: Michael Hancock
Reply-To: Michael Hancock
To: Terry Lambert
cc: dfr@render.com, jkh@time.cdrom.com, tony@fit.qut.edu.au,
	freebsd-fs@freebsd.org
Subject: Per fs vnode pools (was Re: NFS Diskless Dispare...)
In-Reply-To: <199608051859.LAA11723@phaeton.artisoft.com>
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-fs@freebsd.org
X-Loop: FreeBSD.org
Precedence: bulk

[Moved to fs from current]

Thanks.  Previously, I didn't see how an inactive nfs vnode would be
reclaimed and moved to an ffs vnode pool cleanly.  The generic interface
takes care of all this.

It looks like a win in terms of performance and ease of new fs development,
at the expense of a little space.

Regards,

Mike Hancock

On Mon, 5 Aug 1996, Terry Lambert wrote:

> > I think what he's saying is that when the vnodes are in the global pool
> > the chances of reusing a vnode that was used previously by a particular fs
> > is less than having a per fs vnode pool.
>
> No, it's not.
>
> > The problem with the per fs vnode pool is the management overhead.  When
> > you need to start reusing vnodes you need to search through all the
> > different fs pools to find a vnode.
> >
> > I don't know which is a better trade-off.
>
> This isn't how per FS vnode pools should work.
>
> When you want a vnode, you call the generic "getnewvnode()" from the
> XXX_vget routine via VFS_VGET (sys/mount.h).
>
> This function returns a vnode with an FS specific inode.
>
> In reality, you never care to have a vnode without an FS specific inode,
> since there is no way to access or write buffers hung off the critter
> because of the way vclean works.
>
> What I'm suggesting is that there needs to be both a VFS_VGET and
> a VFS_VPUT (or VFS_VRELE).  With the additional per fs release
> mechanism, each FS instance can allocate an inode pool at its
> instantiation (or do it on a per instance basis, the current
> method which makes inode allocation so slow...).
>
> Consider UFS: the in core inode struct consists of a bunch of in core
> data elements (which should probably be in their own structure) and
> a "struct dinode i_din" for the on disk inode.
>
> You could modify this as:
>
> struct inode {
> 	struct icinode	i_ic;	/* in core inode */
> 	struct vnode	i_iv;	/* vnode for inode */
> 	struct dinode	i_din;	/* on disk inode */
> };
>
> Essentially, allocation of an inode would allocate a vnode.  There
> would never be an inode without a vnode.
>
> The VFS_VPUT would put the vnode into a pool maintained by the
> FS per fs instance (the in core fs structure would need an
> additional structure element to point to the maintenance data).
>
> The FS itself would use generic maintenance routines shared by
> all FS's... and capable of taking a structure size for i_ic and
> i_din element size variations between FS types.  This would
> maintain all common code in the common interface.
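
[A small user-space sketch of the per-fs pool Terry describes above, not
part of his message.  The idea is that one set of maintenance routines,
parameterized only by the element size each fs supplies, could back
VFS_VGET/VFS_VPUT for every fs type; the names vnpool, vnpool_init,
vnpool_get and vnpool_put are made up for illustration.]

#include <stdlib.h>
#include <stddef.h>

/*
 * Illustrative only -- generic maintenance data that would be pointed to
 * from the per-instance fs structure.  The element size is whatever the
 * fs says its combined { in core inode, vnode, on disk inode } structure
 * needs, so the same routines can serve ufs, nfs, union mounts, etc.
 */
struct vnpool {
	void   *vp_free;	/* free list of inactive vnode/inode pairs */
	size_t  vp_size;	/* per-fs element size */
};

void
vnpool_init(struct vnpool *pool, size_t elemsize)
{
	pool->vp_free = NULL;
	/* an idle element must have room for the free-list link */
	pool->vp_size = elemsize < sizeof(void *) ? sizeof(void *) : elemsize;
}

/* The VFS_VGET side: reuse an inactive pair, or allocate a fresh one. */
void *
vnpool_get(struct vnpool *pool)
{
	void *elem;

	if ((elem = pool->vp_free) != NULL) {
		pool->vp_free = *(void **)elem;
		return (elem);
	}
	return (calloc(1, pool->vp_size));	/* vnode + inode in one shot */
}

/* The VFS_VPUT side: the pair goes back to its own fs, not a global pool. */
void
vnpool_put(struct vnpool *pool, void *elem)
{
	*(void **)elem = pool->vp_free;
	pool->vp_free = elem;
}

[At mount time each fs instance would initialize its own pool with the
size of its combined structure and never touch a global free list again.]
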
>
> The use of the vget to associate naked vnodes with the FS's would
> go away; in no case is a naked vnode ever useful, since using vnode
> buffer elements requires an FS context.
>
> In effect, the ihash would become a vnhash and LRU for use in
> reclaiming vnode/inode pairs.  This would be much more efficient
> than the current dual allocation sequence.
>
> This would allow the discard of the vclean interface, and of the
> lock used to ensure it operates (a lock which has to be reimplemented
> and reimplemented correctly on a per FS basis in the XXX_LOCK and
> XXX_UNLOCK FS specific routines).
>
> The vnode locking could then be done in common code:
>
> vn_lock( vp, flags, p)
> 	struct vnode	*vp;
> 	int		flags;
> 	struct proc	*p;
> {
> 	/* actual lock */
> 	if( ( st = ...) == SUCCESS) {
> 		if( ( st = VOP_LOCK( vp, flags, p)) != SUCCESS) {
> 			/* lock was vetoed, undo actual lock */
> 			...
> 		}
> 	}
> 	return( st);
> }
>
> The point here is that the lock contention (if any) can be resolved
> without ever hitting the FS itself in the failure case.
>
> The generic case of the per FS lock is now:
>
> int
> XXX_lock(ap)
> 	struct vop_lock_args /* {
> 		struct vnode *a_vp;
> 		int a_flags;
> 		struct proc *a_p;
> 	} */ *ap;
> {
> 	return( SUCCESS);
> }
>
> This is much harder to screw up when writing a new FS, and makes for much
> smaller intermediate layers.
>
> For NFS and unions, there isn't an i_din... but they also require data
> hung off the vnode, so the same allocation rules apply.  It's a win
> either way, and has the side benefit of unmunging the vn.
>
> I believe that John Heidemann's thesis had this in mind when it refers
> to using an RPC layer to use remote file system layers as intermediates
> in a local VFS stack.
>
> 					Terry Lambert
> 					terry@lambert.org
> ---
> Any opinions in this posting are my own and not those of my present
> or previous employers.
>

--
michaelh@cet.co.jp                                http://www.cet.co.jp
CET Inc., Daiichi Kasuya BLDG 8F 2-5-12, Higashi Shinbashi, Minato-ku,
Tokyo 105 Japan          Tel: +81-3-3437-1761 Fax: +81-3-3437-1766
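
[A compilable user-space rendering of the common-lock-with-veto scheme
sketched in the quoted message, not part of it.  A pthread mutex stands
in for the real vnode lock, the v_op_lock hook stands in for VOP_LOCK,
and the struct proc argument is dropped; none of these names are the
actual kernel interfaces.]

#include <pthread.h>

#define SUCCESS	0

struct vnode {
	pthread_mutex_t v_interlock;		/* stand-in for the "actual lock" */
	int (*v_op_lock)(struct vnode *, int);	/* per-fs veto hook (VOP_LOCK stand-in) */
};

/* The generic per-fs case: nothing to veto, so nothing to get wrong. */
int
generic_fs_lock(struct vnode *vp, int flags)
{
	(void)vp;
	(void)flags;
	return (SUCCESS);
}

/* Common code: lock contention is resolved here, before the fs is called. */
int
vn_lock(struct vnode *vp, int flags)
{
	int st;

	if ((st = pthread_mutex_lock(&vp->v_interlock)) == SUCCESS) {
		if ((st = vp->v_op_lock(vp, flags)) != SUCCESS)
			pthread_mutex_unlock(&vp->v_interlock);	/* vetoed: undo */
	}
	return (st);
}

int
vn_unlock(struct vnode *vp)
{
	return (pthread_mutex_unlock(&vp->v_interlock));
}

[The property being illustrated is the one the message argues for: the
contention/failure path never enters fs-specific code at all.]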