Date: Fri, 11 Apr 2008 13:59:54 +0100 From: David Malone <dwmalone@maths.tcd.ie> To: Jeff Roberson <jroberson@jroberson.net> Cc: cvs-src@FreeBSD.org, Jeff Roberson <jeff@FreeBSD.org>, src-committers@FreeBSD.org, cvs-all@FreeBSD.org Subject: Re: cvs commit: src/sys/ufs/ufs dirhash.h ufs_dirhash.c Message-ID: <200804111359.aa80973@walton.maths.tcd.ie> In-Reply-To: Your message of "Fri, 11 Apr 2008 02:32:49 -1000." <20080411021519.E43186@desktop>
next in thread | previous in thread | raw e-mail | index | archive | help
> Now instead of freeing a recycled dirhash pointer in ip->i_dirhash I lock > it and re-create it. This makes various races simpler. The dirhash has > to be minimally constructed for dirhash_free_locked to function correctly > if we bail out early. Ah - OK - I think that probably partially explains your accounting problems, mentioned below, but it may not be worth fixing. It is probably worth looking over to make sure that it isn't just hiding some other problem. Dirhash has been quite good at finding bugs in other bits of code. > The conditions in ufsdirhash_build() which result in creating a new > directory hash remain the same except we are more agressive about > reclaiming when the sysctl is lowered to ease testing. I suspected that, but was having trouble seeing it. > In the normal ufs_lookup() path we enter 3 dirhash functions which can now > assume the lock is held rather than locking/unlocking. And in lookup in > particular we are no longer required to drop and reacquire the lock, > potentially multiple times. I'm sure it's a net reduction in lock > operations. That really wasn't the primary motivation however. I mostly > wanted to enable shared locking of directories. I doubt the number of mtx > operations is dominating the performance of directory operations in any > event. Agreed. > > > > In the case of multiple directories, I guess we'll have two > > mutex locks replaced by one mutex and one lockmanager > > (shared) lock? This is probably the usual case, which isn't > > too bad. > I'm not sure I understand this. I was probably thinking out aloud - please feel free to ignore it ;-) > > should actually be kept. Actually, this probably isn't > > possible because the list lock is dropped while we're doing > > the free? > Because we drop the list lock we have to restart the processing from the > beginning. OK - understood. > I guess it is a risk that we could have many locked dirhashes > that we'll have to skip causing the recycle loop to potentially take a > long time. However, the old code was strange here as well. It'd only > examine the head of the list and only recycled if the score reached 0. That was the thing it had been designed to do. It only checked the head of the list because the objects that are candidates for recycling are at the head of the list. The idea was to keep plucking thing from the head until we got far enough down that the object looked like it was still active. > Really we could just make this a simple LRU and be done with it. The > score seems redundant. If I remember correctly, that isn't a good idea. I think Ian found that pure LRU didn't work that well for recycling, which is why the hybrid LRU/LOU scheme is in place. LRU resulted in thrashing when a large number of directories were being accessed at the same time. Roughly what happened was you would access the directory, build the whole hash, use it once and then discard it before another access. Since building the whole hash is a little more costly than a single linear access, this resulted in a net loss. By ensuring that an established hashes are used for slightly longer, you could avoid this hit. David.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200804111359.aa80973>