From owner-freebsd-arch@FreeBSD.ORG  Tue Aug 16 20:00:15 2005
Return-Path: <owner-freebsd-arch@FreeBSD.ORG>
X-Original-To: freebsd-arch@freebsd.org
Delivered-To: freebsd-arch@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 2330116A41F
	for <freebsd-arch@freebsd.org>; Tue, 16 Aug 2005 20:00:15 +0000 (GMT)
	(envelope-from PeterJeremy@optushome.com.au)
Received: from mail27.syd.optusnet.com.au (mail27.syd.optusnet.com.au
	[211.29.133.168])
	by mx1.FreeBSD.org (Postfix) with ESMTP id 7191243D45
	for <freebsd-arch@freebsd.org>; Tue, 16 Aug 2005 20:00:13 +0000 (GMT)
	(envelope-from PeterJeremy@optushome.com.au)
Received: from cirb503493.alcatel.com.au
	(c220-239-19-236.belrs4.nsw.optusnet.com.au [220.239.19.236])
	by mail27.syd.optusnet.com.au (8.12.11/8.12.11) with ESMTP id
	j7GK0BNg023512
	(version=TLSv1/SSLv3 cipher=EDH-RSA-DES-CBC3-SHA bits=168 verify=NO);
	Wed, 17 Aug 2005 06:00:12 +1000
Received: from cirb503493.alcatel.com.au (localhost.alcatel.com.au [127.0.0.1])
	by cirb503493.alcatel.com.au (8.12.10/8.12.10) with ESMTP id
	j7GK0BSR026740; Wed, 17 Aug 2005 06:00:11 +1000 (EST)
	(envelope-from pjeremy@cirb503493.alcatel.com.au)
Received: (from pjeremy@localhost)
	by cirb503493.alcatel.com.au (8.12.10/8.12.9/Submit) id j7GK0AmM026739; 
	Wed, 17 Aug 2005 06:00:10 +1000 (EST) (envelope-from pjeremy)
Date: Wed, 17 Aug 2005 06:00:10 +1000
From: Peter Jeremy <PeterJeremy@optushome.com.au>
To: Luigi Rizzo <rizzo@icir.org>
Message-ID: <20050816200010.GJ13959@cirb503493.alcatel.com.au>
References: <42F9ECF2.8080809@freebsd.org> <200508101638.27087.jhb@FreeBSD.org>
	<42FA6E0E.4070205@samsco.org> <200508111121.46546.jhb@FreeBSD.org>
	<20050816051231.D66550@xorpc.icir.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20050816051231.D66550@xorpc.icir.org>
User-Agent: Mutt/1.4.2i
Cc: freebsd-arch@freebsd.org
Subject: Re: Special schedulers, one CPU only kernel, one only userland
X-BeenThere: freebsd-arch@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Discussion related to FreeBSD architecture <freebsd-arch.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-arch>,
	<mailto:freebsd-arch-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-arch>
List-Post: <mailto:freebsd-arch@freebsd.org>
List-Help: <mailto:freebsd-arch-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-arch>,
	<mailto:freebsd-arch-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Tue, 16 Aug 2005 20:00:15 -0000

On Tue, 2005-Aug-16 05:12:31 -0700, Luigi Rizzo wrote:
>reading this thread, and at times looking at some of the kernel code,
>with plenty of places where you have to drop a lock that you
>already have, do some small thing and then reacquire the lock itself,
>makes me wonder if we don't need a better mechanism/abtraction for
>this kind of programming.

There are two distinct cases where this is done:
1) The thread needs to call another function which can potentially sleep
   and therefore needs to drop any locks it is holding before calling
   the function and re-acquire them later.
2) The thread needs to execute for an excessive period whilst holding
   the lock and explicitly releases it occasionally to allow other
   threads to run (eg much of the vnode/buffer/page scanning code).

For the second case, we could create a function that checked if
another thread was waiting on the lock and, if so, released the lock
and grabbed it again after sleeping.  This is fairly easy for a single
lock but would be quite messy to implement for multiple locks - since
you need to release/acquire them in the correct order to prevent
deadlocks.

The first case again seems easy at first glance:  Add a "I'm happy
to release this lock" flag to each lock.  Before calling a function
that might block, you set the flag.  When the function calls sleep(9),
the code releases the locks and re-acquires them before returning.
There are three large gotchas:
1) Since sleeping might have changed the state of some structures
   that the thread is working on, it needs to re-validate its
   internal state to ensure that it's not working with stale data.
   This means the thread needs to check if it slept or not after
   each point where it could potentially sleep - and I suspect
   this amounts to most of the non-boilerplate code in the existing
   re-acquire lock section.
2) You have to ensure that either the release/acquire is atomic for
   all locks or the ordering is correct.
3) You need some way to pass back a permanent failure to re-acquire
   a requested lock (maybe a piece of hardware went away).

>In a way, this seems similar to the handling of interrupts:
>if we want a thread to be interrupted we don't check for interrupts
>(and save and restore state) explicitly at every instruction, but
>rely on the processor doing the right thing for us.

In the case of locks, restoring the previous state may not be
correct.  Whilst I hold a lock on object foo, I know its state will
remain unchanged (unless I change it) and can therefore safely cache
information about the object.  If I release/reacquire the lock, the
object may have changed in arbitrary ways and I need to ensure that
any cached information is still correct (and correct it if it's not).

Overall, you could probably make the code cleaner (and maybe more
efficient) but you can't make it totally transparent.

-- 
Peter Jeremy