Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 3 Oct 2007 13:11:01 +1000 (EST)
From:      Bruce Evans <brde@optusnet.com.au>
To:        Jeff Roberson <jroberson@chesapeake.net>
Cc:        cvs-all@FreeBSD.org, src-committers@FreeBSD.org, cvs-src@FreeBSD.org, Jeff Roberson <jeff@FreeBSD.org>, Garance A Drosehn <gad@FreeBSD.org>, Ben Kaduk <minimarmot@gmail.com>, Bruce Evans <brde@optusnet.com.au>
Subject:   Re: cvs commit: src/sys/kern sched_ule.c 
Message-ID:  <20071003123512.W14348@delplex.bde.org>
In-Reply-To: <20071002133334.W594@10.0.0.1>
References:  <20071001145257.EC9FC4500F@ptavv.es.net> <20071002133623.X40629@besplex.bde.org> <20071001232743.Q539@10.0.0.1> <20071002213829.F12287@delplex.bde.org> <20071002133334.W594@10.0.0.1>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, 2 Oct 2007, Jeff Roberson wrote:

> On Tue, 2 Oct 2007, Bruce Evans wrote:

> Sorry I don't have time for a point by point on this one.  Thank you for your 
> interesting analysis.  From this I'm taking away a couple of things:
>
> 1)  I've noticed that ULE relied on PREEMPTION for a long time and lost the 
> NEEDRESCHED setting in cases where it doesn't set owepreempt. Restoring this 
> should improve some of the !PREEMPTION behavior and perhaps even 
> responsiveness in your nice tests.

Not the problem here, since I always use PREEMPTION for UP.

> 2)  I need to try running with hz = 100 and see if there are some scaling 
> problems.  I have heard reports that ULE scales better than 4BSD up to higher 
> hz values but I haven't investigated this much.  It should work with lower as 
> well.  Everything important to relative priorites and time slice allotment 
> runs off of stathz.

Again, not the problem here, since I tested with both hz = 100 and hz = 1000.
Also stathz = 100 and stathz = 128.  I haven't tried huge hz lately.

> 3)  The code which adjusts priorities for fork may need some more fine 
> tuning.  ULE agressively penalizes parents for forking expensive children. 
> This helps us learn that make should not create interactive children for 
> example.

I checked what 5.2 and ~5.2 are doing:
- fork: both just keep the parent's estcpu
- exit: 5.2 adds the child's estcpu to the parent's, same as in 4.x (?),
   so that the parent's priority wants to be exponential in the number of
   children even of the children don't do anything (except when the initial
   estcpu is 0, doubling it makes no difference).  So I don't understand
   why 5.2 seems to be able to repeatedly run acroread with no penalty.
- exit: ~5.2 sets the parent's estcpu to the minimum of the parent and
   the child estcpu, so that parents are only penalized for expensive
   children.

   Parents aren't penalized enough if they have lots of not-so-expensive
   children, but with the estcpu clamp in 5.2 there is probably nothing
   better.  However, ~5.2 doesn't clamp estcpu, so it could do better.
   ~5.2 has to do something better than add the estcpu's, since with
   true exponential growth the estcpu quicky reaches "infinity", and
   ~5.2 has dynamic scaling of estcpu to priorities which would give
   an infinite penalty for infinite estcpu, and with infinite estcpu
   taking infinitely long to decay, the infinitely penalized perant
   would never run.

   Note that 4BSD was broken by the infinities in most versions of
   FreeBSD-2 and FreeBSD-3.  The bug was introduced early in FreeBSD-2
   and later picked up by NetBSD.  There were mysterious hangs, and only
   the infinities being not quite infinite allowed parents to run again.
   estcpu is an int[32_t] variable, so repeated doublings starting at
   value 1 eventually overflow to -0x80000000 and then to 0.  Usually
   the priority becomes too high to run before 31 or 32 doublings cause
   overflow, so the overflow bugs don't help.  Instead, estcpu reaches
   a huge value which takes only seconds or minutes to decay to a value
   which gives a small enough priority to run, or the runq becomes empty
   except for processes with infinite estcpu so those processes can run.

> 4)  I don't think you're losing interrupts when you ctrl+c.  It's just taking 
> too long for the interrupted task to run.  ctrl+z takes effect immediately 
> when the signal is delivered.  This may be related to hz = 100 or running 
> without preemption.  I am not able to reproduce this problem with a standard 
> GENERIC kernel + ULE.

^C is printed by the tty driver, so tty (keyboard) interrupts certainly
work.  IPIs probably work too.  Oops, this is UP so there are no IPIs.
So the problem is just the ^C signal never being delivered to the
looping process, and it is surpising that ^Z is delivered.  The
interrupted task _is_ running (it is looping) and delivery would consist
mainly of killing it (I forget if the shell sees the ^C before the
child is killed).

This is not related to PREEMPTION since I always use PREEMPTION for
UP.  It seems to be related to hz = 100.  My kernel is similar to
GENERIC in this area except for HZ and leaving out ADAPTIVE_GIANT.

Bruce



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20071003123512.W14348>