Date: Wed, 3 Oct 2007 13:11:01 +1000 (EST) From: Bruce Evans <brde@optusnet.com.au> To: Jeff Roberson <jroberson@chesapeake.net> Cc: cvs-all@FreeBSD.org, src-committers@FreeBSD.org, cvs-src@FreeBSD.org, Jeff Roberson <jeff@FreeBSD.org>, Garance A Drosehn <gad@FreeBSD.org>, Ben Kaduk <minimarmot@gmail.com>, Bruce Evans <brde@optusnet.com.au> Subject: Re: cvs commit: src/sys/kern sched_ule.c Message-ID: <20071003123512.W14348@delplex.bde.org> In-Reply-To: <20071002133334.W594@10.0.0.1> References: <20071001145257.EC9FC4500F@ptavv.es.net> <20071002133623.X40629@besplex.bde.org> <20071001232743.Q539@10.0.0.1> <20071002213829.F12287@delplex.bde.org> <20071002133334.W594@10.0.0.1>
next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, 2 Oct 2007, Jeff Roberson wrote: > On Tue, 2 Oct 2007, Bruce Evans wrote: > Sorry I don't have time for a point by point on this one. Thank you for your > interesting analysis. From this I'm taking away a couple of things: > > 1) I've noticed that ULE relied on PREEMPTION for a long time and lost the > NEEDRESCHED setting in cases where it doesn't set owepreempt. Restoring this > should improve some of the !PREEMPTION behavior and perhaps even > responsiveness in your nice tests. Not the problem here, since I always use PREEMPTION for UP. > 2) I need to try running with hz = 100 and see if there are some scaling > problems. I have heard reports that ULE scales better than 4BSD up to higher > hz values but I haven't investigated this much. It should work with lower as > well. Everything important to relative priorites and time slice allotment > runs off of stathz. Again, not the problem here, since I tested with both hz = 100 and hz = 1000. Also stathz = 100 and stathz = 128. I haven't tried huge hz lately. > 3) The code which adjusts priorities for fork may need some more fine > tuning. ULE agressively penalizes parents for forking expensive children. > This helps us learn that make should not create interactive children for > example. I checked what 5.2 and ~5.2 are doing: - fork: both just keep the parent's estcpu - exit: 5.2 adds the child's estcpu to the parent's, same as in 4.x (?), so that the parent's priority wants to be exponential in the number of children even of the children don't do anything (except when the initial estcpu is 0, doubling it makes no difference). So I don't understand why 5.2 seems to be able to repeatedly run acroread with no penalty. - exit: ~5.2 sets the parent's estcpu to the minimum of the parent and the child estcpu, so that parents are only penalized for expensive children. Parents aren't penalized enough if they have lots of not-so-expensive children, but with the estcpu clamp in 5.2 there is probably nothing better. However, ~5.2 doesn't clamp estcpu, so it could do better. ~5.2 has to do something better than add the estcpu's, since with true exponential growth the estcpu quicky reaches "infinity", and ~5.2 has dynamic scaling of estcpu to priorities which would give an infinite penalty for infinite estcpu, and with infinite estcpu taking infinitely long to decay, the infinitely penalized perant would never run. Note that 4BSD was broken by the infinities in most versions of FreeBSD-2 and FreeBSD-3. The bug was introduced early in FreeBSD-2 and later picked up by NetBSD. There were mysterious hangs, and only the infinities being not quite infinite allowed parents to run again. estcpu is an int[32_t] variable, so repeated doublings starting at value 1 eventually overflow to -0x80000000 and then to 0. Usually the priority becomes too high to run before 31 or 32 doublings cause overflow, so the overflow bugs don't help. Instead, estcpu reaches a huge value which takes only seconds or minutes to decay to a value which gives a small enough priority to run, or the runq becomes empty except for processes with infinite estcpu so those processes can run. > 4) I don't think you're losing interrupts when you ctrl+c. It's just taking > too long for the interrupted task to run. ctrl+z takes effect immediately > when the signal is delivered. This may be related to hz = 100 or running > without preemption. I am not able to reproduce this problem with a standard > GENERIC kernel + ULE. ^C is printed by the tty driver, so tty (keyboard) interrupts certainly work. IPIs probably work too. Oops, this is UP so there are no IPIs. So the problem is just the ^C signal never being delivered to the looping process, and it is surpising that ^Z is delivered. The interrupted task _is_ running (it is looping) and delivery would consist mainly of killing it (I forget if the shell sees the ^C before the child is killed). This is not related to PREEMPTION since I always use PREEMPTION for UP. It seems to be related to hz = 100. My kernel is similar to GENERIC in this area except for HZ and leaving out ADAPTIVE_GIANT. Bruce
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20071003123512.W14348>