From owner-freebsd-stable@FreeBSD.ORG Fri Oct 10 13:00:36 2008 Return-Path: Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6A7F0106568A; Fri, 10 Oct 2008 13:00:36 +0000 (UTC) (envelope-from smithi@nimnet.asn.au) Received: from sola.nimnet.asn.au (paqi.nimnet.asn.au [220.233.188.227]) by mx1.freebsd.org (Postfix) with ESMTP id A555F8FC13; Fri, 10 Oct 2008 13:00:35 +0000 (UTC) (envelope-from smithi@nimnet.asn.au) Received: from localhost (localhost [127.0.0.1]) by sola.nimnet.asn.au (8.14.2/8.14.2) with ESMTP id m9AD0QDa005997; Sat, 11 Oct 2008 00:00:26 +1100 (EST) (envelope-from smithi@nimnet.asn.au) Date: Sat, 11 Oct 2008 00:00:25 +1100 (EST) From: Ian Smith To: Jeremy Chadwick In-Reply-To: <20081009170727.GA9542@icarus.home.lan> Message-ID: <20081010201210.G16723@sola.nimnet.asn.au> References: <20081008183652.GA83351@icarus.home.lan> <501797.33750.qm@web39105.mail.mud.yahoo.com> <20081009051214.GA94941@icarus.home.lan> <20081010023938.R16723@sola.nimnet.asn.au> <20081009170727.GA9542@icarus.home.lan> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Cc: bf , freebsd-stable@FreeBSD.org, Charles Sprickman Subject: Re: Recent Problems with RELENG_7 i386 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 10 Oct 2008 13:00:36 -0000 On Thu, 9 Oct 2008, Jeremy Chadwick wrote: > On Fri, Oct 10, 2008 at 03:51:02AM +1100, Ian Smith wrote: [..] > > | CPU: AMD Athlon(tm) Processor (906.35-MHz 686-class CPU) > > | Timecounter "ACPI-safe" frequency 3579545 Hz quality 850 > > | inittimecounter(0)... Timecounters tick every 10.000 msec > > > > ie HZ=100, as mentioned, and using ACPI-safe as later confirmed. So > > it's either a different kernel or bf updated kern.hz from loader.conf? > > Yep -- his original mail had loader.conf shown, with this in it (near the > bottom): > > kern.hz=100 Missed that, thanks. > > > Well, I believe HZ was increased from 100 to 1000 long ago (RELENG_6?) > > > as a default. I'm really not sure of the implications of decreasing it, > > > besides having less granularity for some things (the only things I know > > > of would be something pertaining to firewalls, I just can't remember > > > what. My brain is full. :-) ) > > > > You need a day off :) But yes, RELENG_5 still had HZ=100 default, long > > after the 'average' CPU clock frequency was 10 or more times faster than > > the 166MHz Pentiums and such (mostly then on only 100Mbps ethernet) that > > were comfortable at 100Hz slicing. 1000Hz was a big shift to catch up. > > > > In a day or so playing around with it years ago, I found 200-250Hz good > > for 300MHz, 500Hz a bit much, 1000Hz way too busy, and find my 1133MHz > > P3-M happy enough at 1000Hz, though I've done no specific tests on it. bf: to answer your later question; this was entirely empirical, a wet finger in the wind. The machine in question, idle with X+KDE, 5 or 6 konsoles, 5 or 6 idle kwrite sessions, xmms, mozilla, several servers (httpd, mysqld, sendmail, named ..) with all of its 160MB RAM and about as much swap in use (but statically, not paging in or out), shows about 9-10% CPU doing nothing, with HZ=200. From my notes then, with HZ=500, about an _extra_ 7% CPU shown in top. With HZ=1000, an _extra_ 20% odd. Such things are kinda hard to notice with a fast CPU and enough RAM :) OTOH, little difference betwee HZ=200 and 100. So, 200 is a sweet spot, on a 300MHz laptop that only halves its CPU frequency when on battery. Charles: your general impression of context switching is about right. Ignoring interrupt context for the moment, at every tick the scheduler has to figure out the highest priority ready-to-run process to switch to, update the CPU context (registers, memory selectors and such), then run that process for (the remainder of) one tick time. If it takes X CPU cycles to switch context for a tick interval of Y ms, then the switching overhead is proportional to X/Y. If we now drop the CPU frequency from say 1GHz to say 100MHz (eg w/powerd) then those X cycles take ~10 times longer to execute, but the tick interval Y remains constant, so the switching overhead becomes proportional to 10X/Y, ie a greater proportion of the tick's time is taken up by context switching. And then of course the selected process can only execute something less than a tenth as many instructions before being preempted again. > > Some people had perhaps similar clock issues when their fast processors > > were throttling/stepping down to very low speeds (100, even 75MHz) while > > still slicing at 1000Hz, which I didn't find too surprising. Limiting > > minimum CPU freq to 300Mz or more seemed to solve many such issues, but > > I haven't your perseverance for digging up the relevant threads .. > > > > Even in 5.5-S (/sys/conf/NOTES and /sys/i386/conf/NOTES) HZ=1000 or 2000 > > was suggested for DEVICE_POLLING (which bf included in config, though > > maybe it's not enabled?) and HZ=1000 or more was recommended when using > > DUMMYNET with ipfw - to provide smoother queue dispatching, I gather. > > > > Bottom line, IMHO, bf should probably run the default 1000Hz, 500 at > > least, on an Athlon 900. With powerd, maybe set min. freq >= 150MHz? > > Wow, this is fantastic information. You've just educated me a great bit > about the history and use of HZ. I've always had a "general" idea of > its importance and key role, but I was never fully aware of the history. Well hardly fantastic, nor authoritative, neither am I aware of any more than I've gleaned by flying by the seat of my pants with several small machines, occasionally trying to figure out how the ACPI / cpufreq code works; I haven't yet been game to delve into the scheduler(s) code. The last time I actually wrote anything run by a scheduler was in '72 in IBM S/370 DOS assembler to charge CPU time used to various departments, unless you count some '90s MSDOS/DesqView 'TSR' gadgets in Turbo Pascal, so I'm a very long way off the pace; still the principles are the same. Now I hope someone who groks FreeBSD scheduling will be irritated enough by my simplistic generalisations to say something more meaningful :) > P.S. -- I need more like 6 months off. I've never taken an official > (read: real) vacation my entire life. Maybe some day I'll get to travel > to Seoul and visit Pyun Yong-Hyeon and drink lots of soju. :-) No soju here, but I'm sure we could find you some well-respected local brew or other, so sing out if you happen to be passing northern NSW .. cheers, Ian