From owner-freebsd-smp Thu May 2 10:32:47 2002 Delivered-To: freebsd-smp@freebsd.org Received: from hawk.mail.pas.earthlink.net (hawk.mail.pas.earthlink.net [207.217.120.22]) by hub.freebsd.org (Postfix) with ESMTP id 5DD0737B41A; Thu, 2 May 2002 10:32:25 -0700 (PDT) Received: from pool0542.cvx22-bradley.dialup.earthlink.net ([209.179.200.32] helo=mindspring.com) by hawk.mail.pas.earthlink.net with esmtp (Exim 3.33 #2) id 173KRM-00028R-00; Thu, 02 May 2002 10:32:24 -0700 Message-ID: <3CD1780B.2CC97DDB@mindspring.com> Date: Thu, 02 May 2002 10:31:55 -0700 From: Terry Lambert X-Mailer: Mozilla 4.7 [en]C-CCK-MCD {Sony} (Win98; U) X-Accept-Language: en MIME-Version: 1.0 To: John Baldwin Cc: freebsd-smp@FreeBSD.org, Andrew Gallatin , Jonathan Mini Subject: Re: hlt when idle? References: Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-freebsd-smp@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.org John Baldwin wrote: > > It definitely does, provided you are guaranteed that the clock > > interrupt handler can always execute in less time than the > > interval between clock interrupts. Then you don't have to > > contend on the global handler lock to ensure it only runs once. > > Well, with our current interrupts the foo_process() (should be foo_thread()) > versions still need the global sched_lock to modify some flag fields and what > not. (Some of this could be eliminated by splitting td_flags into two parts, > one that needs the lock and one that doesn't.) Thus, if they all fire at once, > even if it's broadcast they all contend with each other, wherease if they are > broadcast but staggered such as on the alpha they don't in theory contend with > each other at all. (Provided the handler is fast enough, which it should be.) Ah. I omitted a piece because it was obvious to me, when in retrospect it's really non-obvious. You reprogram the top 32 bits of the ICR in the APIC for the interrupt each time as part of the interrupt handler, so it's actually unicast to a single processor to get the "round robin" effect ("bother that CPU instead of me next time"). IMO, going round-robin on this sort of thing is a bad idea; L1 and L2 cache busting are a biggie for me -- but then I want to scale to an infinite number of processors. 8-). If you wanted to broadcast a clock interrupt, which makes more sense for me, then the system "clock interrupt" processing would be handled on a single CPU. Alternately, you could rewrite the upper 32 bits, rewrite the lower 32 bits (which would trigger the IPI to be sent), and then rewrite the upper 32 bits again, to target the next guy in line. I think the confusion is because it's useful for the clock to be broadcast for one set of reasons, and it's useful to hit on only one processor for global handler processing for another set of reasons. I think a lot of the desire to hit a CPU over the head to have it look at the scheduler queue comes because the scheduler queues aren't per-CPU, and thus CPU migration is not a really rare event, like it should be (in order to get CPU affinity). In the migration case, an IPI targetting only the migrated-to CPU, only if it's HLT'ed, is probably acceptable overhead. Actually, here's an incredibly good reference for Intel APIC and SMP processing, from Purdue University: Multiprocessing Support for Hobby OSes Explained Ben L. Titzer http://expert.cc.purdue.edu/~titzer/redpants/mpx86.html It goes into the details of ICR programming, which is where the black magic is for broadcast vs. non-broadcast, etc.. -- Terry To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-smp" in the body of the message