From owner-freebsd-current Fri Jan 19 0:31:49 2001 Delivered-To: freebsd-current@freebsd.org Received: from meow.osd.bsdi.com (meow.osd.bsdi.com [204.216.28.88]) by hub.freebsd.org (Postfix) with ESMTP id B324137B698; Fri, 19 Jan 2001 00:31:27 -0800 (PST) Received: from laptop.baldwin.cx (john@jhb-laptop.osd.bsdi.com [204.216.28.241]) by meow.osd.bsdi.com (8.11.1/8.9.3) with ESMTP id f0J8RHL75590; Fri, 19 Jan 2001 00:27:17 -0800 (PST) (envelope-from jhb@FreeBSD.org) Message-ID: X-Mailer: XFMail 1.4.0 on FreeBSD X-Priority: 3 (Normal) Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 8bit MIME-Version: 1.0 In-Reply-To: <200101190735.IAA60706@freebsd.dk> Date: Fri, 19 Jan 2001 00:29:34 -0800 (PST) From: John Baldwin To: Soren Schmidt Subject: Re: HEADS-UP: await/asleep removal imminent Cc: rjesup@wgate.com, arch@FreeBSD.org, current@FreeBSD.org, (Alfred Perlstein) , (Peter Wemm) Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On 19-Jan-01 Soren Schmidt wrote: > It seems Peter Wemm wrote: >> Soren Schmidt wrote: >> > It seems Peter Wemm wrote: >> > > >> > > Soren, can you retest a buildworld with the currently committed kernel >> > > with no other changes? Let us see if the forward_signal() stuff is the >> > > culprit, and if not, try adding just the i386/i386/machdep.c patch to >> > > HLT >> > > the idle CPU. (if *that* makes a difference then we have got trouble!) >> > >> > It seems that the HLT thing is exactly the patch that makes it work! >> > >> > So we have trouble, I said that all along .5 :) >> >> So, the difference between -current working for you or not is this: > > [diff snipped] > >> ?? No other changes? > > I need the FULL change to machdep.c and apparently the chagne to subr_prf.c > is needed too, at least it hasnt broke yet with that in place too, not > that I can see why it matters though. > > The new diff to machdep.c that jhb made yesterday also causes trouble. > >> This is bad news.. This means we have races somewhere, or some other >> badness. That my other diff didn't work is more bad news. > That is what I've been harping about for months... > > What strikes me here as a very serious problem is that the SMPng developers > has told me over and over that it works fine for them on -current, but > when we get to the matter after months of frustration, it suddenly appears > that they are not running the same -current as the rest of us. This is _not_ true. My quad xeon test box runs a pure source tree, and has not had a single problem building many worlds and releases since the fix to atomic_store_rel_ptr(). Also, I rebuilt a fresh kernel from a freshly checked out tree without any patches today on mutex (the machine I built kernel.sos on) and that kernel has built worlds today without _any_ lockups. In fact, smp_hlt.patch is only something I've played with a bit for the last few weeks, and it doesn't really work anyways, as you can end up with CPU's staying in the HLT state for long periods of time even though there are runnable jobs, which is why I haven't bothered committing this. Can you post a dmesg of your box? -- John Baldwin -- http://www.FreeBSD.org/~jhb/ PGP Key: http://www.baldwin.cx/~john/pgpkey.asc "Power Users Use the Power to Serve!" - http://www.FreeBSD.org/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message