From owner-freebsd-current@FreeBSD.ORG Thu Aug 26 14:53:28 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0655616A4CE for ; Thu, 26 Aug 2004 14:53:28 +0000 (GMT) Received: from ion.gank.org (ion.gank.org [69.55.238.164]) by mx1.FreeBSD.org (Postfix) with ESMTP id B9EC243D75 for ; Thu, 26 Aug 2004 14:53:27 +0000 (GMT) (envelope-from craig@xfoil.gank.org) Received: by ion.gank.org (mail, from userid 1001) id 780872B4D8; Thu, 26 Aug 2004 09:53:27 -0500 (CDT) Date: Thu, 26 Aug 2004 09:53:24 -0500 From: Craig Boston To: Doug White Message-ID: <20040826145324.GA40029@nowhere> Mail-Followup-To: Craig Boston , Doug White , freebsd-current@freebsd.org References: <20040822115345.Y94593@carver.gumbysoft.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20040822115345.Y94593@carver.gumbysoft.com> User-Agent: Mutt/1.4.2.1i cc: freebsd-current@freebsd.org Subject: Re: PLEASE TEST: IPI deadlock avoidance patch X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 26 Aug 2004 14:53:28 -0000 On Sun, Aug 22, 2004 at 12:05:39PM -0700, Doug White wrote: > If you have a reasonably fast i386 or amd64 multiprocessor and/or > hyperthreading machine and are experiencing reproducible hangs during -j > buildwords and other highly parallel operations, please try this patch: Just a follow-up to my off-list message and another data point, with this patch I no longer get deadlocks, however I now get random data corruption. The first time, it managed to complete the installworld and I didn't notice anything until trying to portupgrade Xorg failed. It turns out that the installed ranlib coredumped when run, but the one in /usr/obj worked fine. After that I was not able to successfully buildworld again. Every time one of the source files would inevitably end up corrupted with blocks of garbage, even after removing /usr/src entirely and re-cvsupping. Disabling the second processor or falling back to an older kernel (one from before the IPI hangs started) both fix the problem. Craig