From owner-freebsd-sparc64@FreeBSD.ORG Tue Aug 9 09:53:47 2011 Return-Path: Delivered-To: freebsd-sparc64@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 705CE1065672 for ; Tue, 9 Aug 2011 09:53:47 +0000 (UTC) (envelope-from marius@alchemy.franken.de) Received: from alchemy.franken.de (alchemy.franken.de [194.94.249.214]) by mx1.freebsd.org (Postfix) with ESMTP id 014EC8FC1A for ; Tue, 9 Aug 2011 09:53:46 +0000 (UTC) Received: from alchemy.franken.de (localhost [127.0.0.1]) by alchemy.franken.de (8.14.4/8.14.4/ALCHEMY.FRANKEN.DE) with ESMTP id p799rgIx055023; Tue, 9 Aug 2011 11:53:42 +0200 (CEST) (envelope-from marius@alchemy.franken.de) Received: (from marius@localhost) by alchemy.franken.de (8.14.4/8.14.4/Submit) id p799rfDi055022; Tue, 9 Aug 2011 11:53:41 +0200 (CEST) (envelope-from marius) Date: Tue, 9 Aug 2011 11:53:41 +0200 From: Marius Strobl To: kib@freebsd.org Message-ID: <20110809095341.GP48988@alchemy.franken.de> References: <4E0F6B8D.8000500@rice.edu> <20110704214158.GX14797@alchemy.franken.de> <20110705160709.GA77843@alchemy.franken.de> <4E135420.4080201@rice.edu> <20110705190126.GE14797@alchemy.franken.de> <20110706042634.GP65891@pjdesk.au.alcatel-lucent.com> <20110706103910.GG14797@alchemy.franken.de> <20110706222851.GQ65891@pjdesk.au.alcatel-lucent.com> <20110806180537.GB48988@alchemy.franken.de> <20110809034004.GK6142@pjdesk.au.alcatel-lucent.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20110809034004.GK6142@pjdesk.au.alcatel-lucent.com> User-Agent: Mutt/1.4.2.3i Cc: Peter Jeremy , "freebsd-sparc64@freebsd.org" Subject: Re: 'make -j16 universe' gives SIReset X-BeenThere: freebsd-sparc64@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Porting FreeBSD to the Sparc List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Aug 2011 09:53:47 -0000 On Tue, Aug 09, 2011 at 01:40:04PM +1000, Peter Jeremy wrote: > On 2011-Aug-07 02:05:37 +0800, Marius Strobl wrote: > >Could you please give the below patch a try? This is just a shot in the > >dark though. > > No change. Running pho@'s stress test with INCARNATIONS=150 results > in 'thr1' wedging after about 2 hrs (2 thr1 processes in "urdlck" and > a pile of zombies). > Kib, do you happen to have an idea what could be going on causing no process being able to grab the lock in do_rw_rdlock() (see also 20110706042634.GP65891@pjdesk.au.alcatel-lucent.com)? The whole user mutex implementation (both kernel and userland side) seems weird to me as things like _has_waiters and m_owner sometimes are updated atomically and sometimes not, sometimes using memory barriers and sometimes not ... Marius