From: Peter Jeremy
To: Marius Strobl
Cc: "alc@freebsd.org", "freebsd-sparc64@freebsd.org", Alan Cox
Date: Wed, 6 Jul 2011 14:26:34 +1000
Subject: Re: 'make -j16 universe' gives SIReset
Message-ID: <20110706042634.GP65891@pjdesk.au.alcatel-lucent.com>
In-Reply-To: <20110705190126.GE14797@alchemy.franken.de>
List-Id: Porting FreeBSD to the Sparc

On 2011-Jul-06 03:01:26 +0800, Marius Strobl wrote:
>Peter, could you please test again with at least r223795 and the below
>patch but no additional change to pmap.c?

I've updated my V890 to r223802 with no patches other than the
vm_page.c one you posted and started running pho@'s stress test
with INCARNATIONS=150.

After about 1½ hr, it reported:
"witness_lock_list_get: witness exhausted"
I presume I need to increase LOCK_CHILDCOUNT to avoid this.
sysctl shows:
debug.witness.sleep_cnt: 132
debug.witness.spin_cnt: 0
debug.witness.free_cnt: 751
debug.witness.badstacks: Witness not running
debug.witness.fullgraph: Witness not running
debug.witness.skipspin: 1
debug.witness.trace: 1
debug.witness.kdb: 0
debug.witness.watch: -1

After about 2¼ hrs, 'thr1' stopped making progress: It has 77 zombies
and a further 5 processes stuck in "urdlck" (no other processes appear
stuck). "procstat -k" shows:
 8732 100898 thr1 - mi_switch sleepq_switch sleepq_catch_signals sleepq_wait_sig _sleep kern_wait wait4 syscallenter syscall
 8881 195433 thr1 - mi_switch sleepq_switch sleepq_catch_signals sleepq_wait_sig _sleep do_rw_rdlock __umtx_op_rw_rdlock _umtx_op syscallenter syscall

And DDB for one of the stuck processes shows:
db> trace 8881
Tracing pid 8881 tid 195433 td 0xfffff8b0a2e72880
mi_switch() at mi_switch+0x2a8
sleepq_switch() at sleepq_switch+0x1cc
sleepq_catch_signals() at sleepq_catch_signals+0x130
sleepq_wait_sig() at sleepq_wait_sig+0x8
_sleep() at _sleep+0x41c
do_rw_rdlock() at do_rw_rdlock+0x7e4
__umtx_op_rw_rdlock() at __umtx_op_rw_rdlock+0x1c
_umtx_op() at _umtx_op+0x3c
syscallenter() at syscallenter+0x270
syscall() at syscall+0x74
-- syscall (454, FreeBSD ELF64, _umtx_op) %o7=0x40479574 --
userland() at 0x4047957c
user trace: trap %o7=0x40479574
pc 0x4047957c, sp 0x7fdffffc561
pc 0x7fdffffd1c0, sp 0x40365a10
pc 0x90000000000125a, sp 0xac00002d11220000

Unfortunately, I'm somewhat at a loss as to how to investigate this
further. In particular, DDB doesn't show the lock details and kgdb
doesn't work. What is involved in getting kgdb to work on sparc64?

-- 
Peter Jeremy