Date:      Thu, 7 Jul 2011 08:28:51 +1000
From:      Peter Jeremy <peter.jeremy@alcatel-lucent.com>
To:        Marius Strobl <marius@alchemy.franken.de>
Cc:        "alc@freebsd.org" <alc@freebsd.org>, "freebsd-sparc64@freebsd.org" <freebsd-sparc64@freebsd.org>, Alan Cox <alc@rice.edu>
Subject:   Re: 'make -j16 universe' gives SIReset
Message-ID:  <20110706222851.GQ65891@pjdesk.au.alcatel-lucent.com>
In-Reply-To: <20110706103910.GG14797@alchemy.franken.de>
References:  <20110629223008.GL14797@alchemy.franken.de> <20110630221752.GG65891@pjdesk.au.alcatel-lucent.com> <20110702002325.GS14797@alchemy.franken.de> <4E0F6B8D.8000500@rice.edu> <20110704214158.GX14797@alchemy.franken.de> <20110705160709.GA77843@alchemy.franken.de> <4E135420.4080201@rice.edu> <20110705190126.GE14797@alchemy.franken.de> <20110706042634.GP65891@pjdesk.au.alcatel-lucent.com> <20110706103910.GG14797@alchemy.franken.de>

On 2011-Jul-06 18:39:10 +0800, Marius Strobl <marius@alchemy.franken.de> wrote:
>On Wed, Jul 06, 2011 at 02:26:34PM +1000, Peter Jeremy wrote:
>> And DDB for one of the stuck processes shows
>> db> trace 8881
>> Tracing pid 8881 tid 195433 td 0xfffff8b0a2e72880
>> mi_switch() at mi_switch+0x2a8
>> sleepq_switch() at sleepq_switch+0x1cc
>> sleepq_catch_signals() at sleepq_catch_signals+0x130
>> sleepq_wait_sig() at sleepq_wait_sig+0x8
>> _sleep() at _sleep+0x41c
>> do_rw_rdlock() at do_rw_rdlock+0x7e4
>> __umtx_op_rw_rdlock() at __umtx_op_rw_rdlock+0x1c
>> _umtx_op() at _umtx_op+0x3c
>> syscallenter() at syscallenter+0x270
>> syscall() at syscall+0x74
>> -- syscall (454, FreeBSD ELF64, _umtx_op) %o7=0x40479574 --
>> userland() at 0x4047957c
>> user trace: trap %o7=0x40479574
>> pc 0x4047957c, sp 0x7fdffffc561
>> pc 0x7fdffffd1c0, sp 0x40365a10
>> pc 0x90000000000125a, sp 0xac00002d11220000
>
>What line does mi_switch+0x2a8 translate to?

0xc0503628 <mi_switch+680>:     call  0xc0528ba0 <sched_switch>
    448         sched_switch(td, newtd, flags);
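
As a quick sanity check on the lookup above, the DDB offset (0x2a8) and the decimal offset gdb prints (+680) should be the same value, and subtracting it from the call address gives the start of mi_switch. The snippet below is only a minimal sketch of that arithmetic; the helper name symbol_base is hypothetical, and the only inputs are the numbers already shown in the trace and the disassembly line.

```python
# Sketch only: checks that DDB's "mi_switch+0x2a8" and gdb's "<mi_switch+680>"
# refer to the same instruction. symbol_base is a hypothetical helper name.

def symbol_base(insn_addr: int, offset: int) -> int:
    """Return the start address of a symbol given an address inside it."""
    return insn_addr - offset

ddb_offset = 0x2a8        # offset from the DDB trace: mi_switch+0x2a8
gdb_offset = 680          # offset from the disassembly: <mi_switch+680>
call_addr = 0xc0503628    # address of the "call sched_switch" instruction

base = symbol_base(call_addr, ddb_offset)

print(ddb_offset == gdb_offset)  # are the two offsets the same?
print(hex(base))                 # implied start address of mi_switch
```

If the two offsets agree, the frame in the trace is the call to sched_switch() at line 448, i.e. the thread is simply switched out while sleeping.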

The system is still running, so I think the bigger issue is why
none of the processes can grab the mutex.

>sparc64 package of gdb53, which still has the '-k' option:
>http://people.freebsd.org/~marius/gdb-5.3_1%2c1.tbz

Unfortunately, it doesn't like me:

# gdb53 -k /usr/obj/usr/src/sys/GENERIC/kernel.debug /dev/mem
GNU gdb 5.3 (FreeBSD)
Copyright 2002 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "sparc64-unknown-freebsd9.0"...
panic messages:
---
dmesg: kvm_nlist: No such file or directory
---
---Can't read userspace from dump, or kernel process---

(kgdb) where
---Can't read userspace from dump, or kernel process---

(kgdb) disas mi_switch
Segmentation fault (core dumped)

-- 
Peter Jeremy
