Date: Wed, 23 Jul 2008 15:03:48 +0300 From: Kostik Belousov <kostikbel@gmail.com> To: jhb@freebsd.org Cc: Mikhail Teterin <mi+mill@aldan.algebra.com>, Kris Kennaway <kris@freebsd.org>, stable@freebsd.org Subject: Re: "sleeping without queue" ? Message-ID: <20080723120348.GJ17123@deviant.kiev.zoral.com.ua> In-Reply-To: <48863C3D.7090401@aldan.algebra.com> References: <48860725.9050808@aldan.algebra.com> <20080722161958.GA11139@eos.sc1.parodius.com> <48860E8C.6050400@FreeBSD.org> <48860EF6.2040007@aldan.algebra.com> <48861284.1050706@FreeBSD.org> <48861448.7020708@aldan.algebra.com> <20080722192155.GC17123@deviant.kiev.zoral.com.ua> <48863465.8080204@aldan.algebra.com> <20080722194114.GE17123@deviant.kiev.zoral.com.ua> <48863C3D.7090401@aldan.algebra.com>
next in thread | previous in thread | raw e-mail | index | archive | help
--xg083K01wEKuIc36 Content-Type: text/plain; charset=koi8-r Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Jul 22, 2008 at 03:59:57PM -0400, Mikhail Teterin wrote: > Kostik Belousov =CE=C1=D0=C9=D3=C1=D7(=CC=C1): > >On Tue, Jul 22, 2008 at 03:26:29PM -0400, Mikhail Teterin wrote: > >>Kostik Belousov =CE=C1=D0=C9=D3=C1=D7(=CC=C1): > >>>Did you switched to the process before doing backtrace (using the proc= =20 > >>><pid> > >>>command)? > >>Ok, thanks. Did not know about this one. Here: > >>... > >>(kgdb) proc 79759 > >>(kgdb) bt > >>#0 sched_switch (td=3D0xffffff01286dc000, newtd=3D0xffffff00010ce000,= =20 > >>flags=3D2) at /var/src/sys/kern/sched_4bsd.c:928 > >>#1 0x0000000000000000 in ?? () > >>#2 0xffffffff802f1108 in mi_switch (flags=3D678281216, newtd=3D0x2) at= =20 > >>/var/src/sys/kern/kern_synch.c:442 > >>#3 0xffffffff80318513 in sleepq_check_timeout () at=20 > >>/var/src/sys/kern/subr_sleepqueue.c:519 > >>#4 0xffffffff80318c85 in sleepq_timedwait (wchan=3D0xffffffff80688408)= at=20 > >>/var/src/sys/kern/subr_sleepqueue.c:597 > >>#5 0xffffffff802f16a2 in _sleep (ident=3D0xffffffff80688408, lock=3D0x= 0,=20 > >>priority=3D0, wmesg=3D0xffffffff804f3059 "vmo_de", timo=3D1) at=20 > >>/var/src/sys/kern/kern_synch.c:224 > >>#6 0xffffffff8043036b in vm_object_deallocate=20 > >>(object=3D0xffffff0053024a90) at /var/src/sys/vm/vm_object.c:509 > >From this frame, please, print the object (like p *object) and > >likewise, print the object that is at the head of the object->shadow_head > >list. > kgdb /usr/obj/var/src/sys/SILVER-SMP/kernel.debug /dev/mem > [GDB will not be able to debug user-mode threads:=20 > /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"] > GNU gdb 6.1.1 [FreeBSD] > Copyright 2004 Free Software Foundation, Inc. > GDB is free software, covered by the GNU General Public License, and you = are > welcome to change it and/or distribute copies of it under certain=20 > conditions. > Type "show copying" to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for detail= s. > This GDB was configured as "amd64-marcel-freebsd". > There is no member named pathname. > Reading symbols from /opt/modules/fuse.ko...done. > Loaded symbols for /opt/modules/fuse.ko > Reading symbols from /opt/modules/rtc.ko...done. > Loaded symbols for /opt/modules/rtc.ko > Reading symbols from /boot/kernel/snd_ich.ko...Reading symbols from=20 > /boot/kernel/snd_ich.ko.symbols...done. > done. > Loaded symbols for /boot/kernel/snd_ich.ko > Reading symbols from /boot/kernel/msdosfs.ko...Reading symbols from=20 > /boot/kernel/msdosfs.ko.symbols...done. > done. > Loaded symbols for /boot/kernel/msdosfs.ko > #0 0x0000000000000000 in ?? () > (kgdb) frame 6 > Error accessing memory address 0x0: Bad address. > (kgdb) pid 79759 > Undefined command: "pid". Try "help". > (kgdb) proc 79759 > (kgdb) frame 6 > #6 0xffffffff8043036b in vm_object_deallocate=20 > (object=3D0xffffff0053024a90) at /var/src/sys/vm/vm_object.c:509 > 509 pause("vmo_de", 1); > (kgdb) p *object > $1 =3D {mtx =3D {lock_object =3D {lo_name =3D 0xffffffff804f21c4 "vm obje= ct",=20 > lo_type =3D 0xffffffff804f3018 "standard object", lo_flags =3D 21168128,= =20 > lo_witness_data =3D { > lod_list =3D {stqe_next =3D 0x0}, lod_witness =3D 0x0}}, mtx_lock = =3D 4,=20 > mtx_recurse =3D 0}, object_list =3D {tqe_next =3D 0xffffff0005018a90, > tqe_prev =3D 0xffffff00539a6850}, shadow_head =3D {lh_first =3D=20 > 0xffffff005d3afa90}, shadow_list =3D {le_next =3D 0x0, le_prev =3D=20 > 0xffffff005d2cd048}, memq =3D { > tqh_first =3D 0xffffff007eb9fa58, tqh_last =3D 0xffffff007f864820}, ro= ot=20 > =3D 0xffffff007ee14d38, size =3D 427, generation =3D 66, ref_count =3D 2,= =20 > shadow_count =3D 1, > type =3D 0 '\0', flags =3D 256, pg_color =3D 0, paging_in_progress =3D 0= ,=20 > resident_page_count =3D 44, backing_object =3D 0x0, backing_object_offset= =3D=20 > 0, pager_object_list =3D { > tqe_next =3D 0x0, tqe_prev =3D 0x0}, cache =3D 0x0, handle =3D 0x0, un= _pager=20 > =3D {vnp =3D {vnp_size =3D 576646}, devp =3D {devp_pglist =3D {tqh_first = =3D 0x8cc86, > tqh_last =3D 0x0}}, swp =3D {swp_bcount =3D 576646}}} > (kgdb) p (object->shadow_head) > $2 =3D {lh_first =3D 0xffffff005d3afa90} > (kgdb) p *object->shadow_head.lh_first > $3 =3D {mtx =3D {lock_object =3D {lo_name =3D 0xffffffff804f21c4 "vm obje= ct",=20 > lo_type =3D 0xffffffff804f3018 "standard object", lo_flags =3D 21168128,= =20 > lo_witness_data =3D { > lod_list =3D {stqe_next =3D 0x0}, lod_witness =3D 0x0}}, mtx_lock = =3D 4,=20 > mtx_recurse =3D 0}, object_list =3D {tqe_next =3D 0xffffff0066c32340, > tqe_prev =3D 0xffffff012f673ac0}, shadow_head =3D {lh_first =3D 0x0},= =20 > shadow_list =3D {le_next =3D 0x0, le_prev =3D 0xffffff0053024ad0}, memq = =3D { > tqh_first =3D 0xffffff007779f9a0, tqh_last =3D 0xffffff0077c04140}, ro= ot=20 > =3D 0xffffff0077c04130, size =3D 387, generation =3D 3, ref_count =3D 1,= =20 > shadow_count =3D 0, > type =3D 0 '\0', flags =3D 8452, pg_color =3D 0, paging_in_progress =3D = 0,=20 > resident_page_count =3D 2, backing_object =3D 0xffffff0053024a90,=20 > backing_object_offset =3D 163840, > pager_object_list =3D {tqe_next =3D 0x0, tqe_prev =3D 0x0}, cache =3D 0x= 0,=20 > handle =3D 0x0, un_pager =3D {vnp =3D {vnp_size =3D 365278}, devp =3D {de= vp_pglist =3D { > tqh_first =3D 0x592de, tqh_last =3D 0x0}}, swp =3D {swp_bcount =3D= 365278}}} >=20 >=20 > > > >Another question is what scheduler do you use ? > options SCHED_4BSD # 4BSD scheduler > options PREEMPTION # Enable kernel thread preemption The state of the both object being destroyed and the object that is shadowed looks right for me. Moreover, the shadowed object is not locked, value of the mtx_lock is 4. It seems as if the thread missed the wakeup owed by pause. John, could it be that the following commit is supposed to fix the issue ? r179974 | jhb | 2008-06-24 22:36:33 +0300 (Tue, 24 Jun 2008) | 3 lines MFC: Change the roundrobin implementation in the 4BSD scheduler to trigger a userland preemption directly from hardclock() via sched_clock() >=20 > >>>Also, show the output of ps axl <pid>. > >> UID PID PPID CPU PRI NI VSZ RSS MWCHAN STAT TT TIME COMM= AND > >> 0 79759 79758 0 96 0 0 16 - DE+ p6 0:00,00=20 > >>/bin/tcsh -fc=20 > >>/meow/ports/editors/openoffice.org-3/work/BEB300_m3/solver/300/unxfbsdx= .pro/bin/ma > > > >It makes sense to show the whole ps axl output. > See http://aldan.algebra.com/~mi/tmp/ps-axl.txt -- I edited it for=20 > privacy a little bit, but process-states are intact. > The java-processes in the linuxf have remained unkillable for weeks now= =20 > -- I even forgot about them. But those are linuxulator problems, whereas= =20 > the tcsh is native... It seems that pid 63930 is problematic too ? --xg083K01wEKuIc36 Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (FreeBSD) iEYEARECAAYFAkiHHiMACgkQC3+MBN1Mb4hubwCdEaKZyQqJuk28P0Tmfyl44Kpw kM0An0iMAiDKRH9uj8wQtwgbFnl+0RgR =DMrw -----END PGP SIGNATURE----- --xg083K01wEKuIc36--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080723120348.GJ17123>