Date:      Mon, 10 Dec 2012 20:45:45 +0200
From:      Konstantin Belousov <kostikbel@gmail.com>
To:        Rick Macklem <rmacklem@uoguelph.ca>
Cc:        Tim Kientzle <kientzle@freebsd.org>, freebsd-current Current <freebsd-current@freebsd.org>
Subject:   Re: r244036 kernel hangs under load.
Message-ID:  <20121210184545.GS3013@kib.kiev.ua>
In-Reply-To: <735026206.1290394.1355164701856.JavaMail.root@erie.cs.uoguelph.ca>
References:  <CAJ-VmomBhnTvtfjK4z=QTXTpCJk8pEgzmV3aHFxOxiLgD+LTpQ@mail.gmail.com> <735026206.1290394.1355164701856.JavaMail.root@erie.cs.uoguelph.ca>


On Mon, Dec 10, 2012 at 01:38:21PM -0500, Rick Macklem wrote:
> Adrian Chadd wrote:
> > .. what was the previous kernel version?
> >
> Hopefully Tim has it narrowed down more, but I don't see
> the hangs on a Sept. 7 kernel from head and I do see them
> on a Dec. 3 kernel from head. (Don't know the exact rNNNNNN.)
>
> It seems to predate my commit (r244008), which was my first
> concern.
>
> I use old single core i386 hardware and can fairly reliably
> reproduce it by doing a kernel build and a "svn checkout"
> concurrently. No NFS activity. These are running on a local
> disk (UFS/FFS). (The kernel I reproduce it on is built via
> GENERIC for i386. If you want me to start a "binary search"
> for which rNNNNNN, I can do that, but it will take a while.:-)
>
> I can get out into DDB, but I'll admit I don't know enough
> about it to know where to look;-)
> Here's some lines from "db> ps", in case they give someone
> useful information. (I can leave this box sitting in DB for
> the rest of to-day, in case someone can suggest what I should
> look for on it.)
>
> Just snippets...
>    Ss pause     adjkerntz
>    DL sdflush  [softdepflush]
>    RL            [syncer]
>    DL vlruwt   [vnlru]
>    DL psleep   [bufdaemon]
>    RL          [pagezero]
>    DL psleep   [vmdaemon]
>    DL psleep   [pagedaemon]
>    DL ccb_scan [xpt_thrd]
>    DL waiting_ [sctp_iterator]
>    DL ctl_work [ctl_thrd]
>    DL cooling  [acpi_cooling0]
>    DL tzpoll   [acpi_thermal]
>    DL (threaded) [usb]
>    ...
>    DL -        [yarrow]
>    DL (threaded) [geom]
>    D  -         [g_down]
>    D  -         [g_up]
>    D  -         [g_event]
>    RL   (threaded) [intr]
>    I            [irq15: ata1]
>    ...
>    Run CPU0    [swi6: Giant taskq]
> --> does this one indicate the CPU is actually running this?
>    (after a db> cont, wait a while <ctrl><alt><esc> db> ps
>     it is still the same)
>    I            [swi4: clock]
>    I            [swi1: netisr 0]
>    I            [swi3: vm]
>    RL           [idle: cpu0]
>    SLs wait     [init]
>    DL  audit_wo [audit]
>    DLs (threaded) [kernel]
>    D  -         [deadlkres]
>    ...
>    D   sched    [swapper]
>
> I have no idea if this "ps" output helps, unless it indicates
> that it is looping on the Giant taskq?
Might be. You could do 'bt <pid>' for the process to see where it loops.
Another good set of hints is at
http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug-deadlocks.html
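
If it does come to the "binary search" for the rNNNNNN mentioned in the quoted text, the idea can be sketched in Python as below. This is only an illustration under stated assumptions: the function name, the hangs() callback and the revision numbers in the demo are hypothetical, and it assumes that every revision after the first bad one also hangs.

```python
def first_bad_revision(good, bad, hangs):
    """Return the smallest revision in (good, bad] for which hangs(rev) is True.

    Assumes `good` does not hang, `bad` does, and every revision after
    the first bad one hangs as well.
    """
    while bad - good > 1:
        mid = (good + bad) // 2
        if hangs(mid):
            bad = mid    # hang reproduced: the culprit is mid or earlier
        else:
            good = mid   # no hang: the culprit is later than mid
    return bad


if __name__ == "__main__":
    # Hypothetical numbers for illustration only. The thread does not give
    # the revision of the Sept. 7 kernel, and the real culprit is unknown.
    GOOD, BAD, CULPRIT = 240000, 244036, 243000
    tested = []

    def hangs(rev):
        # Stand-in for: build and boot `rev`, run a kernel build and an
        # "svn checkout" concurrently, and watch for the hang.
        tested.append(rev)
        return rev >= CULPRIT

    print(first_bad_revision(GOOD, BAD, hangs), "found after", len(tested), "builds")
```

Each call to hangs() stands for one full build-and-reproduce cycle, so the number of cycles grows with the logarithm of the revision range rather than with its size.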

>
> As I said, I can leave it in "db" for to-day, if anyone wants
> me to do anything in the debugger and I can probably reproduce
> it, if someone wants stuff tried later.
>=20
> rick
>
>
> >
> >
> > adrian
> >
> >
> > On 9 December 2012 22:08, Tim Kientzle <kientzle@freebsd.org> wrote:
> > > I haven't found any useful clues yet, but thought I'd ask if anyone else
> > > was seeing hangs in a recent kernel.
> > >
> > > I just upgraded to r244036 using a straight GENERIC i386 kernel.
> > > (Straight buildworld/buildkernel, no local changes, /etc/src.conf doesn't
> > > exist, /etc/make.conf just has PERL_VERSION defined.)
> > >
> > > When I try to cross build an ARM world on the resulting system,
> > > the entire system hangs hard after about 30 minutes: No network,
> > > no keyboard response, no nothing.
> > >
> > > Don't know if it's relevant, but the system is using NFS pretty
> > > heavily (Parallels VM mounting NFS from Mac OS 10.7 host.)
> > >
> > > I'll try to get some more details ...
> > >
> > > Tim
> > >
> > > _______________________________________________
> > > freebsd-current@freebsd.org mailing list
> > > http://lists.freebsd.org/mailman/listinfo/freebsd-current
> > > To unsubscribe, send any mail to
> > > "freebsd-current-unsubscribe@freebsd.org"



