From owner-freebsd-current@FreeBSD.ORG Mon Dec 10 18:45:54 2012 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id D5ED3A29; Mon, 10 Dec 2012 18:45:54 +0000 (UTC) (envelope-from kostikbel@gmail.com) Received: from kib.kiev.ua (kib.kiev.ua [IPv6:2001:470:d5e7:1::1]) by mx1.freebsd.org (Postfix) with ESMTP id 432818FC12; Mon, 10 Dec 2012 18:45:54 +0000 (UTC) Received: from tom.home (kostik@localhost [127.0.0.1]) by kib.kiev.ua (8.14.5/8.14.5) with ESMTP id qBAIjjVC076350; Mon, 10 Dec 2012 20:45:45 +0200 (EET) (envelope-from kostikbel@gmail.com) DKIM-Filter: OpenDKIM Filter v2.7.1 kib.kiev.ua qBAIjjVC076350 Received: (from kostik@localhost) by tom.home (8.14.5/8.14.5/Submit) id qBAIjjuR076349; Mon, 10 Dec 2012 20:45:45 +0200 (EET) (envelope-from kostikbel@gmail.com) X-Authentication-Warning: tom.home: kostik set sender to kostikbel@gmail.com using -f Date: Mon, 10 Dec 2012 20:45:45 +0200 From: Konstantin Belousov To: Rick Macklem Subject: Re: r244036 kernel hangs under load. Message-ID: <20121210184545.GS3013@kib.kiev.ua> References: <735026206.1290394.1355164701856.JavaMail.root@erie.cs.uoguelph.ca> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="04FybxbhfaAKXQYm" Content-Disposition: inline In-Reply-To: <735026206.1290394.1355164701856.JavaMail.root@erie.cs.uoguelph.ca> User-Agent: Mutt/1.5.21 (2010-09-15) X-Spam-Status: No, score=-2.0 required=5.0 tests=ALL_TRUSTED,BAYES_00, DKIM_ADSP_CUSTOM_MED,FREEMAIL_FROM,NML_ADSP_CUSTOM_MED autolearn=no version=3.3.2 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on tom.home Cc: Tim Kientzle , freebsd-current Current X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 10 Dec 2012 18:45:54 -0000 --04FybxbhfaAKXQYm Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Mon, Dec 10, 2012 at 01:38:21PM -0500, Rick Macklem wrote: > Adrian Chadd wrote: > > .. what was the previous kernel version? > >=20 > Hopefully Tim has it narrowed down more, but I don't see > the hangs on a Sept. 7 kernel from head and I do see them > on a Dec. 3 kernel from head. (Don't know the eact rNNNNNN.) >=20 > It seems to predate my commit (r244008), which was my first > concern. >=20 > I use old single core i386 hardware and can fairly reliably > reproduce it by doing a kernel build and a "svn checkout" > concurrently. No NFS activity. These are running on a local > disk (UFS/FFS). (The kernel I reproduce it on is built via > GENERIC for i386. If you want me to start a "binary search" > for which rNNNNNN, I can do that, but it will take a while.:-) >=20 > I can get out into DDB, but I'll admit I don't know enough > about it to know where to look;-) > Here's some lines from "db> ps", in case they give someone > useful information. (I can leave this box sitting in DB for > the rest of to-day, in case someone can suggest what I should > look for on it.) >=20 > Just snippets... > Ss pause adjkerntz > DL sdflush [sofdepflush] > RL [syncer] > DL vlruwt [vnlru] > DL psleep [bufdaemon] > RL [pagezero] > DL psleep [vmdaemon] > DL psleep [pagedaemon] > DL ccb_scan [xpt_thrd] > DL waiting_ [sctp_iterator] > DL ctl_work [ctl_thrd] > DL cooling [acpi_cooling0] > DL tzpoll [acpi_thermal] > DL (threaded) [usb] > ... > DL - [yarrow] > DL (threaded) [geom] > D - [g_down] > D - [g_up] > D - [g_event] > RL (threaded) [intr] > I [irq15: ata1] > ... > Run CPU0 [swi6: Giant taskq] > --> does this one indicate the CPU is actually running this? > (after a db> cont, wait a while db> ps > it is still the same) > I [swi4: clock] > I [swi1: netisr 0] > I [swi3: vm] > RL [idle: cpu0] > SLs wait [init] > DL audit_wo [audit] > DLs (threaded) [kernel] > D - [deadlkres] > ... > D sched [swapper] >=20 > I have no idea if this "ps" output helps, unless it indicates > that it is looping on the Giant taskq? Might be. You could do 'bt ' for the process to see where it loops. Another good set of hints is at http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kernel= debug-deadlocks.html >=20 > As I said, I can leave it in "db" for to-day, if anyone wants > me to do anything in the debugger and I can probably reproduce > it, if someone wants stuff tried later. >=20 > rick >=20 >=20 > >=20 > >=20 > > adrian > >=20 > >=20 > > On 9 December 2012 22:08, Tim Kientzle wrote: > > > I haven't found any useful clues yet, but thought I'd ask if anyone > > > else > > > was seeing hangs in a recent kernel. > > > > > > I just upgraded to r244036 using a straight GENERIC i386 kernel. > > > (Straight buildworld/buildkernel, no local changes, /etc/src.conf > > > doesn't > > > exist, /etc/make.conf just has PERL_VERSION defined.) > > > > > > When I try to cross build an ARM world on the resulting system, > > > the entire system hangs hard after about 30 minutes: No network, > > > no keyboard response, no nothing. > > > > > > Don't know if it's relevant, but the system is using NFS pretty > > > heavily (Parallels VM mounting NFS from Mac OS 10.7 host.) > > > > > > I'll try to get some more details ... > > > > > > Tim > > > > > > _______________________________________________ > > > freebsd-current@freebsd.org mailing list > > > http://lists.freebsd.org/mailman/listinfo/freebsd-current > > > To unsubscribe, send any mail to > > > "freebsd-current-unsubscribe@freebsd.org" > > _______________________________________________ > > freebsd-current@freebsd.org mailing list > > http://lists.freebsd.org/mailman/listinfo/freebsd-current > > To unsubscribe, send any mail to > > "freebsd-current-unsubscribe@freebsd.org" > _______________________________________________ > freebsd-current@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-current > To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org" --04FybxbhfaAKXQYm Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.19 (FreeBSD) iQIcBAEBAgAGBQJQxi3ZAAoJEJDCuSvBvK1BTpgP/2t5z4TU//kLmdOplsnr3Nnn Hf29ENQuwj6tMX8KcsFNnHMehbWkGpSzHaSgFW5Ud4AsiYid4rS+l6YZGt5YWxmw U0gX4wa2vmDG72FhhTj9IVQm2O9rBRYM4rdEEc+Gr/CSRIe0Rp2Ia9xC6bHkw017 lZgsepbMMFiLDQkf+1TbwSI8bm/NN2uvByBTzgkg9kTdE8XiNiK8+neeTdhk+QWn PMxeQ7XbNHmFMRbPAZyehoFnx0+/WTTmbT74Ryb2gzDIxUwLXjaZoo3TaMHP1G9a w+7YXrtUU/0mWdGBSUBIFMa3dNnNnD7gY/edkrFqLarDqwZwJ70iwUfvwVU9i8r4 BmNlQhl+E27IlQf2hbdm9JvSpR8pFHFr/vNHPBj5LOtzEfNVAWnNTPHDgD9JxFQ/ eQxceX1ADWcJa6QR6id1q7weTH3g2kjFOQhifMzKdT/nILYFyybMS2aHfKVvTFvP xb6bpUsKE4dvasGRKX4SsPGEwezAoXGobc65YPtklHnfh5k8we89PC8dbr0fqe/Q HR20bmb/PABEy3XRzYYRfq+XJy4vPaleJYnuGTpYpo0pwURNQh+w62dDgXjM6HaS zSFJP5JzUR8dDDhDnzMAcPjblXqV8h8Hb4HGlQ7o7LpF3vFOPXOEFLE3lC2RBWG0 u69ZHe44vz9NtZYbIbBT =W9J8 -----END PGP SIGNATURE----- --04FybxbhfaAKXQYm--