From: John Baldwin
To: Frode Nordahl
Cc: freebsd-stable@freebsd.org
Subject: Re: RELENG_6 Livelock
Date: Mon, 18 Sep 2006 16:14:51 -0400
Message-Id: <200609181614.52260.jhb@freebsd.org>
References: <200609162242.56480.jhb@freebsd.org>

On Sunday 17 September 2006 02:05, Frode Nordahl wrote:
> On 17. sep. 2006, at 04.42, John Baldwin wrote:
>
> > On Saturday 16 September 2006 16:55, Frode Nordahl wrote:
> >> On 16. sep. 2006, at 22.22, Frode Nordahl wrote:
> >>
> >>> On 16. sep. 2006, at 22.09, John Baldwin wrote:
> >>>
> >>>> On Saturday 16 September 2006 07:02, Frode Nordahl wrote:
> >>>>> Hello,
> >>>>>
> >>>>> FreeBSD localhost.localdomain 6.2-PRERELEASE FreeBSD 6.2-PRERELEASE
> >>>>> #1: Wed Sep 13 00:10:04 CEST 2006
> >>>>> frode@localhost.localdomain:/usr/obj/usr/src/sys/PT i386
> >>>>>
> >>>>> After running some stress tests for 3 days, I wanted to remove
> >>>>> some large directories.
> >>>>
> >>>> Do you have a coredump? I assume you do from your debug output.
> >>>> Can you download http://www.FreeBSD.org/~jhb/gdb/gdb6, fire up
> >>>> kgdb, and once in kgdb, do 'source /path/to/gdb6' and then run
> >>>> 'ps' and reply with the output from that?
> >>>
> >>> I am sorry, I have not. I tried to call doadump, but there was no
> >>> dumpdevice configured :-(
> >>>
> >>> Somehow I have convinced myself that this was turned on by default
> >>> now, so I have not enabled it explicitly in rc.conf. Is there any
> >>> way to tell DDB what dumpdevice to use directly?
> >>>
> >>> I will configure a dumpdevice and try really hard to make it happen
> >>> again.
> >>
> >> I was able to reproduce the livelock again, and this time I had the
> >> system armed with dumpon :-)
> >>
> >> Here is the output you requested:
> >> (kgdb) ps
> >>   pid  ppid  pgrp   uid  state  wmesg   wchan       cmd
> >>  2535  2499  2535     0  R+     CPU 0               rm
> >>  2534  2499  2534     0  L+     *Giant  0xc6704580  rm
> >>  2533  2499  2533     0  L+     *Giant  0xc6704580  rm
> >>  2532  2499  2532     0  R+                         rm
> >>  2531  2499  2531     0  L+     *Giant  0xc6704580  rm
> >>  2499  2496  2499     0  Ss+    ttyin   0xc655d810  bash
> >>  2496   784  2496     0  Rs                         sshd
> >
> > Ok, do 'lockchain 2534' in kgdb (with gdb6 sourced) and let me see the
> > output from that.
>
> (kgdb) lockchain 2534
> thread 100038 (pid 2534, rm) blocked on lock 0xc09e6800 "Giant"
> thread 100091 (pid 2535, rm) running on CPU 0

Ok, do 'proc 2535' followed by 'where'

-- 
John Baldwin