From owner-freebsd-current@FreeBSD.ORG Sun Apr 5 10:13:14 2009 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E6E201065670 for ; Sun, 5 Apr 2009 10:13:14 +0000 (UTC) (envelope-from ohartman@mail.zedat.fu-berlin.de) Received: from outpost1.zedat.fu-berlin.de (outpost1.zedat.fu-berlin.de [130.133.4.66]) by mx1.freebsd.org (Postfix) with ESMTP id 71CCF8FC18 for ; Sun, 5 Apr 2009 10:13:14 +0000 (UTC) (envelope-from ohartman@mail.zedat.fu-berlin.de) Received: from inpost2.zedat.fu-berlin.de ([130.133.4.69]) by outpost1.zedat.fu-berlin.de (Exim 4.69) with esmtp (envelope-from ) id <1LqPLk-00053g-9t>; Sun, 05 Apr 2009 12:13:12 +0200 Received: from e178001204.adsl.alicedsl.de ([85.178.1.204] helo=thor.walstatt.dyndns.org) by inpost2.zedat.fu-berlin.de (Exim 4.69) with esmtpsa (envelope-from ) id <1LqPLk-0007xw-6Q>; Sun, 05 Apr 2009 12:13:12 +0200 Message-ID: <49D88435.30900@mail.zedat.fu-berlin.de> Date: Sun, 05 Apr 2009 12:13:09 +0200 From: "O. Hartmann" User-Agent: Thunderbird 2.0.0.21 (X11/20090404) MIME-Version: 1.0 To: Boris Samorodov References: <200904041050.28932.thierry.herbelot@free.fr> <200904041151.18209.thierry.herbelot@free.fr> <10969763@bb.ipt.ru> In-Reply-To: <10969763@bb.ipt.ru> X-Enigmail-Version: 0.95.7 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Originating-IP: 85.178.1.204 X-Mailman-Approved-At: Sun, 05 Apr 2009 11:27:54 +0000 Cc: Thierry Herbelot , freebsd-current@freebsd.org, peter@FreeBSD.org Subject: Re: Stuck kernel while cleaning up the object tree X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 05 Apr 2009 10:13:15 -0000 Boris Samorodov wrote: > On Sat, 4 Apr 2009 11:51:17 +0200 Thierry Herbelot wrote: > =20 >> Le Saturday 04 April 2009, Thierry Herbelot a =C3=A9crit : >> =20 >>> Hello, >>> >>> On recent -current machines, I have seen a common pattern, with the m= achine >>> being frozen (still responsive to pings, though) in the initial phase= s of >>> the buildworld procedure : >>> >>> example freeze : >>> -------------------------------------------------------------- >>> >>> =20 >>>>>> stage 2.1: cleaning up the object tree >>>>>> =20 >>> -------------------------------------------------------------- >>> cd /usr/src; MAKEOBJDIRPREFIX=3D/usr/obj MACHINE_ARCH=3Di386 MACHIN= E=3Di386 >>> CPUTYPE=3D GROFF_BIN_PATH=3D/usr/obj/usr/src/tmp/legacy/usr/bin >>> GROFF_FONT_PATH=3D/usr/obj/usr/src/tmp/legacy/usr/share/groff_font >>> GROFF_TMAC_PATH=3D/usr/obj/usr/src/tmp/legacy/usr/share/tmac >>> _SHLIBDIRPREFIX=3D/usr/obj/usr/src/tmp VERSION=3D"FreeBSD 8.0-CURREN= T i386 >>> 800074" INSTALL=3D"sh /usr/src/tools/install.sh" >>> PATH=3D/usr/obj/usr/src/tmp/legacy/usr/sbin:/usr/obj/usr/src/tmp/lega= cy/usr/b >>> in:/usr/obj/usr/src/tmp/legacy/usr/games:/usr/obj/usr/src/tmp/usr/sbi= n:/usr/ >>> obj/usr/src/tmp/usr/bin:/usr/obj/usr/src/tmp/usr/games:/sbin:/bin:/us= r/sbin: >>> /usr/bin NO_CTF=3D1 make -f Makefile.inc1 DESTDIR=3D/usr/obj/usr/src/= tmp >>> par-cleandir =3D=3D=3D> share/info (cleandir) >>> =3D=3D=3D> lib (cleandir) >>> =3D=3D=3D> lib/csu/i386-elf (cleandir) >>> [type ^T in the console] >>> load: 0.00 cmd: sh 24587 [*Name Cache] 0.01u 0.00s 0% 1584k >>> >>> The other machines also froze while "cleaning up the object tree". >>> >>> The machines are configured with serial consoles : I have no kernel s= tack >>> backtrace to aid in pinpointing the cause of this freeze. >>> >>> Cheers >>> >>> TfH >>> =20 > > =20 >> With a bit more investigation : >> =20 > > =20 >> on a separate ssh session, top is still live and shows processes stuck= as : >> 24523 root 1 76 0 1888K 764K *Name 1 0:00 0.00= % make >> =20 > > =20 >> on still another machine, running Witnesses (all other machines run wi= th a=20 >> lean GENERIC, with most of the debuging features commented out) : >> System call __getcwd returning with the following locks held: >> shared rw Name Cache (Name Cache) r =3D 0 (0xc0ee7e1c) locked=20 >> @ /usr/src/sys/kerne/vfs_cache.c:974 >> =20 > > This is definitely related to: > SVN rev 190655 on 2009-04-02 21:16:20Z by peter > (peter@ CCed) > > =20 >> panic: witness_warn >> cpuid =3D 0 >> KDB: enter: panic >> =20 > > > WBR > =20 Is there a fix in sight soon? I do have this error/fault/lockup now on ALL FreeBSD 8.0-CURRENT/amd64 machines I have. Regards, Oliver