From owner-freebsd-stable@FreeBSD.ORG Mon Apr 24 19:17:50 2006 Return-Path: X-Original-To: stable@freebsd.org Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id E76CB16A400; Mon, 24 Apr 2006 19:17:49 +0000 (UTC) (envelope-from hlh@restart.be) Received: from oola.is.scarlet.be (oola.is.scarlet.be [193.74.71.23]) by mx1.FreeBSD.org (Postfix) with ESMTP id D0B1643D46; Mon, 24 Apr 2006 19:17:46 +0000 (GMT) (envelope-from hlh@restart.be) Received: from ([62.235.0.66]) by oola.is.scarlet.be with ESMTP id k3OJHfj11704; Mon, 24 Apr 2006 21:17:44 +0200 Received: from [192.168.24.1] (norquay.restart.bel [192.168.24.1]) (authenticated bits=0) by restart.be (8.13.6/8.13.6) with ESMTP id k3OJHTwi030359; Mon, 24 Apr 2006 21:17:30 +0200 (CEST) (envelope-from hlh@restart.be) DomainKey-Signature: a=rsa-sha1; s=norquay; d=restart.be; c=nofws; q=dns; h=message-id:date:from:organization:user-agent:mime-version:to:cc: subject:references:in-reply-to:content-type: content-transfer-encoding:received-spf:x-spam-status:x-scanned-by; b=WKHqjJCc26kU9aNomwiq/y5h9LOmrQKh/MWlwqhlM5Y9nKvElPZZ4MJHOwwedy/Z7 uszzPqyOj/ofmXlzaTFdQ== Message-ID: <444D2449.4080102@restart.be> Date: Mon, 24 Apr 2006 21:17:29 +0200 From: Henri Hennebert Organization: RestartSoft User-Agent: Thunderbird 1.5 (X11/20060401) MIME-Version: 1.0 To: stable@freebsd.org References: <20060420093147.7j9stegaiocc0ok4@webmail.restart.be> <20060424142455.G44099@fledge.watson.org> In-Reply-To: <20060424142455.G44099@fledge.watson.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Received-SPF: pass (restart.be: 192.168.24.1 is authenticated by a trusted mechanism) X-Spam-Status: -1.311 (ALL_TRUSTED,AWL) X-Scanned-By: MIMEDefang 2.56 on 192.168.24.1 X-DCC-scarlet.be-Metrics: oola 2020; Body=2 Fuz1=2 Fuz2=2 Cc: Robert Watson Subject: Re: 6.1RC system nearly freezing X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 24 Apr 2006 19:17:50 -0000 Robert Watson wrote: > > On Thu, 20 Apr 2006, Henri Hennebert wrote: > >> I upgrade a web, squid, mail server (under SMP with 2 pentium III) to >> 6.1-RC (Apr 9 2006) and encounter 2 `freezing'. >> >> The system is still responding to http requests but I can't login on >> the console or through ssh -- no shell prompt. No more mail delivery. >> >> I break to KDB and found more then 1000 sendmail processes waiting for >> devfs... >> >> call boot(0) can't complete the shutdown process. >> >> I join the KDB informations. Let me know if more informations are needed. > > Are you running with WITNESS and INVARIANTS enabled? If not, could you > do so and see if the problem is reproduceable, and if so, whether or not > WITNESS (and friends) generate any warnings? > > It looks like something has leaked a lock, resulting in deadlock. The > question is, however, which lock, and where. WITNESS may be able to > provide some insight into this; if you could run "show alllocks" with > WITNESS in place, that would be helpful also. I add WITNESS and INVARIANTS to my config and the next freeze/boot will have it [see PS]. This server is in production and running with a newer kernel for more than 5 days now. The diff (from apr 13) with the previous kernel [the one with the last freeze] are: Connected to cvsup.ciger.be Updating collection src-all/cvs Edit src/etc/sendmail/freebsd.mc Edit src/etc/sendmail/freebsd.submit.mc Edit src/lib/libc/gen/vis.3 Edit src/release/doc/en_US.ISO8859-1/hardware/common/dev.sgml Edit src/release/doc/share/misc/dev.archlist.txt Edit src/sbin/geom/core/geom.c Edit src/share/man/man4/Makefile Checkout src/share/man/man4/bce.4 Edit src/share/man/man4/miibus.4 Edit src/sys/amd64/conf/GENERIC Edit src/sys/conf/files Edit src/sys/conf/options Checkout src/sys/dev/bce/if_bce.c Checkout src/sys/dev/bce/if_bcefw.h Checkout src/sys/dev/bce/if_bcereg.h Edit src/sys/dev/ipw/if_ipw.c Edit src/sys/dev/ipw/if_ipwvar.h Edit src/sys/dev/mii/brgphy.c Edit src/sys/dev/mii/miidevs Edit src/sys/i386/conf/GENERIC Edit src/sys/modules/Makefile Checkout src/sys/modules/bce/Makefile Edit src/usr.sbin/wpa/wpa_supplicant/Packet32.c Finished successfully Maybe something in this changes make things better ? Anyway, I will reboot this night (with WITNESS and friends) but maybe have to reverse it if the performances are too bad :-/ Thanks for your concern, Henri P.S. ARGH... buildkernel failed with: cc -c -O -pipe -march=pentium3 -Wall -Wredundant-decls -Wnested-externs -Wstrict-prototypes -Wmissing-prototypes -Wpointer-arith -Winline -Wcast-qual -fformat-extensions -std=c99 -g -nostdinc -I- -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq -I/usr/src/sys/contrib/ipfilter -I/usr/src/sys/contrib/pf -I/usr/src/sys/contrib/dev/ath -I/usr/src/sys/contrib/dev/ath/freebsd -I/usr/src/sys/contrib/ngatm -I/usr/src/sys/dev/twa -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common -finline-limit=8000 --param inline-unit-growth=100 --param large-function-growth=1000 -mno-align-long-strings -mpreferred-stack-boundary=2 -mno-mmx -mno-3dnow -mno-sse -mno-sse2 -ffreestanding -Werror /usr/src/sys/dev/ata/atapi-cd.c /usr/src/sys/dev/ata/atapi-cd.c: In function `acd_geom_attach': /usr/src/sys/dev/ata/atapi-cd.c:179: warning: implicit declaration of function `_sx_assert' /usr/src/sys/dev/ata/atapi-cd.c:179: warning: nested extern declaration of `_sx_assert' *** Error code 1 Stop in /usr/obj/usr/src/sys/MORZINE. *** Error code 1 Stop in /usr/src. *** Error code 1 I run cvsup and retry... I keep you posted. > > Robert N M Watson