From owner-freebsd-stable@FreeBSD.ORG Tue Dec 22 15:49:06 2009 Return-Path: Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7B692106568B for ; Tue, 22 Dec 2009 15:49:06 +0000 (UTC) (envelope-from flo@smeets.im) Received: from mail.solomo.de (mail.solomo.de [85.214.124.163]) by mx1.freebsd.org (Postfix) with ESMTP id 340D28FC2B for ; Tue, 22 Dec 2009 15:49:06 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mail.solomo.de (Postfix) with ESMTP id 155076228A; Tue, 22 Dec 2009 16:49:05 +0100 (CET) X-Virus-Scanned: amavisd-new at vistream.de Received: from mail.solomo.de ([127.0.0.1]) by localhost (db1.solomo.de [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 8oEqcecI0eVA; Tue, 22 Dec 2009 16:48:49 +0100 (CET) Received: from nibbler.vistream.local (relay3.vistream.de [87.139.10.28]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by mail.solomo.de (Postfix) with ESMTPSA id 992F1620FA; Tue, 22 Dec 2009 16:48:49 +0100 (CET) Message-ID: <4B30EA60.9060500@smeets.im> Date: Tue, 22 Dec 2009 16:48:48 +0100 From: Florian Smeets User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.1.5) Gecko/20091220 Shredder/3.0.1pre MIME-Version: 1.0 To: Pete French References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: freebsd-stable@FreeBSD.org Subject: Re: Disc lock up on 8.0-STABLE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 22 Dec 2009 15:49:06 -0000 On 12/22/09 3:19 PM, Pete French wrote: >> As the box is remote with only ssh access, it's a little difficult to >> debug this. During the weekend i waited till 3 o'clock with a top >> running, and saw that hundreds/thousands of /bin/sh processes were >> started. After that i commented out periodic daily in /etc/crontab, that >> "solved" the problem for me. > > Interesting - I have tried running the daily periodic processes by hand, but > that does not trigger it. Possibly it is something about being run from > cron maybe ? Same here! > >> I was not able to debug this any further yet, i have one other box with >> all UFS and a ZFS backup disc also running latest 8-STABLE but it does >> not exhibit the problem. > > Same here :-( My other box is identical hardware and almost config (UFS > and ZFS) but it works fine. > > I am not sure that ym debug kernel will help now - assuming this is the > same thing as you are seeing, it sounds like a process running amok rather > than any kernel bug. Was yoour system a fresh install, or an upgrade from > 7-STABLE ? I'm not 100% sure anymore, but i think i installed 7.2-RELEASE set hw.pci.mcfg=0 in loader.conf due to 8.0-BETAX not detecting the discs without it and then immediately upgrading to 8-STABLE. But the box has been running 8-STABLE since around end of August, and the first time it happened was about 10-14 days ago, after i upgraded world+kernel. When the problem occurs and you happen to have a shell everything you try to run is being rejected with an error no resources for fork or something. I had to enter reboot like 30 times before the box rebooted. So with a bit of patience one will be able to get aditional information out of the box. Cheers, Florian