From owner-freebsd-stable@FreeBSD.ORG Mon Jun 26 15:12:31 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id DDB8D16A47E; Mon, 26 Jun 2006 15:12:31 +0000 (UTC) (envelope-from scrappy@hub.org) Received: from hub.org (hub.org [200.46.204.220]) by mx1.FreeBSD.org (Postfix) with ESMTP id EFB5E44A15; Mon, 26 Jun 2006 14:06:52 +0000 (GMT) (envelope-from scrappy@hub.org) Received: from localhost (wm.hub.org [200.46.204.128]) by hub.org (Postfix) with ESMTP id 97175290C1F; Mon, 26 Jun 2006 11:06:46 -0300 (ADT) Received: from hub.org ([200.46.204.220]) by localhost (mx1.hub.org [200.46.204.128]) (amavisd-new, port 10024) with ESMTP id 39814-03; Mon, 26 Jun 2006 14:06:50 +0000 (UTC) Received: from ganymede.hub.org (blk-7-151-244.eastlink.ca [71.7.151.244]) by hub.org (Postfix) with ESMTP id 2C678290C1E; Mon, 26 Jun 2006 11:06:46 -0300 (ADT) Received: by ganymede.hub.org (Postfix, from userid 1000) id 73D175C279; Mon, 26 Jun 2006 11:06:57 -0300 (ADT) Received: from localhost (localhost [127.0.0.1]) by ganymede.hub.org (Postfix) with ESMTP id 7242D5C257; Mon, 26 Jun 2006 11:06:57 -0300 (ADT) Date: Mon, 26 Jun 2006 11:06:57 -0300 (ADT) From: "Marc G. Fournier" To: Robert Watson In-Reply-To: <20060626140333.M38418@fledge.watson.org> Message-ID: <20060626110636.I1114@ganymede.hub.org> References: <20060626100949.G24406@fledge.watson.org> <20060626081029.L1114@ganymede.hub.org> <20060626140333.M38418@fledge.watson.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed Cc: freebsd-acpi@freebsd.org, freebsd-stable@freeBSD.org, Pete French Subject: Re: FreeBSD 6.x CVSUP today crashes with zero load ... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 26 Jun 2006 15:12:32 -0000 On Mon, 26 Jun 2006, Robert Watson wrote: > > On Mon, 26 Jun 2006, Marc G. Fournier wrote: > >>> I'm also running 6.x on several dual-PIII without problems. An issue >>> local to Marc's setup is definitely indicated. Given the failure mode, I >>> would be worried about a potential hardware issue, although subtle >>> hardware and subtle system software problems are sometimes difficult to >>> distinguish. >> >> Well, I've been trying to do it 'the hardway' ... went back to the original >> kernel, and am slowly upgrading forward ... I'm currently running a June >> 15th kernel with none of the problems that I was seeing before ... I'm just >> in the process of running my third 'make -j3 buildworld' on this kernel, >> and its clean ... going to go forward to June 22nd next, see if that too is >> clean *cross fingers* > > I think this is a useful activity, especially if you've already run extensive > memory testing on the box. If you haven't yet done that, I encourage you to > take a break from buildworld's and make sure the memory tests pass. I spent > several months on and off trying to track down a bug a few years ago, which > turned out to be a one bit error in memory on the box. It would appear and > disappear based on how the memory page was used -- for debugging kernels, it > consistently got mapped to padding in the kernel's bss. For non-debugging > kernels, it typically manifested in other usable kernel momory. Changes in > kernel versions would move the bit around kernel memory and user memory, > resulting in hard to debug failure modes. I wish I'd run the memory test > earlier, but the lesson is clear! Is there something that I can run *from* FreeBSD, remotely, to do this? ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email . scrappy@hub.org MSN . scrappy@hub.org Yahoo . yscrappy Skype: hub.org ICQ . 7615664