From: Peter Jeremy
To: Mark Kirkwood
Cc: Joseph Peralta, freebsd-current@freebsd.org
Date: Sun, 9 Oct 2005 12:07:25 +1000
Subject: Re: errors during buildworld
Message-ID: <20051009020725.GC223@cirb503493.alcatel.com.au>
In-Reply-To: <43486C48.3090404@paradise.net.nz>
References: <20050921174155.GB80991@xor.obsecurity.org> <43486C48.3090404@paradise.net.nz>
List-Id: Discussions about the use of FreeBSD-current

On Sun, 2005-Oct-09 14:03:04 +1300, Mark Kirkwood wrote:
>- Tested cpus with cpuburn (2xburnP6 for 1 hour).
>- Tested memory with memtest-86 (about 6 hours).

memtest-86 and cpuburn can demonstrate that there is a fault but not
that there isn't.  Pattern-sensitive memory errors, in particular, are
very unlikely to be detected.  Also, the above tests are focussed on
specific subsystems and would not pick up a problem that was triggered
by interactions between different subsystems (eg there is no disk or
PCI I/O in the above tests).

>The system passes these tests easily, so I am finding it hard to see
>hardware problems (Indeed the system is well ventilated and cooled

Overheating isn't the only possible hardware problem.  Marginal PSUs
or marginal electrolytic capacitors on the motherboard are also quite
common, especially if the hardware is getting old.  Such capacitors
have a finite life, and this is shortened by heat and high ripple
currents, both of which are common in computers.

>I removed /usr/obj/usr/src/* and tried buildworld again, and it went
>through that time (am running the updated system now)...

The same source failing once and then building cleanly suggests a
hardware problem to me.

>So it's a bit confusing, could we be seeing a real gcc bug?

gcc is deterministic, so a real gcc bug is more likely to manifest as
a consistent failure at the same point.  A problem that moves around
and isn't always there is more indicative of a hardware issue.

-- 
Peter Jeremy
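
The reasoning in the last paragraph can be put as a rough rule of thumb
over repeated builds of the same source.  The sketch below is only an
illustration and not part of any FreeBSD tool: the function name, the
input format (one entry per run, naming where that run failed, or None
if it succeeded) and the result strings are all assumptions made for
the example.

```python
from typing import Optional, Sequence


def classify_runs(failure_points: Sequence[Optional[str]]) -> str:
    """Classify repeated identical builds by where (if anywhere) they failed.

    Hypothetical helper: each entry names the file or step at which one
    run failed, or is None if that run completed.
    """
    failures = [p for p in failure_points if p is not None]
    if not failures:
        # A clean result shows only that no fault was triggered,
        # not that no fault exists.
        return "no failure observed"
    every_run_failed = len(failures) == len(failure_points)
    same_place = len(set(failures)) == 1
    if every_run_failed and same_place:
        # Deterministic failure: more like a compiler or source bug.
        return "consistent"
    # Failure comes and goes, or moves around: more like hardware.
    return "intermittent"
```

For example, runs recorded as ["a.c", None, "b.c"] come out as
"intermittent", which is the pattern that points towards hardware
rather than gcc.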