From owner-freebsd-stable Thu Sep 21 7:27:56 2000 Delivered-To: freebsd-stable@freebsd.org Received: from mail.wolves.k12.mo.us (mail.wolves.k12.mo.us [207.160.214.1]) by hub.freebsd.org (Postfix) with ESMTP id 5CAD037B424 for ; Thu, 21 Sep 2000 07:27:44 -0700 (PDT) Received: from mail.wolves.k12.mo.us (cdillon@mail.wolves.k12.mo.us [207.160.214.1]) by mail.wolves.k12.mo.us (8.9.3/8.9.3) with ESMTP id JAA28232; Thu, 21 Sep 2000 09:27:41 -0500 (CDT) (envelope-from cdillon@wolves.k12.mo.us) Date: Thu, 21 Sep 2000 09:27:40 -0500 (CDT) From: Chris Dillon To: BSD Cc: stable@FreeBSD.ORG Subject: Re: Constant panics on 4.1-STABLE! In-Reply-To: Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Wed, 20 Sep 2000, BSD wrote: > On Wed, 20 Sep 2000, Chris Dillon wrote: > > I hate to tell you this, but this is most certainly a memory problem. > > Get some memory that has been tested and approved by your motherboard > > manufacturer for that board, and to be double-sure, make it ECC memory > > and then enable ECC in the motherboard's BIOS. If you STILL have > > problems after that, then you can start blaming the problem on > > something else. The ONLY other time I have had problems like this is > > when overclocking the processor. You aren't overclocking those > > processors are you? > > Are you saying all 3 sticks are bad at 133MHz (KA7) and one > or more is bad at 66MHz (BP6)? The likelihood of that is > extremely small. You said you were running a PIII on that BP6. Therefore, you would have to be running it at either 100MHz or 133MHz (which the BX doesn't officially support, but it works pretty well anyway). Also, it might not be that the memory is bad, but just out of spec for your systems. For example, if your system is expecting RAM that it can use CAS2 timings with but you have CAS3 RAM, that is going to cause problems. If this is the case, the EEPROMs on the sticks might be programmed with incorrect timing information. Tell your systems to ignore the EEPROM (SPD), and try manually setting the most conservative memory timings you can in each of your systems. > Also, a 512MB stick of RAM would cost me $1,600CAD. Sigh. > That's not going to happen anytime soon. Furthermore, I stress > tested each stick of RAM, with make -j64 buildworld. Nothing > failed there. The panics happenned when the system was just doing > its normal tasks. I'll try to post more detailed reports > (including crash dumps). A 'make world' is a pretty good way to stress-test things, but its far from perfect. I've had flaky systems survive multiple 'make world' sessions but still fail unexpectedly at other times. BTW, crash dumps will be meaningless if this really is a hardware problem. These kinds of problems are exactly why I spend the few extra bucks to buy ECC RAM for my important systems, even my workstation at home. Its worth it. If I have problems and I have ECC enabled, I can be fairly sure it isn't the RAM. Usually I just enable EC (Error-checking only) on my system at home, so if I start getting a lot of NMI panics I know that my memory is starting to flake out on me, at which point I can turn on ECC and start shopping for new memory. So far, I've never gotten one. This is probably due to me running PC133 memory on only a 66MHz bus. :-) -- Chris Dillon - cdillon@wolves.k12.mo.us - cdillon@inter-linc.net FreeBSD: The fastest and most stable server OS on the planet. For Intel x86 and Alpha architectures. ( http://www.freebsd.org ) To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message