Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 15 Oct 2004 11:41:22 -0600
From:      Robert Gray <bob@boulderlabs.com>
To:        freebsd-current@freebsd.org
Subject:   Re: 5.3-BETA7 install cd: kernel trap 12 with interrupts disabled (fwd)
Message-ID:  <200410151741.i9FHfMxQ036620@vec.boulderlabs.com>

next in thread | raw e-mail | index | archive | help


Yes, a built-in memory tester would be helpful.  However, my experience
is that many memory problems pass the x86memtest, but cause 
"seg faults" during buildworld, partly due to the extra load and heat
from disk activity.

I'm a firm believer that we should encourage our users to
buy ECC systems - motherboards that support ECC, and the
more expensive SIMMs/DIMMs that have redundant bits.

I'd rather spend a few more dollars to *know* when memory is
the problem during normal operations instead of trying to
locate the source of the problem after something bad happens.

I would be happy to lead a "documentation" project for FreeBSD
that talks about the issues and the current hardware options.
I've got the start of an article for the USENIX login magazine.
If people want to encourage me to publish, please send email
and suggest an appropriate mailing group or forum.

Thanks
-robert gray



Robert Watson <rwatson@freebsd.org> Fri, 15 Oct 2004 04:14:04 EDT says:
>On Fri, 15 Oct 2004, Guido van Rooij wrote:
>
>> It turns out this was a memory problem. Refitting the dimms was all
>> that was needed to 'solve' the issue. The weird thing is that
>> 1. The BIOS had memtesting enabled and did not complain
>> 2. W2K seemed to install ok (though I never let it hit the disk)
>> 3. FBSD would repeatedly crash at exactly the same spot.
>> 
>> I am beginning to wonder if we should have a boot option that enables a
>> thorough memtest from within the kernel...(e.g. boot -m). 
>
>I guess the old /dev/test_for_shoddy_workmanship driver is working :-).
>
>It's probably just bad luck -- BIOS memory testing is generally pretty
>poor, and it could just be FreeBSD stored your root vnode pointer (or some
>other critical thing) in a memory word that Win2K was using for an icon,
>so a single bit twiddle did pretty different things.  Glad it's fixed.  I
>occasionally wonder if we shouldn't build a memory tester into the FreeBSD
>boot loader to help diagnose this sort of stuff, though.  Not to run every
>boot, but as a diagnostics option.
>
>Robert N M Watson             FreeBSD Core Team, TrustedBSD Projects
>robert@fledge.watson.org      Principal Research Scientist, McAfee Research



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200410151741.i9FHfMxQ036620>