Date: Tue, 26 Jun 2007 08:55:20 +0200
From: Ed Schouten <ed@fxq.nl>
To: Suleiman Souhlal <ssouhlal@FreeBSD.org>
Cc: current@freebsd.org
Subject: Re: [PATCH] Machine Check Architecture on amd64
Message-ID: <20070626065520.GQ27942@hoeg.nl>
In-Reply-To: <46806B3E.2060701@FreeBSD.org>

* Suleiman Souhlal <ssouhlal@FreeBSD.org> wrote:
> Hi,
>
> I have a simple patch for amd64 that uses the Machine Check
> Architecture/Exceptions on most recent x86 CPUs to detect memory errors:
>
> http://people.freebsd.org/~ssouhlal/testing/mce-20070621.diff
>
> It will report uncorrected and corrected errors (the latter, only if sysctl
> machdep.mce.log_corrected=1).
> You can ask the kernel to panic if it gets an uncorrected error by setting
> machdep.mce.panic_on_uc=1.
> All this can be disabled by setting the machdep.mce.enable tunable to 0. I'm
> still not sure if I want this enabled by default, as I don't have any Intel
> machines to test this on, but I have tested it on Opteron (both corrected
> and uncorrected errors).
>
> I would appreciate it if someone would try this, especially if you have
> Intel machines with bad RAM.
>
> Comments are welcome.

| /*
|  * Uncorrected MCEs will generate a #MC, while corrected
|  * don't, so we have to periodically poll for them.
|  */

What about adding an option to only print uncorrected MCE's? That's the
most interesting data and we can get that without using a kthread, right?

Nice work! :-)
-- 
Ed Schouten <ed@fxq.nl>
WWW: http://g-rave.nl/
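
To illustrate the corrected/uncorrected distinction discussed above, here is a
minimal userland sketch in C. It is not taken from the patch. It assumes the
architectural MCi_STATUS layout (VAL in bit 63, UC in bit 61) and uses a plain
array in place of real per-bank MSR reads. The names mce_classify and
mce_count_loggable are hypothetical, and the log_corrected argument only
mirrors the role of the machdep.mce.log_corrected sysctl mentioned in the
quoted mail.

```c
#include <stdint.h>

/* Architectural MCi_STATUS bits (assumed layout: VAL = bit 63, UC = bit 61). */
#define MC_STATUS_VAL	(1ULL << 63)	/* register holds a valid error */
#define MC_STATUS_UC	(1ULL << 61)	/* error was not corrected */

enum mce_kind { MCE_NONE, MCE_CORRECTED, MCE_UNCORRECTED };

/* Classify one bank's status word (hypothetical helper, not from the patch). */
static enum mce_kind
mce_classify(uint64_t status)
{
	if ((status & MC_STATUS_VAL) == 0)
		return (MCE_NONE);
	return ((status & MC_STATUS_UC) ? MCE_UNCORRECTED : MCE_CORRECTED);
}

/*
 * Count the banks that would be reported under a given policy.
 * "banks" stands in for the values a kernel would read from the
 * MCi_STATUS MSRs; uncorrected errors are always counted, corrected
 * ones only when log_corrected is non-zero.
 */
static int
mce_count_loggable(const uint64_t *banks, int nbanks, int log_corrected)
{
	int i, n = 0;

	for (i = 0; i < nbanks; i++) {
		enum mce_kind kind = mce_classify(banks[i]);

		if (kind == MCE_UNCORRECTED ||
		    (kind == MCE_CORRECTED && log_corrected))
			n++;
	}
	return (n);
}
```

In a kernel, the uncorrected case would normally be seen from the #MC handler,
while the corrected case is only visible when something walks the banks
periodically, which is the polling the quoted comment refers to.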
