Date: Tue, 24 Aug 2010 23:03:26 +0300 From: Andriy Gapon <avg@icyb.net.ua> To: Artem Belevich <fbsdlist@src.cx> Cc: Ronald Klop <ronald-freebsd8@thuis.klop.ws>, freebsd-stable@freebsd.org Subject: Re: kernel MCA messages Message-ID: <4C74258E.2060403@icyb.net.ua> In-Reply-To: <AANLkTik8AcWPw_Vc5Jjn97AoXyp5=w0SNmJXCVXAJCHn@mail.gmail.com> References: <4C71CC62.6060803@langille.org> <4C71D756.5080205@langille.org> <4C7218D6.6090408@icyb.net.ua> <201008230820.35260.jhb@freebsd.org> <op.vhxiafp38527sy@212-123-145-58.ip.telfort.nl> <4C737F85.5010804@icyb.net.ua> <AANLkTik8AcWPw_Vc5Jjn97AoXyp5=w0SNmJXCVXAJCHn@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
on 24/08/2010 22:51 Artem Belevich said the following: > IMHO the key here is whether hardware is broken or not. The only case > where correctable ECC errors are OK is when a bit gets flipped by a > high-energy particle. That's a normal but fairly rare event. If you > get bit flips often enough that you can recall details of more then > one of them on the same hardware, my guess would be that you're > dealing with something else -- bad/marginal memory, signal integrity > issues, power issues, overheating... The list continues.. In all those > cases hardware does *not* work correctly. Whether you can (or want to) > keep running stuff on the hardware that is broken is another question. Have you read the article? :) If not, read at least the summary. > On Tue, Aug 24, 2010 at 1:15 AM, Andriy Gapon <avg@icyb.net.ua> wrote: >> on 24/08/2010 09:14 Ronald Klop said the following: >>> >>> A little off topic, but what is 'a low rate of corrected ECC errors'? At work >>> one machine has them like ones per day, but runs ok. Is ones per day much? >> >> That's up to your judgment. It's like after how many remapped sectors do you >> replace HDD. >> You may find this interesting: >> http://www.cs.toronto.edu/~bianca/papers/sigmetrics09.pdf >> >> -- >> Andriy Gapon -- Andriy Gapon
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4C74258E.2060403>