From owner-freebsd-hardware@freebsd.org Tue Jul 28 19:04:45 2015 Return-Path: Delivered-To: freebsd-hardware@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 621FC9AD08D for ; Tue, 28 Jul 2015 19:04:45 +0000 (UTC) (envelope-from jpaetzel@FreeBSD.org) Received: from out4-smtp.messagingengine.com (out4-smtp.messagingengine.com [66.111.4.28]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 360EB35F for ; Tue, 28 Jul 2015 19:04:44 +0000 (UTC) (envelope-from jpaetzel@FreeBSD.org) Received: from compute1.internal (compute1.nyi.internal [10.202.2.41]) by mailout.nyi.internal (Postfix) with ESMTP id 24D9420ECF for ; Tue, 28 Jul 2015 15:04:44 -0400 (EDT) Received: from frontend1 ([10.202.2.160]) by compute1.internal (MEProxy); Tue, 28 Jul 2015 15:04:44 -0400 DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d= messagingengine.com; h=content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-sasl-enc:x-sasl-enc; s=smtpout; bh=hSN2WQUABttH+Lg hYjK5QUj82cs=; b=MtrSYVjB6dGGzjd6kNPrngZQ/JzoU9QI1rqcv9U80GQFBCu 3GaMrJ/PQtO5S9oVcK6c4L4QqVKXOJ7HqBEGxPSFkpfb1PjlokqCNv16nz+wHAbi 7T5P9emzhFMzfm9SPdvQ+kDjRcu0/2GcNHmSYlsOPhWRwnPv4Qk5vaAH8iEo= X-Sasl-enc: 5hMdtnf8fBNRnzPD0UzB3YMn3i4JjMw9HSjdB407eSta 1438110283 Received: from roadrash.tcbug.org (cheqtel-68.234.77-pppoe-58.airstreamcomm.net [68.234.77.58]) by mail.messagingengine.com (Postfix) with ESMTPA id E0A00C0001F for ; Tue, 28 Jul 2015 15:04:43 -0400 (EDT) Message-ID: <55B7D24B.5060709@FreeBSD.org> Date: Tue, 28 Jul 2015 14:04:43 -0500 From: Josh Paetzel Organization: FreeBSD User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: freebsd-hardware@freebsd.org Subject: Re: L2 cache errors??? References: <55B7B8FA.2060800@digiware.nl> <55B7C059.5020701@sentex.net> <55B7CCA1.4020906@digiware.nl> In-Reply-To: <55B7CCA1.4020906@digiware.nl> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Jul 2015 19:04:45 -0000 On 07/28/2015 13:40, Willem Jan Withagen wrote: > On 28/07/2015 19:48, Mike Tancsa wrote: >> On 7/28/2015 1:16 PM, Willem Jan Withagen wrote: >>> Hi, >>> >>> Are these what I think they are? >>> Errors in the CPU L2 cache? >>> >>> Are the ECC corrected? >>> Or is error really "data kaput"? >>> >> >> >> Could be. There is also an erratum issue that triggers these errors on >> certain CPUs when running software like virtualbox. It was fixed in >> RELENG_10 some time ago. What are you running ? >> >> >> https://svnweb.freebsd.org/base?view=revision&revision=269052 >> >> has some details. > > 'mmm, > Not running Haswell stuff, but rather older hardware. > > Looked in older logfiles, and there are a few more... > All with the same data, except that it is detected on different CPUs > > And it occurs when running: > mbuffer -4 -m 1000M -I 6666 | \ > zfs receive -F -d -v zfs > to receive a full backup from my fileserver. > > --WjW > You can tell ECC corrected the error because on FreeBSD if ECC can't fix the error the system will panic. Other systems (Solaris and HP-UX being the two I have direct experience with) can detach subsystems that have sustained uncorrectable errors in some cases. (Yes, even CPUs!) If a system is generating hundreds or thousands of MCAs a minute you are dealing with a hardware issue. If you are getting spurious MCAs to the tune of a few a day there's nothing abnormal or broken there it's just the system doing what it's supposed to. Given the amount of data that flies around inside modern computers I'm surprised there aren't more MCAs than there are in most systems. -- FreeBSD - The Power To Serve.