From owner-freebsd-current@FreeBSD.ORG Sat Jul 16 18:38:17 2011 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C5CD0106564A for ; Sat, 16 Jul 2011 18:38:17 +0000 (UTC) (envelope-from sgk@troutmask.apl.washington.edu) Received: from troutmask.apl.washington.edu (troutmask.apl.washington.edu [128.95.76.21]) by mx1.freebsd.org (Postfix) with ESMTP id 8896C8FC0A for ; Sat, 16 Jul 2011 18:38:17 +0000 (UTC) Received: from troutmask.apl.washington.edu (localhost.apl.washington.edu [127.0.0.1]) by troutmask.apl.washington.edu (8.14.4/8.14.4) with ESMTP id p6GIcH89072982; Sat, 16 Jul 2011 11:38:17 -0700 (PDT) (envelope-from sgk@troutmask.apl.washington.edu) Received: (from sgk@localhost) by troutmask.apl.washington.edu (8.14.4/8.14.4/Submit) id p6GIcH1w072981; Sat, 16 Jul 2011 11:38:17 -0700 (PDT) (envelope-from sgk) Date: Sat, 16 Jul 2011 11:38:17 -0700 From: Steve Kargl To: Kim Culhan Message-ID: <20110716183817.GA72898@troutmask.apl.washington.edu> References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.3i Cc: freebsd-current@freebsd.org Subject: Re: Interpreting MCA error output X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 16 Jul 2011 18:38:17 -0000 On Sat, Jul 16, 2011 at 02:25:20PM -0400, Kim Culhan wrote: > Noticed the following console message while running make world with > 9.0-CURRENT on 7-16-11 > > Jul 16 11:15:20 delta kernel: MCA: Vendor "GenuineIntel", ID 0x106a5, APIC > ID 16 > Jul 16 11:15:20 delta kernel: MCA: CPU 8 COR (1) RD channel ?? memory error > Jul 16 11:15:20 delta kernel: MCA: Address 0x28f261f80 > Jul 16 11:15:20 delta kernel: MCA: Misc 0x1834958000001385 > Jul 16 12:15:20 delta kernel: MCA: Bank 8, Status 0x8c0000400001009f > Jul 16 12:15:20 delta kernel: MCA: Global Cap 0x0000000000001c09, Status > 0x00000 > 00000000000 > Jul 16 12:15:20 delta kernel: MCA: Vendor "GenuineIntel", ID 0x106a5, APIC > ID 16 > Jul 16 12:15:20 delta kernel: MCA: CPU 8 COR (1) RD channel ?? memory error > Jul 16 12:15:20 delta kernel: MCA: Address 0x28e019f80 > Jul 16 12:15:20 delta kernel: MCA: Misc 0x1834958000000588 Copying the above into zxc, I see troutmask:kargl[212] ./mcelog --ascii < zxc mcelog: Cannot open /dev/mem for DMI decoding: Permission denied HARDWARE ERROR. This is *NOT* a software problem! Please contact your hardware vendor CPU 8 BANK 8 MISC 1834958000000588 ADDR 28e019f80 MCG status: MCi status: MCi_MISC register valid MCi_ADDR register valid MCA: MEMORY CONTROLLER RD_CHANNELunspecified_ERR Transaction: Memory read error Memory read ECC error Memory corrected error count (CORE_ERR_CNT): 1 Memory transaction Tracker ID (RTId): 88 Memory DIMM ID of error: 0 Memory channel ID of error: 0 Memory ECC syndrome: 18349580 STATUS 8c0000400001009f MCGSTATUS 0 MCGCAP 1c09 APICID 10 SOCKETID 0 CPUID Vendor Intel Family 6 Model 26 Looks like your DIMM 0 had an error that was corrected due to ECC.