From: Andriy Gapon
Date: Wed, 28 Jul 2010 12:58:21 +0300
To: "V. T. Mueller, Continum"
Cc: freebsd-current@freebsd.org
Subject: Re: MCE machine check exceptions - status, tools?
Message-ID: <4C4FFF3D.50305@icyb.net.ua>
In-Reply-To: <4C4FF906.6060107@continum.net>

on 28/07/2010 12:31 V. T. Mueller, Continum said the following:
> Hello,
>
> By searching the net I was only able to find that "better support" for
> 9.0 is on its way. So I'd like to ask if MCEs (like ECC-related messages
> from, say Supermicro boards) are being already processed by the kernel.
> Are there any (plans for) tools to handle and process these messages in
> userland?
>
> The amount of memory and memory modules keeps increasing, so MCE logging
> for non A-brand hardware (these trigger LEDs and/or tools from firmware)
> appears to gain increasing importance, too.
>
> I'd be grateful for hints, URLs, tips etc.

MCA support is in current and stable/8.
I believe that it's enabled by default, so there is not much to configure or
to do except watch for MCE reports in the system log (or via hw.mca.count).
That's for correctable MCEs, though; non-correctable ones would result in a
panic.
See the sys/x86/x86/mca.c code for details.
John Baldwin has a tool that produces a more human-friendly description of the
exceptions, should you ever get one.

-- 
Andriy Gapon
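
For anyone who wants to script the hw.mca.count check mentioned above, here is a
minimal sketch. It assumes that hw.mca.count prints as a plain integer and that
the standard sysctl(8) utility is on the PATH. The helper names parse_count and
read_mca_count are made up for illustration and are not part of any FreeBSD
tool.

```python
import subprocess


def parse_count(text):
    """Convert the text printed by `sysctl -n hw.mca.count` to an int.

    Assumption: the value is a plain decimal integer.
    """
    return int(text.strip())


def read_mca_count(sysctl_bin="sysctl"):
    """Return the current hw.mca.count value, or None if it cannot be read.

    None covers a missing sysctl binary, an unknown OID (for example on a
    system without MCA support), and unparsable output.
    """
    try:
        result = subprocess.run(
            [sysctl_bin, "-n", "hw.mca.count"],
            capture_output=True,
            text=True,
            check=True,
        )
    except (OSError, subprocess.CalledProcessError):
        return None
    try:
        return parse_count(result.stdout)
    except ValueError:
        return None


if __name__ == "__main__":
    count = read_mca_count()
    if count is None:
        print("hw.mca.count is not available on this system")
    else:
        print("machine check records so far:", count)
```

Run periodically (from cron, say) and compare against the previous value; an
increase suggests a new correctable event was logged, and the details should
then be in the system log.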