From: Royce Williams
Date: Sat, 1 Oct 2011 09:41:27 -0800
To: freebsd-stable
Subject: Re: Interpreting MCA error output
In-Reply-To: <20111001102327.GA37434@icarus.home.lan>
References: <20111001102327.GA37434@icarus.home.lan>

On Sat, Oct 1, 2011 at 2:23 AM, Jeremy Chadwick wrote:

[snip]

> Decoding the MCE can be done using Linux's mcelog program -- you'll need
> to download the source and apply the patch by
> hand *and* put in place a
> heavily modified version of memstream.c -- which requires a lot of
> patching to work on FreeBSD, and can only be used to decode
> ASCII-provided MCEs; DMI support does not work.  So, you have to apply
> patches then use "mcelog --no-dmi --ascii" and provide the MCE text via
> stdin (or use --file).

I'm glad to see this thread; I have a different error, and I wanted to
make sure I was fixing the right problem before randomly swapping
hardware.

> John Baldwin tends to keep up-to-date patches for mcelog here:
>
> http://people.freebsd.org/~jhb/mcelog/
>
> The last build of mcelog I did on FreeBSD was for mcelog-1.0pre2, which
> John's patch (at the time) did not work with.  I made my own patch
> (dated 2011/02/11), but it looks like John has since updated his patch.
> If you need/want mine, I can put it up on the web.

That would be very useful as a crosscheck.

I found one additional intermediate patch from John, posted on
2011-04-26, that appears not to have been merged into ~jhb/mcelog/:

http://lists.freebsd.org/pipermail/freebsd-hackers/2011-April/035159.html

That patch got me up and going (using 'gmake FREEBSD=yes i386=yes').

My problem is different from the original poster's and looks a bit more
serious:

royce@heffalump$ ./mcelog --no-dmi --ascii
MCA: Bank 1, Status 0x9400000000000151
MCA: Global Cap 0x0000000000000105, Status 0x0000000000000000
MCA: Vendor "AuthenticAMD", ID 0xfc0, APIC ID 0
MCA: CPU 0 COR ICACHE L1 IRD error
MCA: Address 0xc089d890
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 1 instruction cache
ADDR c089d890
  memory/cache error 'instruction fetch mem transaction, instruction transaction, level 1'
STATUS 9400000000000151 MCGSTATUS 0
MCGCAP 105 APICID 0 SOCKETID 0
CPUID Vendor AMD Family 15 Model 12

> A few moments ago I tried to download mcelog from the official site, but
> ftp.kernel.org is presently returning NXDOMAIN for me (e.g. A record not
> found).
> The same goes for git.kernel.org.  Great.....

kernel.org is still down from the compromise a few weeks ago. At
least one kernel.org mirror still has mcelog-1.0pre2:

http://mirror.xmission.com/kernel.org/linux/utils/cpu/mce/mcelog-1.0pre2.tar.gz

The main mcelog page also has a link to how to get it from GitHub
while kernel.org is down:

https://github.com/andikleen/mcelog

Agreed that a port and a CGI would be higher leverage, but these
breadcrumbs should help in the short term.

Royce
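
P.S. As a rough cross-check on the mcelog decode, the Status value can also be picked apart by hand. The sketch below is not taken from mcelog or from John's patch; it assumes the usual x86 MCi_STATUS layout (flag bits 63 down to 57, error code in the low 16 bits) and the memory/cache error-code form 0000 0001 RRRR TTLL. The function name decode_status and the lookup tables are my own, for illustration only.

```python
# Sketch only: assumes the common x86 MCi_STATUS bit layout and handles
# just the memory/cache error-code format (0000 0001 RRRR TTLL).

STATUS_FLAGS = {
    63: "VAL",    # register holds a valid error
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # uncorrected error
    60: "EN",     # error reporting was enabled
    59: "MISCV",  # MISC register is valid
    58: "ADDRV",  # ADDR register is valid
    57: "PCC",    # processor context corrupt
}

# Field encodings for the memory/cache error code.
RRRR = {0: "ERR", 1: "RD", 2: "WR", 3: "DRD", 4: "DWR",
        5: "IRD", 6: "PREFETCH", 7: "EVICT", 8: "SNOOP"}
TT = {0: "I", 1: "D", 2: "G"}             # instruction, data, generic
LL = {0: "L0", 1: "L1", 2: "L2", 3: "LG"}  # cache level, LG = generic


def decode_status(status):
    """Return the set flag names and, if applicable, the cache-error fields."""
    flags = [name
             for bit, name in sorted(STATUS_FLAGS.items(), reverse=True)
             if (status >> bit) & 1]
    code = status & 0xFFFF
    result = {"flags": flags, "error_code": code}
    if (code >> 8) == 0x01:  # upper byte 0000 0001 marks a memory/cache error
        result["kind"] = "memory/cache"
        result["request"] = RRRR.get((code >> 4) & 0xF, "?")
        result["type"] = TT.get((code >> 2) & 0x3, "?")
        result["level"] = LL.get(code & 0x3, "?")
    return result


if __name__ == "__main__":
    # Status value copied from the MCA output in this message.
    print(decode_status(0x9400000000000151))
```

For the Status above this reports VAL, EN and ADDRV set with UC clear, and request IRD, type I, level L1, which lines up with the kernel's "COR ICACHE L1 IRD" line and with mcelog's "instruction fetch ... level 1" text.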