From owner-freebsd-stable@FreeBSD.ORG Mon May 16 19:48:54 2011 Return-Path: Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B6D19106566C; Mon, 16 May 2011 19:48:54 +0000 (UTC) (envelope-from avg@FreeBSD.org) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id C843C8FC15; Mon, 16 May 2011 19:48:53 +0000 (UTC) Received: from porto.topspin.kiev.ua (porto-e.starpoint.kiev.ua [212.40.38.100]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id WAA07565; Mon, 16 May 2011 22:48:47 +0300 (EEST) (envelope-from avg@FreeBSD.org) Received: from localhost.topspin.kiev.ua ([127.0.0.1]) by porto.topspin.kiev.ua with esmtp (Exim 4.34 (FreeBSD)) id 1QM3mZ-00045Q-J4; Mon, 16 May 2011 22:48:47 +0300 Message-ID: <4DD17F9E.6000702@FreeBSD.org> Date: Mon, 16 May 2011 22:48:46 +0300 From: Andriy Gapon User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US; rv:1.9.2.17) Gecko/20110503 Lightning/1.0b2 Thunderbird/3.1.10 MIME-Version: 1.0 To: John Hay References: <20110510125220.GA88338@zibbi.meraka.csir.co.za> <20110516162319.GA58581@zibbi.meraka.csir.co.za> In-Reply-To: <20110516162319.GA58581@zibbi.meraka.csir.co.za> X-Enigmail-Version: 1.1.2 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: alc@FreeBSD.org, freebsd-stable@FreeBSD.org Subject: Re: MCA: CPU 0 UNCOR PCC DTLB L1 error X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 May 2011 19:48:54 -0000 on 16/05/2011 19:23 John Hay said the following: > > I have applied the patch, but got another one today. Can you please double-check that you indeed have the patch and you are running a patched kernel. > I still do not get > a prompt or dump. :-( That could be expected, because MCE is not a software problem, but a hardware one. > It just get stuck right after #4. If there is anything > more that I can try, just ask. Please try to disable superpages, put vm.pmap.pg_ps_enabled="0" in your loader.conf. If the problem persists, then my guess would be that you have a genuine hardware problem with your CPU. Next thing to try would be to replace it. If disabling superpages helps you, then please let us know. But please do not hurry with conclusions. > ##################################################################### > MCA: Bank 0, Status 0xb600000000010015 > MCA: Global Cap 0x0000000000000106, Status 0x0000000000000004 > MCA: Vendor "AuthenticAMD", ID 0x500f10, APIC ID 0 > MCA: CPU 0 UNCOR PCC DTLB L1 error To Alan, just in case: previously OVER bit was also set here. > MCA: Address 0x808ace000 > > > Fatal trap 28: machine check trap while in user mode > cpuid = 1; apic id = 01 > instruction pointer = 0x43:0x80af206d5 > stack pointer = 0x3b:0x7fffffffb8e8 > frame pointer = 0x3b:0x809b92450 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 3, pres 1, long 1, def32 0, gran 1 > processor eflags = interrupt enabled, IOPL = 0 > current process = 22228 (initial thread) > trap number = 28 > panic: machine check trap > cpuid = 1 > KDB: stack backtrace: > #0 0xffffffff80608f6e at kdb_backtrace+0x5e > #1 0xffffffff805d6917 at panic+0x187 > #2 0xffffffff808bf7c0 at trap_fatal+0x290 > #3 0xffffffff808bfda9 at trap+0x109 > #4 0xffffffff808a8084 at calltrap+0x8 > ##################################################################### -- Andriy Gapon