From owner-freebsd-fs@FreeBSD.ORG Thu Mar 11 19:09:57 2010 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C9CE5106566B for ; Thu, 11 Mar 2010 19:09:57 +0000 (UTC) (envelope-from freebsd-fs@m.gmane.org) Received: from lo.gmane.org (lo.gmane.org [80.91.229.12]) by mx1.freebsd.org (Postfix) with ESMTP id 55B2A8FC17 for ; Thu, 11 Mar 2010 19:09:57 +0000 (UTC) Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1Npnlc-0001JL-1S for freebsd-fs@freebsd.org; Thu, 11 Mar 2010 20:09:56 +0100 Received: from 207.155.204.151.ptr.us.xo.net ([207.155.204.151]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 11 Mar 2010 20:09:56 +0100 Received: from atkin901 by 207.155.204.151.ptr.us.xo.net with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Thu, 11 Mar 2010 20:09:56 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-fs@freebsd.org From: Mark Atkinson Date: Thu, 11 Mar 2010 11:09:43 -0800 Lines: 52 Message-ID: References: <20100311133916.42ba69b0@orwell.free.de> <4B993D19.60504@icyb.net.ua> Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 8bit X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 207.155.204.151.ptr.us.xo.net User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; en-US; rv:1.9.1.8) Gecko/20100303 Thunderbird/3.0.3 In-Reply-To: <4B993D19.60504@icyb.net.ua> Cc: freebsd-stable@freebsd.org Subject: Re: proliant server lockups with freebsd-amd64-stable (2010-03-10) X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 11 Mar 2010 19:09:57 -0000 On 03/11/10 10:57, Andriy Gapon wrote: > on 11/03/2010 20:25 Mark Atkinson said the following: >> On 03/11/10 04:39, Kai Gallasch wrote: >>> Hi. >>> >>> I have some trouble with an opteron server locking up spontaneously. It looses >>> all networks connectivity and even through console I can get no shell. >>> >>> Lockups occur mostly under disk load (periodic daily, bacula backup >>> running, make buildworld/buildkernel) and I can provoke them easily. >> >> try adding: >> >> vm.pmap.pg_ps_enabled="0" > > The lockups happen with this setting too. > This is likely a red herring, because Kai already has Alan's workaround from head. I'm not sure that the workaround covered the processor listed here > CPU: Quad-Core AMD Opteron(tm) Processor 2352 (2100.08-MHz K8-class CPU) > Origin = "AuthenticAMD" Id = 0x100f23 Stepping = 3 (mine is the same number and wasn't covered by the coded workaround commited to Linux, or IIRC Freebsd -- I'll have to look again). Turning off superpages totally solves the problem for me. But for Kai's problem, as Andriy points out, turning it off may not solve it. If I turn off superpages and enable MCE's I'll get the MCE when hitting the condition, so I just leave the MCE code off. >> to /boot/loader.conf and reboot. >> >> Long thread reference: >> >> http://thread.gmane.org/gmane.os.freebsd.current/121035/focus=121457 >> >> possible AMD "errata 147" > > 147? > I thought this problem was speculated to be from the bug: http://bugzilla.kernel.org/show_bug.cgi?id=11305 Which references an AMD "errata 147" from "Revision Guide for AMD Athlon™ 64 and AMD Opteron™ Processors." http://support.amd.com/us/Processor_TechDocs/25759.pdf