From owner-freebsd-stable@FreeBSD.ORG Mon Jul 23 14:26:52 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 0AE391065673 for ; Mon, 23 Jul 2012 14:26:52 +0000 (UTC) (envelope-from clay@milos.co.za) Received: from lisa.milos.co.za (lisa.milos.co.za [109.169.49.137]) by mx1.freebsd.org (Postfix) with ESMTP id 77F6E8FC1E for ; Mon, 23 Jul 2012 14:26:51 +0000 (UTC) Received: (qmail 94387 invoked by uid 89); 23 Jul 2012 14:26:50 -0000 Received: from unknown (HELO ClayDesktop) (clay@milos.co.za@192.168.200.6) by lisa.milos.co.za with ESMTPA; 23 Jul 2012 14:26:50 -0000 From: "Clayton Milos" To: "'Steven Hartland'" , References: <00f701cd68d4$4a5dd030$df197090$@milos.co.za> In-Reply-To: Date: Mon, 23 Jul 2012 15:26:15 +0100 Message-ID: <00f901cd68df$31696980$943c3c80$@milos.co.za> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Outlook 14.0 thread-index: AQEXgfnPJJuGgmq1l3YMN6hEjNHHFgGFRE45mJaP0mA= Content-Language: en-gb Cc: Subject: RE: ZFS causing panic X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Jul 2012 14:26:52 -0000 > Hi guys > > I've had an issue for some time now. When I'm copying a lot of files over to > ZFS usually using SMB it causes a panic and locks up the server. > I'm running FreeBSD 9.0-RELEASE with a custom kernel. I've just pulled > unnecessary drivers out of the config and added: > cpu HAMMER > device pf > device pflog > options DEVICE_POLLING > options HZ=1000 > > For full disclosure I am getting these errors in the syslog which means > there's an ECC error occurring somewhere which I am trying to locate. I have > replaced both of the CPU's and all of the RAM and am still getting it so > perhaps the north bridge has bought the farm. We have some similar HW here and we suspect either CPU or Northbridge too, we where seeing day to day panics and a scrub would pretty much guarantee a panic. We also replaced the CPU's with no joy but found disabling the cores of the CPU in the second socket made the issues go away so strengthening the Northbridge theory. Try disabling the cores with the following and see if it helps:- /boot/loader.conf hint.lapic.1.disabled="1" hint.lapic.2.disabled="1" Unfortunately if you have perceived corruption due to this type of issue there's no guaranteeing what state your data is really in :( Regards Steve ================================================ This e.mail is private and confidential between Multiplay (UK) Ltd. and the person or entity to whom it is addressed. In the event of misdirection, the recipient is prohibited from using, copying, printing or otherwise disseminating it or any information contained in it. In the event of misdirection, illegible or incomplete transmission please telephone +44 845 868 1337 or return the E.mail to postmaster@multiplay.co.uk. Thanks Steve I'm giving that a try now. Luckily all of the data is backed up so I don't mind copying it back until it panics again :) //Clay