From owner-freebsd-stable@FreeBSD.ORG Fri Jul 20 21:39:59 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4CE5B106566B for ; Fri, 20 Jul 2012 21:39:59 +0000 (UTC) (envelope-from joe@tao.org.uk) Received: from babel.karthauser.co.uk (babel.realityhacker.info [95.154.203.112]) by mx1.freebsd.org (Postfix) with ESMTP id 093538FC0A for ; Fri, 20 Jul 2012 21:39:59 +0000 (UTC) Received: from dspam (babel.realityhacker.info [95.154.203.112]) by babel.karthauser.co.uk (Postfix) with SMTP id 0459C7DD for ; Fri, 20 Jul 2012 21:39:51 +0000 (UTC) Received: from infi.fritz.box (unknown [81.187.183.70]) (Authenticated sender: joemail@tao.org.uk) by babel.karthauser.co.uk (Postfix) with ESMTPSA id 9CA2C7D9; Fri, 20 Jul 2012 21:39:36 +0000 (UTC) Mime-Version: 1.0 (Apple Message framework v1257) Content-Type: text/plain; charset=us-ascii From: Dr Josef Karthauser In-Reply-To: <20120719171548.GM32960@teardrop.org> Date: Fri, 20 Jul 2012 16:09:28 +0100 Content-Transfer-Encoding: quoted-printable Message-Id: References: <20120719152909.GL32960@teardrop.org> <002D6A20-D2A4-4909-B2EA-3DB562326050@tao.org.uk> <20120719171548.GM32960@teardrop.org> To: James Snow X-Mailer: Apple Mail (2.1257) X-DSPAM-Result: Innocent X-DSPAM-Processed: Fri Jul 20 21:39:51 2012 X-DSPAM-Confidence: 1.0000 X-DSPAM-Probability: 0.0023 X-DSPAM-Signature: 5009d027713785118724188 Cc: "freebsd-stable@freebsd.org" Subject: Re: Checksum errors across ZFS array X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 20 Jul 2012 21:39:59 -0000 On 19 Jul 2012, at 18:15, James Snow wrote: > On Thu, Jul 19, 2012 at 06:05:32PM +0100, Dr Joe Karthauser wrote: >=20 >> Hi James, >>=20 >> It's almost definitely a memory problem. I'd change it ASAP if I were >> you. >>=20 >> I lost about 70mb from my zfs pool for this very reason just a few >> weeks ago. Luckily I had enough snapshots from before the rot set in >> to recover most of what I lost. >=20 > Thanks for the input. I will run a memory test against it. >=20 > If I may, why "almost definitely" a memory problem and not an issue = with > the controller? (Or did you mean the controller memory?) Hey Snow, Ok, it's not definitely. Of course, it could be anything. But, memory is = where I'd look first. Take care though, my system which had been working fine for about a year = when I noticed the ZFS rot (which all appears to be recent in time). I = ran memcheck+ on it for 8 hours or so, and it showed no errors at all. = However, when I replaced the memory with a different vendor the problems = went away. (Reboots and power off/on restarts hadn't fixed the problem = before!). So, take care if the memory doesn't report any failures, it might still = be faulty. Joe p.s. It was my fault that I wasn't running ECC memory on the system! :/. =20=