From: Sverre Svenningsen
Date: Tue, 2 Oct 2007 20:27:55 +0200
To: Pawel Jakub Dawidek
Cc: freebsd-current, Brooks Talley
Subject: Re: ZFS corrupting data, even just sitting idle
Message-Id: <3939DF4C-D688-40B2-9A9A-1400E528AB07@online.no>
In-Reply-To: <20071002181404.GA4717@garage.freebsd.pl>
References: <7344605.82541191344652015.JavaMail.root@zmail.illuminati.org>
 <20071002181404.GA4717@garage.freebsd.pl>
List-Id: Discussions about the use of FreeBSD-current

On Oct 2, 2007, at 20:14, Pawel Jakub Dawidek wrote:

> On Tue, Oct 02, 2007 at 10:04:12AM -0700, Brooks Talley wrote:
>> Hi, everyone. I'm running 7.0-current amd64, built from CVS on
>> September 12. I've got a 4.5TB ZFS array across 8 750GB drives in
>> a RAIDZ1 + hotspare configuration.
>>
>> It's corrupting data even just sitting at idle with no access at
>> all. I had loaded it up with about 4TB of data several weeks ago,
>> then noticed that a zpool status showed checksum errors about a
>> week ago. I ran a scrub and it turned up 122 errors affecting about
>> 20 files. The errors were spread across the physical disks pretty
>> evenly, so it didn't seem like one bad drive.
>>
>> I left for vacation and unplugged the network from the machine to
>> ensure that there would be no access to the disk. There are no
>> cron jobs or anything else running locally that so much as touch
>> the zpool.
>>
>> Upon returning, I ran a zpool scrub and it found an additional 116
>> checksum errors in another 17 files, also evenly spread across the
>> physical drives.
>>
>> The system is running a Supermicro motherboard, Supermicro
>> AOC-SAT-MV8 SATA card, and WD 750GB drives. 2GB memory, no real
>> apps running, just storage.
>>
>> Anyone seen anything like this? It's a bit of a concern.
>
> Ok, and why do you blame ZFS for corrupting data instead of being
> thankful for detecting corruption? I'm quite sure it's not ZFS
> that is corrupting your data.
>
> --
> Pawel Jakub Dawidek                       http://www.wheel.pl
> pjd@FreeBSD.org                           http://www.FreeBSD.org
> FreeBSD committer                         Am I Evil? Yes, I Am!

Supposedly this card uses a Marvell 88SX6081 chipset, which as far as
I could tell is handled by Soren's ATA driver. Looks like work done
elsewhere in the kernel is making that driver misbehave in all sorts
of weird ways now.

It's nice that ZFS makes it easy to discover, at least :)

-Sverre