From owner-freebsd-fs@FreeBSD.ORG Thu Jun 21 21:51:15 2012 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 63B46106564A for ; Thu, 21 Jun 2012 21:51:15 +0000 (UTC) (envelope-from rincebrain@gmail.com) Received: from mail-qc0-f182.google.com (mail-qc0-f182.google.com [209.85.216.182]) by mx1.freebsd.org (Postfix) with ESMTP id 16F228FC15 for ; Thu, 21 Jun 2012 21:51:15 +0000 (UTC) Received: by qcsg15 with SMTP id g15so743066qcs.13 for ; Thu, 21 Jun 2012 14:51:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=4xTlhwxX55e4QCz4RcmJn7pUcH+/zAcSYJjToPD8cSs=; b=Ey4wPYiBUyQeu/3V/JRhcZ+i6dJV2akzPPJFJDRLyTr736l9Ie6DAr2ToKDPNF82Lt rJJIZHvenPaBAIPfUeJ8wjqawsxlP14pIddw9FbNiLGuXJqvPvy6RP4FGmaKfJ56MAoy LYqDMBbfKbqK0p9UlpZhf51TpoJvSjr5hTj9cuOZuGqFRuf7k0PNFJHgz93h2y4JmKIj F1kYXh0hSHWh39naWjI5dpnf+ZekyfOFyChe8JOSlXJ37UVelc+tUHzGi27A194zGLVs aob/7iyFsFmeWwVAsxvZOy23tN4HHhQs92DAx/lkhhHAl3xv4ssTPyrJSM7gcBlWofZP iL8Q== MIME-Version: 1.0 Received: by 10.224.116.203 with SMTP id n11mr2182124qaq.61.1340315474523; Thu, 21 Jun 2012 14:51:14 -0700 (PDT) Sender: rincebrain@gmail.com Received: by 10.229.250.6 with HTTP; Thu, 21 Jun 2012 14:51:14 -0700 (PDT) In-Reply-To: <1953965235.30115.1340315339964.JavaMail.root@sz0192a.westchester.pa.mail.comcast.net> References: <1953965235.30115.1340315339964.JavaMail.root@sz0192a.westchester.pa.mail.comcast.net> Date: Thu, 21 Jun 2012 17:51:14 -0400 X-Google-Sender-Auth: AI0lr7-9K9OWN1mwLHbfaum3lkk Message-ID: From: Rich To: rondzierwa@comcast.net Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Cc: freebsd-fs@freebsd.org Subject: Re: ZFS Checksum errors X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 21 Jun 2012 21:51:15 -0000 To be honest, if ZFS says you've got a ton of checksum errors, I would strongly bet in favor of your data being damaged over a bug in ZFS. What're the underlying disks and RAID card? - Rich On Thu, Jun 21, 2012 at 5:48 PM, wrote: > > ok, i ran a verify on the raid, and it completed, so I believe that, from= the hardware standpoint, da0 should be a functioning, 12TB disk. > > i did a zpool clear and re-ran the scrub, and the results were almost ide= ntical: > > phoenix# zpool status -v zfsPool > pool: zfsPool > state: ONLINE > status: One or more devices has experienced an error resulting in data > corruption. Applications may be affected. > action: Restore the file in question if possible. Otherwise restore the > entire pool from backup. > see: http://www.sun.com/msg/ZFS-8000-8A > scrub: scrub completed after 3h39m with 6353 errors on Thu Jun 21 17:28:1= 0 2012 > config: > > NAME STATE READ WRITE CKSUM > zfsPool ONLINE 0 0 6.20K > da0 ONLINE 0 0 12.5K 24K repaired > > errors: Permanent errors have been detected in the following files: > > zfsPool/raid:<0x9e241> > zfsPool/Build:<0x0> > phoenix# > > along with the 6,353 I/O errors, there were over 12,000 checksum mismatch= errors on the console. > > > The recommendation from ZFS is to restore the file in question. At this p= oint, I would just like to delete the two files. > how do i do that? > > its these kind of antics that make me resistant to the thought of allowin= g ZFS to manage the raid. it seems to be having problems just managing a bi= g file system. I don't want it to correct anything, or restore anything, ju= st let me delete the files that hurt, fix up the free space list so it does= n't point outside the bounds of the disk, and get on with life. > > if its finding corrupted files that appear to not have a directory entry = associated with them (unlinked files), why doesn't it just delete them? fsc= k asks you if you want to delete unlinked files, why doesn't zfs do the sam= e, or at least give you the option of deleting bad files when it finds them= ? > > this is causing a lot of down time, and its making linux look very attrac= tive in my organization. how do I get this untangled short of reformatting = and starting over? > > ron. > > > ----- Original Message ----- > From: "Xin LI" > To: rondzierwa@comcast.net > Cc: "Steven Hartland" , freebsd-fs@freebsd.org > Sent: Wednesday, June 20, 2012 6:56:09 PM > Subject: Re: ZFS Checksum errors > > On Wed, Jun 20, 2012 at 1:55 PM, wrote: >> Steve. >> >> well, it got done, and it found another anonymous file with errors . any= idea how to get rid of these? > > Normally you need to "zpool clear zfsPool", and rerun zpool scrub. If > you see these numbers growing again, it's likely that there are some > other problems with your hardware. The recommended configuration is > to use ZFS to manage disks, or at least split your RAID volumes into > smaller ones by the way, since otherwise the volume is seen as a > "single disk" to ZFS, making it impossible to repair data errors > unless you add additional redundancy (zfs set copies=3D2, etc). > >> >> thanks, >> ron. >> >> >> >> phoenix# zpool status -v zfsPool >> pool: zfsPool >> state: ONLINE >> status: One or more devices has experienced an error resulting in data >> corruption. Applications may be affected. >> action: Restore the file in question if possible. Otherwise restore the >> entire pool from backup. >> see: http://www.sun.com/msg/ZFS-8000-8A >> scrub: scrub completed after 8h29m with 6276 errors on Wed Jun 20 16:18:= 01 2012 >> config: >> >> NAME STATE READ WRITE CKSUM >> zfsPool ONLINE 0 0 6.17K >> da0 ONLINE 0 0 13.0K 1.34M repaired >> >> errors: Permanent errors have been detected in the following files: >> >> zfsPool/raid:<0x9e241> >> zfsPool/Build:<0x0> >> phoenix# >> >> >> >> >> ----- Original Message ----- >> From: "Steven Hartland" >> To: rondzierwa@comcast.net, freebsd-fs@freebsd.org >> Sent: Wednesday, June 20, 2012 1:58:20 PM >> Subject: Re: ZFS Checksum errors >> >> ----- Original Message ----- >> From: >> .. >> >>> zpool status indicates that a file has errors, but doesn't tell me its = name: >>> >>> phoenix# zpool status -v zfsPool >>> pool: zfsPool >>> state: ONLINE >>> status: One or more devices has experienced an error resulting in data >>> corruption. Applications may be affected. >>> action: Restore the file in question if possible. Otherwise restore the >>> entire pool from backup. >>> see: http://www.sun.com/msg/ZFS-8000-8A >>> scrub: scrub in progress for 5h27m, 18.71% done, 23h42m to go >> >> Try waiting for the scrub to complete and see if its more helpful after = that. >> >> Regards >> Steve >> >> =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D >> This e.mail is private and confidential between Multiplay (UK) Ltd. and = the person or entity to whom it is addressed. In the event of misdirection,= the recipient is prohibited from using, copying, printing or otherwise dis= seminating it or any information contained in it. >> >> In the event of misdirection, illegible or incomplete transmission pleas= e telephone +44 845 868 1337 >> or return the E.mail to postmaster@multiplay.co.uk. >> >> _______________________________________________ >> freebsd-fs@freebsd.org mailing list >> http://lists.freebsd.org/mailman/listinfo/freebsd-fs >> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org" > > > > -- > Xin LI https://www.delphij.net/ > FreeBSD - The Power to Serve! Live free or die > _______________________________________________ > freebsd-fs@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-fs > To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"