From owner-freebsd-current@FreeBSD.ORG Thu Jul 12 11:52:10 2007 Return-Path: X-Original-To: current@freebsd.org Delivered-To: freebsd-current@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id E263116A400 for ; Thu, 12 Jul 2007 11:52:10 +0000 (UTC) (envelope-from enderbsd@gmail.com) Received: from wx-out-0506.google.com (wx-out-0506.google.com [66.249.82.226]) by mx1.freebsd.org (Postfix) with ESMTP id 9F92F13C447 for ; Thu, 12 Jul 2007 11:52:10 +0000 (UTC) (envelope-from enderbsd@gmail.com) Received: by wx-out-0506.google.com with SMTP id i29so101175wxd for ; Thu, 12 Jul 2007 04:52:10 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:references; b=aGmlby2x/PseoJbpcteJN2XCZRH4gGfZw2nqh2ucROV6GY7Gbo3mJhzJNxeXTu6VjwnYAyBgXjHjL+PlV+A5TGZhtDGT1T1FVRxfnfC0jl3s+tv7eHaEVSzEStj9H7UgRk+zZS/roOOaDkk/51gFcj2/k2TjnYWC8hAiyCiuDXo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:references; b=evRngPihelNoei3Ltf/MbaiweQhuLF6/T18SaBAl2jKme1W2Hgky1oTpvmbp1JG6I0+rkSUq0E5Y6JilzxTy1OB6f2SyANvjuMouMwrB78/tvRS/GFBbVPMSlTxs1FAluFul66hyI5FeC/K5P43SEyq9I0S/jVd5OQSJvrvOpic= Received: by 10.90.79.6 with SMTP id c6mr235024agb.1184241129161; Thu, 12 Jul 2007 04:52:09 -0700 (PDT) Received: by 10.90.56.12 with HTTP; Thu, 12 Jul 2007 04:52:08 -0700 (PDT) Message-ID: <15d429c0707120452g5308aedcx301d32c387396a65@mail.gmail.com> Date: Thu, 12 Jul 2007 07:52:08 -0400 From: "Simon Dircks" To: "Pawel Jakub Dawidek" In-Reply-To: <20070712064807.GC1227@garage.freebsd.pl> MIME-Version: 1.0 References: <200707071426.18202.dfr@rabson.org> <20070709000918.GD1208@garage.freebsd.pl> <46922D75.4010006@gddsn.org.cn> <15d429c0707111724w76d47529v4c5fad6ac7892875@mail.gmail.com> <20070712064807.GC1227@garage.freebsd.pl> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: current@freebsd.org Subject: Re: ZFS leaking vnodes (sort of) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Jul 2007 11:52:11 -0000 On 7/12/07, Pawel Jakub Dawidek wrote: > > On Wed, Jul 11, 2007 at 08:24:41PM -0400, Simon Dircks wrote: > > With this patch i am still able to reproduce my ZFS crash. > > > > controllera# uname -a > > FreeBSD controllera.storage.ksdhost.com 7.0-CURRENT FreeBSD 7.0-CURRENT#0: > > Thu Jul 12 02:28:52 UTC 2007 > > graff@controllera.storage.ksdhost.com:/usr/obj/usr/src/sys/CONTROLLERA > > amd64 > > > > > > panic: ZFS: bad checksum (read on off 0: zio > 0xffffff001d729810 > > [LO SP > > A space map] 1000L/800P DVA[0]=<0:1600421800:800> > DVA[1]=<0:2c000f7000:800> > > DVA[ > > 2]=<0:4200013800:800> fletcher4 lzjb LE contiguous birth=566 fill=1 > > chsum=5d3276 > > 7b98:635ff7022f8b:4251 > > cpuid = 0 > > KDB: enter: panic > > [thread pid 802 tid 100066 ] > > stopped at kdb_enter+0x31: leave > > This isn't related to the patch, actually. It looks like you don't have > enough redundancy. Can you paste 'zpool status' output? > > Sure controllera# zpool status pool: tank state: UNAVAIL status: One or more devices could not be opened. There are insufficient replicas for the pool to continue functioning. action: Attach the missing device and online it using 'zpool online'. see: http://www.sun.com/msg/ZFS-8000-D3 scrub: none requested config: NAME STATE READ WRITE CKSUM tank UNAVAIL 0 0 0 insufficient replicas mirror UNAVAIL 0 0 0 insufficient replicas ggate111 UNAVAIL 0 0 0 cannot open ggate211 UNAVAIL 0 0 0 cannot open Now here is another interesting thing: I can cause a crash now by just reattaching the disks. I can repeat this without fail in just a few mins. So for every test i have been typing zpool destroy tank, and making a fresh pool. Kip Macy wrote: That looks more like bad disk than a file system bug. That could be, my "disks" are actually ggatec devices on other machines. And for some reason when they are under ZFS i get alot of packet loss (and ping spike) over the gigabit interface even when it is not maxed out. But should this still cause a panic? my / and /usr are a normal local disk on UFS. Using the same ggate111 and ggate112 devices with gmirror + mount -o async and copying file.XXX over to the gmirror I get no packet loss, no ping spikes, and the checksum matches after i move the file over. All the machines in question are the same version of freebsd, and all have full debugging on and are not in production or running anything else.