From owner-freebsd-fs@FreeBSD.ORG Mon Nov 9 09:13:01 2009 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 885E9106566B for ; Mon, 9 Nov 2009 09:13:01 +0000 (UTC) (envelope-from gerrit@pmp.uni-hannover.de) Received: from mrelay1.uni-hannover.de (mrelay1.uni-hannover.de [130.75.2.106]) by mx1.freebsd.org (Postfix) with ESMTP id 134E08FC0A for ; Mon, 9 Nov 2009 09:13:00 +0000 (UTC) Received: from www.pmp.uni-hannover.de (www.pmp.uni-hannover.de [130.75.117.2]) by mrelay1.uni-hannover.de (8.14.2/8.14.2) with ESMTP id nA99CtQm029150; Mon, 9 Nov 2009 10:12:56 +0100 Received: from pmp.uni-hannover.de (arc.pmp.uni-hannover.de [130.75.117.1]) by www.pmp.uni-hannover.de (Postfix) with SMTP id 82EB424; Mon, 9 Nov 2009 10:12:55 +0100 (CET) Date: Mon, 9 Nov 2009 10:12:55 +0100 From: Gerrit =?ISO-8859-1?Q?K=FChn?= To: "James R. Van Artsdalen" Message-Id: <20091109101255.e81774e4.gerrit@pmp.uni-hannover.de> In-Reply-To: <4AF4AAFF.2080104@jrv.org> References: <20091106094734.4b056899.gerrit@pmp.uni-hannover.de> <4AF4123A.4080301@andric.com> <20091106231440.4f0f2cbb.gerrit@pmp.uni-hannover.de> <4AF4AAFF.2080104@jrv.org> Organization: Albert-Einstein-Institut (MPI =?ISO-8859-1?Q?f=FCr?= Gravitationsphysik & IGP =?ISO-8859-1?Q?Universit=E4t?= Hannover) X-Mailer: Sylpheed 2.7.1 (GTK+ 2.12.11; i386-portbld-freebsd7.0) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-PMX-Version: 5.5.5.374460, Antispam-Engine: 2.7.1.369594, Antispam-Data: 2009.11.9.90052 Cc: freebsd-fs@freebsd.org Subject: Re: trace for zfs panic mounting fs after crash with RC2 X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 09 Nov 2009 09:13:01 -0000 On Fri, 06 Nov 2009 17:02:23 -0600 "James R. Van Artsdalen" wrote about Re: trace for zfs panic mounting fs after crash with RC2: JRVA> How the ZIL got corrupted - if it did - is a harder question. I think it is. Otherwise zfs would not crash while trying to replay the ZIL, wouldn't it? It seems that this happens rather easily with the system I have at hand (it happend twice to me so far - and I crashed the system only twice, that makes 100%, although I doubt that it is that reproducible). Searching around I found some reports of the same or similar issues (but no solution). So apart from recovering my fs (I did not try your suggested patch yet), there are two things I regard as very important: 1. Find you why the ZIL gets corrupted under some circumstances. 2. Find a safe way to recover a fs with a corrupted ZIL. I guess I could live with a corrupted ZIL after a crash, if there was some kind of --ignore-zil switch to get my data back online. In any case, zfs should not panic on corrupted ZIL data, should it? As I do not dare to use the system for storing data until this is sorted out, I can try out almost anything to get more information about the problem. Please let me know what I should do to support debugging. cu Gerrit