From owner-freebsd-stable@FreeBSD.ORG Wed May 18 09:30:05 2011 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 069791065673 for ; Wed, 18 May 2011 09:30:04 +0000 (UTC) (envelope-from luke@digital-crocus.com) Received: from mail.digital-crocus.com (node2.digital-crocus.com [91.209.244.128]) by mx1.freebsd.org (Postfix) with ESMTP id 9D9698FC12 for ; Wed, 18 May 2011 09:30:04 +0000 (UTC) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dkselector; d=hybrid-logic.co.uk; h=Received:Received:Subject:From:Reply-To:To:Cc:In-Reply-To:References:Content-Type:Organization:Date:Message-ID:Mime-Version:X-Mailer:Content-Transfer-Encoding:X-Spam-Score:X-Digital-Crocus-Maillimit:X-Authenticated-Sender:X-Complaints:X-Admin:X-Abuse; b=movQEO4bSYc5rwCsTEW3ba+fjvXu3hLPP7UC1M5fowuH72HUz5KUD1rvmGlOYGRa6+vR8s/fGlqUqSVUT2A9WWptXXfbRGV4NN8N2b8WVyyFqaNeeigeJXpsR1gVqPvM; Received: from luke by mail.digital-crocus.com with local (Exim 4.69 (FreeBSD)) (envelope-from ) id 1QMceF-000GAR-En for stable@freebsd.org; Wed, 18 May 2011 10:02:31 +0100 Received: from [109.107.35.48] (helo=[IPv6:::1]) by mail.digital-crocus.com with esmtpa (Exim 4.69 (FreeBSD)) (envelope-from ) id 1QMceC-000G9m-1D; Wed, 18 May 2011 10:02:31 +0100 From: Luke Marsden To: Jeremy Chadwick In-Reply-To: <20110517112952.GA48610@icarus.home.lan> References: <20110517073029.GA44359@icarus.home.lan> <4DD25264.8040305@FreeBSD.org> <20110517112952.GA48610@icarus.home.lan> Content-Type: text/plain; charset="UTF-8" Organization: Hybrid Web Cluster Date: Wed, 18 May 2011 10:03:08 +0100 Message-ID: <1305709388.4447.104.camel@pow> Mime-Version: 1.0 X-Mailer: Evolution 2.32.2 Content-Transfer-Encoding: 7bit X-Spam-Score: -1.0 X-Digital-Crocus-Maillimit: done X-Authenticated-Sender: luke X-Complaints: abuse@digital-crocus.com X-Admin: admin@digital-crocus.com X-Abuse: abuse@digital-crocus.com (Please include full headers in abuse reports) Cc: Charles Sprickman , stable@FreeBSD.org, tech@hybrid-logic.co.uk, Andriy Gapon Subject: Re: 8.1R possible zfs snapshot livelock? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: luke@hybrid-logic.co.uk List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 18 May 2011 09:30:05 -0000 Hi all, On Tue, 2011-05-17 at 04:29 -0700, Jeremy Chadwick wrote: > There are still some outstanding incidents that directly pertain to > ZFS snapshots, or are "related" to ZFS snapshots (meaning things like > send/recv which are commonly used alongside snapshots), which I > remember reading about but really saw no answer to: > > * ZFS send | ssh zfs recv results in ZFS subsystem hanging; > 8.1-RELEASE; > > February 2011: >http://lists.freebsd.org/pipermail/freebsd-fs/2011-February/010602.html As the original author of this post I wanted to chime in to say that our problem was mis-diagnosed here as being related to snapshots and zfs send/receive. Instead, it was a bug [1] relating to force-unmounting a ZFS filesystem which has active child nullfs mounts and active special devices (FIFO). There is a related kernel panic [1] which suggests that this is a problem area. I've been meaning to collect enough information to submit a proper bug report -- I can at least reliably reproduce the issue -- but have been rather too busy with the 1.0 release of our application, and was put off by one response: "IMO this is expected". [1] http://lists.freebsd.org/pipermail/freebsd-fs/2011-March/010983.html Our application -- see HCFS at http://www.hybrid-cluster.com/tech/ -- makes very heavy use of ZFS snapshots and ZFS send/receive on FreeBSD (currently 8.1), and since we engineered it so that it never attempts foolish force-unmounts on "busy" filesystems we've seen no kernel hangs over the course of hundreds of thousands of snapshot and zfs replication events in testing. I'm interested to know whether the OP's problem is fixed in 8.2 or 8-STABLE, since it could affect us. Also, thanks for the links to the backports for 8.2, Jeremy, I'll include those in our next system image. -- Best Regards, Luke Marsden CTO, Hybrid Logic Ltd. Web: http://www.hybrid-cluster.com/ Hybrid Web Cluster - cloud web hosting Phone: +447791750420