From: Andrew Hamilton-Wright
Date: Sun, 1 Feb 2009 08:59:02 -0400 (AST)
To: freebsd-questions@freebsd.org
Subject: dump(8) using snapshot + "recommended" cache

Hi All;

I regularly use dump(8) with snapshots to back up my server. While this seems to have been working perfectly well so far, upon (re)reading the man page for dump(8), I have noticed a somewhat scary pair of lines in the paragraph describing the option for -C (emphasis with stars mine):

    [Use of this option] will greatly improve performance at the cost
    of ***dump possibly not noticing changes in the file system***
    between passes. ***It is recommended that you always use this
    option when dumping a snapshot.***

Does anyone know what, exactly, this means?
In particular, is the first statement actually trying to say:

    Use of this option will greatly improve performance; however,
    changes made to the filesystem between _dump_ passes may be
    ignored. ***The resulting dumped filesystem image will be
    consistent and correct based on a timestamp no later than that of
    the point at which the dump was started***.

Is this a fair statement? Is this guaranteed?

Or are we trying to say that:

    The resulting filesystem will contain images of individual files
    based on a timestamp no later than that of the point at which the
    dump was started; however, any individual file modified after the
    dump begins may be stored using any of the versions that appeared
    written to disk during the period of the dump.

As far as the second line goes, I am not at all clear on what this is trying to say. Why is the cache recommended? For speed? Stability? Output correctness? If a snapshot dump is made without a cache option, is it potentially corrupt?

If the second reading above is closer to the truth than the first, it seems to me that we should _not_ recommend the use of a cache with snapshots, as it seems to erode the utility of the snapshot itself. It is for this reason that I suspect there is more here than meets the eye, which is why I am keen to make sure that this is clear.

I am very happy to put in an update to the docs if we can make sure that we know exactly what we are trying to say here.

Thanks,
Andrew.