From owner-freebsd-hackers@FreeBSD.ORG Tue Aug 1 17:11:52 2006 Return-Path: X-Original-To: freebsd-hackers@freebsd.org Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 7483C16A4E0 for ; Tue, 1 Aug 2006 17:11:52 +0000 (UTC) (envelope-from rick@kiwi-computer.com) Received: from kiwi-computer.com (megan.kiwi-computer.com [63.224.10.3]) by mx1.FreeBSD.org (Postfix) with SMTP id 9D55E43D49 for ; Tue, 1 Aug 2006 17:11:51 +0000 (GMT) (envelope-from rick@kiwi-computer.com) Received: (qmail 3958 invoked by uid 2001); 1 Aug 2006 17:11:50 -0000 Date: Tue, 1 Aug 2006 12:11:50 -0500 From: "Rick C. Petty" To: Peter Jeremy Message-ID: <20060801171150.GB3413@megan.kiwi-computer.com> References: <200607271150.k6RBoM9p031745@lurza.secnetix.de> <44C8FB65.9020102@FreeBSD.org> <44CE03D2.2050803@centtech.com> <17614.4005.407223.621637@bhuda.mired.org> <44CE199C.2020500@centtech.com> <17614.8289.134373.387558@bhuda.mired.org> <96b30c400607310847s1d2f845eo212b234d03f51e9a@mail.gmail.com> <17614.10982.499561.139268@bhuda.mired.org> <20060801072611.GA717@turion.vk2pj.dyndns.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20060801072611.GA717@turion.vk2pj.dyndns.org> User-Agent: Mutt/1.4.2.1i Cc: freebsd-hackers@freebsd.org Subject: Re: [PATCH] adding two new options to 'cp' X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: rick-freebsd@kiwi-computer.com List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 01 Aug 2006 17:11:52 -0000 On Tue, Aug 01, 2006 at 05:26:11PM +1000, Peter Jeremy wrote: > On Mon, 2006-Jul-31 22:42:49 +0200, Ivan Voras wrote: > >I agree with this, and while you're in there, can you add -s to copy > >sparse files (via the usual "if the buffer is all nulls, seek beyond eof > >instead of writing" trick)? > > Note that it isn's possible to accurately distinguish between a block > of NULs and a hole in the file through the filesystem. The only way > to accurately copy a sparse file is with dump/restore. Sure it is-- in a number of ways. The most useful way is to do something of the sort: int sd, dd; // assume these are set to source & dest descriptors unsigned char* zeros; unsigned char* buffer; struct stat st; size_t bytes, offset; fstat(sd, &st); zeros = malloc(st.st_blksize); bzero(zeros, st.st_blksize); for (offset = 0; offset < st.st_size; offset += bytes) { bytes = st.st_blksize; if (offset + bytes > st.st_size) bytes = st.st_size - offset; read(sd, buffer, bytes); if (0 == memcmp(buffer, zeros, bytes)) lseek(dd, bytes, SEEK_CUR); else write(sd, buffer, bytes); } Obviously, I didn't add the error checking/handling, but AFAIK this is essentially what the -S option to gnu's tar does. In this example, you may not mimic the allocated blocks of a sparse file, but you would optimize the copy to use as few filesystem blocks as possible. -- Rick C. Petty