From: Nate Williams
Date: Wed, 14 Mar 2001 15:19:30 -0700 (MST)
To: Dan Nelson
Cc: Dag-Erling Smorgrav, freebsd-arch@FreeBSD.ORG, Nate Williams
Subject: Re: [PATCH] add a SITE MD5 command to ftpd

> > > The checker can *easily* keep a list of file sizes and date stamps
> > > and compare that.
> >
> > Date stamps are useless.  They'd be different on different master
> > sites anyway.  File size is almost as bad, as two files can very
> > easily be totally different and still have the same size.
>
> But how often do port distfiles change, but keep their size?

Often enough that it's been seen.

> Pretty low, I'd say, at least compared to the number of times the size
> changes and the filename stays the same.

True, but now we'd have to modify every port to include both the MD5
*AND* filesize information in the port.  Right now the MD5 is part of
the port, so all the information necessary to do the SITE MD5 check is
already in the ports tree; only the ftp infrastructure is missing.

> Another thing to consider before adding SITE MD5 as a command is that
> it's an extremely slow operation.  md5'ing a 10MB file takes about
> 1/3rd of a second on my pIII/600.  It would take 5 minutes of CPU time
> to md5 1-gig worth of sources, and that's assuming that the FTP server
> is idle.  ftpd would have to cache the md5 checksum somewhere for it
> to be acceptable, and then you've got the same caching problem (how
> does ftpd know when the file has changed so it can update its cached
> md5?).

Is that cost greater than the cost of sending the data out over the
wire?


Nate