From owner-freebsd-ports@FreeBSD.ORG  Mon Jul  2 04:32:15 2007
Return-Path: <owner-freebsd-ports@FreeBSD.ORG>
X-Original-To: ports@FreeBSD.org
Delivered-To: freebsd-ports@FreeBSD.ORG
Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52])
	by hub.freebsd.org (Postfix) with ESMTP id C126916A41F
	for <ports@FreeBSD.org>; Mon,  2 Jul 2007 04:32:15 +0000 (UTC)
	(envelope-from youshi10@u.washington.edu)
Received: from mxout4.cac.washington.edu (mxout4.cac.washington.edu
	[140.142.33.19])
	by mx1.freebsd.org (Postfix) with ESMTP id A200413C468
	for <ports@FreeBSD.org>; Mon,  2 Jul 2007 04:32:15 +0000 (UTC)
	(envelope-from youshi10@u.washington.edu)
Received: from smtp.washington.edu (smtp.washington.edu [140.142.33.9] (may be
	forged))
	by mxout4.cac.washington.edu (8.13.7+UW06.06/8.13.7+UW07.05) with ESMTP
	id l624WFEv021535
	(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK)
	for <ports@FreeBSD.org>; Sun, 1 Jul 2007 21:32:15 -0700
X-Auth-Received: from [192.168.10.45] (c-24-10-12-194.hsd1.ca.comcast.net
	[24.10.12.194]) (authenticated authid=youshi10)
	by smtp.washington.edu (8.13.7+UW06.06/8.13.7+UW07.03) with ESMTP id
	l624WE2O031045
	(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NOT)
	for <ports@FreeBSD.org>; Sun, 1 Jul 2007 21:32:15 -0700
Message-ID: <46887FD3.3080307@u.washington.edu>
Date: Sun, 01 Jul 2007 21:32:19 -0700
From: Garrett Cooper <youshi10@u.washington.edu>
User-Agent: Thunderbird 2.0.0.4 (Windows/20070604)
MIME-Version: 1.0
To: ports@FreeBSD.org
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
X-PMX-Version: 5.3.2.304607, Antispam-Engine: 2.5.1.298604,
	Antispam-Data: 2007.7.1.211533
X-Uwash-Spam: Gauge=IIIIIII, Probability=7%, Report='__C230066_P2 0, __CT 0,
	__CTE 0, __CT_TEXT_PLAIN 0, __HAS_MSGID 0, __MIME_TEXT_ONLY 0,
	__MIME_VERSION 0, __SANE_MSGID 0, __USER_AGENT 0'
Cc: 
Subject: +CONTENTS files
X-BeenThere: freebsd-ports@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Porting software to FreeBSD <freebsd-ports.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-ports>,
	<mailto:freebsd-ports-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-ports>
List-Post: <mailto:freebsd-ports@freebsd.org>
List-Help: <mailto:freebsd-ports-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-ports>,
	<mailto:freebsd-ports-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Mon, 02 Jul 2007 04:32:15 -0000

Pardon me for being naive, but wouldn't it be wiser for all of the data 
in the +CONTENTS file to be aggregated into sections instead of having 
line by line info?

Example (net/samba_3.0.25a):

@comment MD5:9e94560ac5e757d3bc5f922dcf3ab4fb
man/man1/log2pcap.1.gz
[~100 lines of repetitive data...]
@comment MD5:9f5fc8df2a1383a175e165ef2e0b10cc
man/man8/vfs_notify_fam.8.gz

    Could be aggregated into:

@MD5
9e94560ac5e757d3bc5f922dcf3ab4fb man/man1/log2pcap.1.gz
c58f068d603a12d4af867c15cf77e636 man/man1/nmblookup.1.gz
[etc..]
@end MD5

    or something similar to XML.

    This would reduce the filesize from n bytes to n - (9 + 4 -1) * 
i_entries + 8. In larger package files this would reduce the amount of 
data parsing by a long shot. Also, more powerful scripting languages 
like Perl, Python, or smart parsers in C could make short work of this 
data and just extract the MD5 elements for comparison.

    Also, by doing a little extra work when creating packages by 
organizing all the sections together, I think that the file size could 
be reduced by a large degree.

    Similar fields to @comment MD5 could be reduced I believe, but with 
less benefit maybe, other than just the @unexec rmdir, etc lines.

    Either that, or the data should be organized into separate files I 
think (increases number of files, but reduces overall processing time IMO).

Thanks,
-Garrett