From: Chuck Swiger
Organization: The Courts of Chaos
Date: Thu, 22 Sep 2005 22:42:21 -0400
To: Mikhail Teterin
Cc: current@freebsd.org
Subject: Re: using bzip2 to compress man-pages
Message-ID: <43336B8D.3010008@mac.com>
In-Reply-To: <200509222130.18284.mi+mx@aldan.algebra.com>
List-Id: Discussions about the use of FreeBSD-current

Mikhail Teterin wrote:
>> http://www.pkix.net/~chuck/manpage_test/
>
> Interesting. I did not realize, bzip2 is inferior to gzip on small files. It
> still wins overall, however -- the wins on large man-pages compensate for
> losses on the small ones. Your script does not show the total number of
> sectors in each case (patch attached).

Thanks, your patch has been committed. If nothing else, we now have lots of
data on just how bzip2 and gzip compare for tiny files....

[ ... ]
> 14919 of .gz can be turned into 14738 of .bz2
> That's 181 512-byte sectors or 92672 bytes. Not very much, but this is just
> the /usr/share/man. Considering the /usr/share/cat (with larger _formatted_
> files), plus the ports' man-pages, I still think bzip2 is beneficial.
>
> Assuming 1024-sized sectors, I get 8170 for .gz vs. 8067 for .bz2, or 105472
> bytes.

OK. This seems to be about a 1% difference, which is at least noticeable, and
even going in the right direction. (It'd be a lot closer without csh.1, ppp.8,
and lex.1, which have the honor of being the longest manpages present. If only
we had used the Y2K problem as a good excuse to remove CSH, that would have
been a perfect cover. :-)

> Reducing reliance on GNU software remains an extra bonus...

Certainly, there exists GNU software which makes me cringe, but gzip isn't
part of that group. I suppose there's a preference for BSD-licensed code
rather than GPL'ed code, but I'd expect gzip and zlib to be a part of FreeBSD
for the foreseeable future...

> Finally, the PR contains independent patches for both man(1) and the man-page
> compressing infrastructure. After 5-months wait, I'll settle for partial
> acceptance.

I don't mind the notion of supporting bzip'ed manpages. I'll even give a +1 to
the idea, not that I expect democracy to break out among the powers-that-be.

--
-Chuck
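
For anyone who wants to re-check the sector arithmetic quoted above, here is a minimal Python sketch. It is not the script from the manpage_test page, which is not reproduced in this message. The helper names (sectors, compare, savings) are made up for illustration, and compression level 9 is an assumption. The idea is only that each compressed file is rounded up to whole sectors before the totals are compared.

```python
import bz2
import gzip


def sectors(nbytes, sector_size=512):
    """Whole sectors needed to hold nbytes, rounded up."""
    return -(-nbytes // sector_size)


def compare(pages, sector_size=512):
    """Sum per-file sector usage for gzip and bzip2.

    pages is an iterable of bytes objects (uncompressed man-page contents).
    Returns (gzip_sectors, bzip2_sectors).
    """
    gz_total = bz_total = 0
    for data in pages:
        gz_total += sectors(len(gzip.compress(data, 9)), sector_size)
        bz_total += sectors(len(bz2.compress(data, 9)), sector_size)
    return gz_total, bz_total


def savings(gz_sectors, bz_sectors, sector_size):
    """Return (sectors saved, bytes saved, percent saved relative to gzip)."""
    saved = gz_sectors - bz_sectors
    return saved, saved * sector_size, 100.0 * saved / gz_sectors


# Totals quoted in the thread, for 512-byte and 1024-byte sectors.
print(savings(14919, 14738, 512))
print(savings(8170, 8067, 1024))
```

With the quoted totals this reproduces the 181 sectors (92672 bytes) and 105472 bytes mentioned above, which is a little over 1% in both cases. Feeding compare() the contents of real man pages would show the per-file effect, where a small page occupies one sector under either compressor.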