From owner-freebsd-hackers Tue Mar 14 18:37:13 1995 Return-Path: hackers-owner Received: (from majordom@localhost) by freefall.cdrom.com (8.6.10/8.6.6) id SAA06177 for hackers-outgoing; Tue, 14 Mar 1995 18:37:13 -0800 Received: from hda.com (hda.com [199.232.40.182]) by freefall.cdrom.com (8.6.10/8.6.6) with ESMTP id SAA06171 for ; Tue, 14 Mar 1995 18:37:11 -0800 Received: (dufault@localhost) by hda.com (8.6.9/8.3) id VAA01047; Tue, 14 Mar 1995 21:27:18 -0500 From: Peter Dufault Message-Id: <199503150227.VAA01047@hda.com> Subject: Re: SCSI ASC-ASCQ descriptions To: joerg_wunsch@uriah.heep.sax.de Date: Tue, 14 Mar 1995 21:27:18 -0500 (EST) Cc: freebsd-hackers@FreeBSD.org In-Reply-To: <199503142355.AAA01220@uriah.heep.sax.de> from "J Wunsch" at Mar 15, 95 00:55:38 am X-Mailer: ELM [version 2.4 PL24] Content-Type: text Content-Length: 1123 Sender: hackers-owner@FreeBSD.org Precedence: bulk J Wunsch writes: > > As Peter Dufault wrote: > > > > > I'm really tempted to make a program to do this... :) > > > > Yes, I thought of that too. I even went through the effort of seeing > > how many unique words there are (about 300). > > > > If you had a clever way of finding "good overlap" I think you > > could cut the size in half or more. > > Well, in this case, even a rather simple compression scheme will do > it. Find the most common words, and -- since they consist only of > ASCII characters -- assign them ``abbrevations'' in the range of 0x80 > and up. > A good suggestion - but sort | uniq -c | tail -128 and a C program and I get (I have to code in C since I'm inadequate in perl): > Starting with 4958 bytes we will replace 3556 bytes with 1043 (129 > references, dictionary size 785 + 129 NULLS) with 1402 uncompressed > bytes remaining for 2445 bytes final total (49% compression). Still not worth the complication. Peter -- Peter Dufault Real Time Machine Control and Simulation HD Associates, Inc. Voice: 508 433 6936 dufault@hda.com Fax: 508 433 5267