From owner-freebsd-database Mon Mar 30 07:41:38 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id HAA21490 for freebsd-database-outgoing; Mon, 30 Mar 1998 07:41:38 -0800 (PST) (envelope-from owner-freebsd-database@FreeBSD.ORG) Received: from mail.cs.tu-berlin.de (root@mail.cs.tu-berlin.de [130.149.17.13]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id HAA21417; Mon, 30 Mar 1998 07:41:11 -0800 (PST) (envelope-from wosch@cs.tu-berlin.de) Received: from caramba.cs.tu-berlin.de (wosch@caramba.cs.tu-berlin.de [130.149.17.12]) by mail.cs.tu-berlin.de (8.8.8/8.8.8) with ESMTP id RAA03617; Mon, 30 Mar 1998 17:34:11 +0200 (MET DST) Received: (from wosch@localhost) by caramba.cs.tu-berlin.de (8.8.8/8.8.8) id RAA25384; Mon, 30 Mar 1998 17:33:58 +0200 (MET DST) Message-ID: <19980330173358.57866@caramba.cs.tu-berlin.de> Date: Mon, 30 Mar 1998 17:33:58 +0200 From: Wolfram Schneider To: John Fieber , nik@iii.co.uk Cc: shimon@simon-shapiro.org, Wolfram Schneider , freebsd-database@FreeBSD.ORG, andreas@klemm.gtn.com, scrappy@hub.org, Satoshi Asami , Amancio Hasty Subject: Re: Mailing list search interface References: <19980330110200.17368@iii.co.uk> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii In-Reply-To: ; from John Fieber on Mon, Mar 30, 1998 at 09:48:45AM -0500 Sender: owner-freebsd-database@FreeBSD.ORG Precedence: bulk On 1998-03-30 09:48:45 -0500, John Fieber wrote: > > I mentioned MHonArc to Jordan, and his first response was > > > > > Eeek! The evil MHonArc resurfaces! ;-) > > > > > > It doesn't scale at all well - just try MHonArc'ing a really big mailing > > > list archive. You soon get a set of monster html files that are > > > essentially unusable - I know, I did the short-lived "FreeBSD Docs" > > > CD for awhile using MHonArc. > > Listen to the man! He knows what he is talking about...well, in > this case at least. :) Agreed. > Though I have no first-hand proof, knowing how Glimpse works, I > suspect searches will generate quite a bit more disk I/O on the > server than freeWAIS. There is a technical report about glimpse, 10 pages. I strongly recommend to read this paper before using glimpse in real word applications! ftp://ftp.cs.arizona.edu/glimpse/glimpse.ps.Z Basically, glimpse does a linear full text search like grep. Searching 400MB E-Mails will take twice the time (for CPU *and* disk I/O) as seaching in 200MB. Glimpse does not scale by design. In best case glimpse is 256 x faster than grep, in worst case it is slow as grep. > And on and on... I think it is time to add an FAQ entry on why > we don't use hypermail or MHonArc for the mailing list archives. ;-) -- Wolfram Schneider http://www.freebsd.org/~wosch/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-database" in the body of the message