From owner-freebsd-hackers Fri Aug 23 16:51:32 1996 Return-Path: owner-hackers Received: (from root@localhost) by freefall.freebsd.org (8.7.5/8.7.3) id QAA25647 for hackers-outgoing; Fri, 23 Aug 1996 16:51:32 -0700 (PDT) Received: from fallout.campusview.indiana.edu (fallout.campusview.indiana.edu [149.159.1.1]) by freefall.freebsd.org (8.7.5/8.7.3) with ESMTP id QAA25639 for ; Fri, 23 Aug 1996 16:51:27 -0700 (PDT) Received: from localhost (jfieber@localhost) by fallout.campusview.indiana.edu (8.7.5/8.7.3) with SMTP id SAA13578; Fri, 23 Aug 1996 18:51:08 -0500 (EST) Date: Fri, 23 Aug 1996 18:51:07 -0500 (EST) From: John Fieber To: "Jordan K. Hubbard" cc: Julian Elischer , hackers@freebsd.org Subject: Re: MAIL archive not archiving? In-Reply-To: <1933.840837613@time.cdrom.com> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-hackers@freebsd.org X-Loop: FreeBSD.org Precedence: bulk On Fri, 23 Aug 1996, Jordan K. Hubbard wrote: > What are the minimum resources you need for processing this kind of > data then? Give me some parameters to work with and I'll go sniffing > around. Basically, Iindex can index stuff in a reasonable amount of time *if* it can hold the entire index in memory. If it can't it has to index chunk at a time and merge the results. The merging is absolutely glacial. The raw text of the mail archive is about 250 megabytes. Freewais-sf is much faster at indexing, but the ranking of search results isn't as good. -john == jfieber@indiana.edu =========================================== == http://fallout.campusview.indiana.edu/~jfieber ================