Date:      Mon, 30 Mar 1998 17:20:33 -0500 (EST)
From:      John Fieber <jfieber@indiana.edu>
To:        Simon Shapiro <shimon@simon-shapiro.org>
Cc:        freebsd-database@FreeBSD.ORG
Subject:   RE: Mail indexing infrastructure
Message-ID:  <Pine.BSF.3.96.980330164024.8294C-100000@fallout.campusview.indiana.edu>
In-Reply-To: <XFMail.980330133652.shimon@simon-shapiro.org>

On Mon, 30 Mar 1998, Simon Shapiro wrote:

> > The current indexed archive goes back to 1994.
> 
> This is not an answer to my question :-)  Currently we are keeping 4 years. 
> Do we want to keep 40? 10? 5? Some (theoretical) limit has to be put.

Oh, I would say indefinitely until there is a compelling reason
to dump some.  The more we have, however, the more essential date
scoping becomes.  I think it is already becoming a bit of a
problem.

> If thread retrieval is based on Subject: line, an RDBMS is a trivially good
> solution.  One can even apply regex to the subject, limit dates, etc.

Good thread indexing is based on subjects, message-ids, dates and
content.  Quick-and-dirty thread retrieval is an easy RDBMS
problem; good thread retrieval is rather more complex.  For a
nice summary outline of threading methods and their performance,
see: 

  Lewis, David; Knowles, Kimberly (1997).  Threading Electronic
  Mail: A Preliminary Study.  Information Processing & Management,
  33(2):209-217.
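
To make the "quick-and-dirty" case concrete, here is a minimal sketch
of subject-based thread retrieval with date scoping, using SQLite from
Python.  The table layout, the function names and the sample messages
are all hypothetical, invented for illustration; nothing here reflects
how the existing archive is actually indexed.

```python
import re
import sqlite3

# Leading "Re:", "Fw:" and "Fwd:" markers, possibly repeated.
_PREFIX = re.compile(r'^(?:(?:re|fwd?)\s*:\s*)+', re.IGNORECASE)

def normalize_subject(subject):
    """Strip reply/forward prefixes, collapse whitespace, lowercase."""
    stripped = _PREFIX.sub('', subject.strip())
    return re.sub(r'\s+', ' ', stripped).lower()

def build_db(messages):
    """Load (msgid, sent, subject) tuples into an in-memory table.

    The schema is an assumption made for this sketch.
    """
    db = sqlite3.connect(':memory:')
    db.execute('CREATE TABLE msg (msgid TEXT PRIMARY KEY, sent TEXT, '
               'subject TEXT, norm_subject TEXT)')
    db.executemany('INSERT INTO msg VALUES (?, ?, ?, ?)',
                   [(m, d, s, normalize_subject(s)) for m, d, s in messages])
    return db

def thread_by_subject(db, subject, start, end):
    """Return message-ids sharing a normalized subject within [start, end]."""
    rows = db.execute(
        'SELECT msgid FROM msg '
        'WHERE norm_subject = ? AND sent BETWEEN ? AND ? '
        'ORDER BY sent, msgid',
        (normalize_subject(subject), start, end))
    return [row[0] for row in rows]

# Made-up sample data; dates are ISO strings so they sort correctly as text.
messages = [
    ('<a@example.org>', '1998-03-29', 'Mail indexing infrastructure'),
    ('<b@example.org>', '1998-03-30', 'RE: Mail indexing infrastructure'),
    ('<c@example.org>', '1998-03-30', 'Re: RE: Mail  indexing infrastructure'),
    ('<d@example.org>', '1996-01-15', 'Mail indexing infrastructure'),
    ('<e@example.org>', '1998-03-30', 'Postgres full text module'),
]
db = build_db(messages)
print(thread_by_subject(db, 'Mail indexing infrastructure',
                        '1998-01-01', '1998-12-31'))
```

With the sample data, the 1998 query returns messages a, b and c.
Message d has the same subject but is from 1996, so only the date
scope keeps it out of the thread.  A subject-only scheme also misses
replies whose subject was edited, which is where message-ids and
content come in.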

> If the current system is good and should only be augmented,
> rather than replaced, this is fine by me.

Let me re-phrase: most proposals to date do replacement without
preservation of what is good with the current system.  A
wholesale replacement WITH preservation of what is good would be
most welcome.  I'd be the first to jump up and down with glee to
find a viable alternative to freeWAIS for doing full text
searches with stemming, soundex matching, automatic term
weighting etc...  freeWAIS is a festering heap of bugs, but it
is the best the free software world has.  Postgres with a module
offering similar functionality would make me one happy camper. 
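
For readers unfamiliar with soundex matching, the sketch below shows
the classic American Soundex code in Python.  It illustrates the
general technique only; it is not taken from freeWAIS, and the
function name is an assumption made for this example.

```python
def soundex(word):
    """Return the four-character American Soundex code for a word."""
    codes = {}
    for group, digit in (('BFPV', '1'), ('CGJKQSXZ', '2'), ('DT', '3'),
                         ('L', '4'), ('MN', '5'), ('R', '6')):
        for ch in group:
            codes[ch] = digit

    # Keep ASCII letters only; anything else is ignored.
    chars = [c for c in word.upper() if 'A' <= c <= 'Z']
    if not chars:
        return ''

    result = chars[0]
    prev = codes.get(chars[0], '')
    for ch in chars[1:]:
        if ch in 'HW':
            continue  # H and W do not separate letters with the same code
        code = codes.get(ch, '')
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code counts again
    return (result + '000')[:4]

print(soundex('Robert'), soundex('Rupert'))  # both print R163
```

Names that sound alike map to the same code, so a search for one
spelling can also retrieve the other.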

-john




