From owner-freebsd-database Mon Mar 30 02:59:49 1998
Return-Path:
Received: (from majordom@localhost)
        by hub.freebsd.org (8.8.8/8.8.8) id CAA13953
        for freebsd-database-outgoing; Mon, 30 Mar 1998 02:59:49 -0800 (PST)
        (envelope-from owner-freebsd-database@FreeBSD.ORG)
Received: from rah.star-gate.com (rah.star-gate.com [209.133.7.234])
        by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id CAA13947;
        Mon, 30 Mar 1998 02:59:46 -0800 (PST)
        (envelope-from hasty@rah.star-gate.com)
Received: from rah.star-gate.com (localhost.star-gate.com [127.0.0.1])
        by rah.star-gate.com (8.8.8/8.8.8) with ESMTP id CAA08389;
        Mon, 30 Mar 1998 02:59:06 -0800 (PST)
        (envelope-from hasty@rah.star-gate.com)
Message-Id: <199803301059.CAA08389@rah.star-gate.com>
X-Mailer: exmh version 2.0.2 2/24/98
To: Wolfram Schneider
cc: shimon@simon-shapiro.org, freebsd-database@FreeBSD.ORG,
        andreas@klemm.gtn.com, scrappy@hub.org, Satoshi Asami
Subject: Re: [PORTS] Pgaccess doesn't run on -current anymore, Update
In-reply-to: Your message of "Mon, 30 Mar 1998 12:31:30 +0200."
        <19980330123130.39177@caramba.cs.tu-berlin.de>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 30 Mar 1998 02:59:05 -0800
From: Amancio Hasty
Sender: owner-freebsd-database@FreeBSD.ORG
Precedence: bulk

And if people do a decent job they may be able to sell the project,
complete with OS and computer 8)

        Have Fun,
        Amancio

> On 1998-03-29 13:57:30 -0800, Simon Shapiro wrote:
> > We have been playing with the idea of normalizing the archive into an
> > RDBMS. Some of the benefits are:
> >
> > * no need to update the threads database. It will always be updated.
> > * Users can create, easily, their own thread logic with no impact on
> >   system performance.
> > * Searching on normalized fields is many times faster, and much less
> >   costly in system resources.
>
> Some figures ...
>
> The FreeBSD mailing list archive is 620MB large. There are currently
> 270,000 messages. The archive grows by 100,000 messages/year.
>
> If you plan to use a real SQL database, you should consider at least
> 500,000 data sets, better 1 million. You need 2GB for the raw E-Mails
> and 2-4GB for the index. I don't know if there are freely available
> databases which can handle this much data.
>
> That was the hardware part. You must hire a database expert, a Web
> designer and a cgi script programmer. All of them should be willing to
> work for at least 2-3 years on this project. This is not an easy task.
>
>
> A full update of the thread database took 6 min on hub (Pentium Pro),
> that's 100MB/min ;-) An update for the last week took 3-6 seconds.
>
> --
> Wolfram Schneider http://www.freebsd.org/~wosch/

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-database" in the body of the message
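
The sizing figures quoted above can be sanity-checked with a little
arithmetic. What follows is only a rough sketch in Python, assuming the
archive keeps growing linearly at the quoted 100,000 messages/year and
that the average message size stays constant. The function names are
made up for illustration and do not belong to any archive tooling.

```python
# Figures quoted in the message (as of March 1998).
ARCHIVE_MB = 620            # size of the mailing list archive
MESSAGES = 270_000          # messages currently archived
GROWTH_PER_YEAR = 100_000   # quoted growth rate


def avg_message_kb(archive_mb=ARCHIVE_MB, messages=MESSAGES):
    """Average size of one archived message in KB (1 MB = 1024 KB)."""
    return archive_mb * 1024 / messages


def years_until(target, messages=MESSAGES, growth=GROWTH_PER_YEAR):
    """Years until the archive holds `target` messages.

    Assumption: growth is linear at the quoted rate.
    """
    return max(0.0, (target - messages) / growth)


def raw_size_mb(target, archive_mb=ARCHIVE_MB, messages=MESSAGES):
    """Projected raw archive size in MB for `target` messages.

    Assumption: the average message size does not change.
    """
    return target * archive_mb / messages


if __name__ == "__main__":
    print(f"average message size: {avg_message_kb():.2f} KB")
    for target in (500_000, 1_000_000):
        print(f"{target:>9} messages: in {years_until(target):.1f} years, "
              f"about {raw_size_mb(target):.0f} MB raw")
```

Under these assumptions the 1 million message mark is roughly seven
years away and corresponds to a bit over 2GB of raw mail, which is in
line with the 2GB figure given for the raw E-Mails.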