From owner-freebsd-chat  Wed Mar  3 15:57:41 1999
Delivered-To: freebsd-chat@freebsd.org
Received: from cheddar.netmonger.net (cheddar.netmonger.net [209.54.21.140])
	by hub.freebsd.org (Postfix) with ESMTP id 3B49D14C21
	for <freebsd-chat@FreeBSD.ORG>; Wed,  3 Mar 1999 15:57:38 -0800 (PST)
	(envelope-from chris@cheddar.netmonger.net)
Received: (from chris@localhost)
	by cheddar.netmonger.net (8.8.8/8.8.8) id SAA08217;
	Wed, 3 Mar 1999 18:57:21 -0500 (EST)
Message-ID: <19990303185721.A6776@netmonger.net>
Date: Wed, 3 Mar 1999 18:57:21 -0500
From: Christopher Masto <chris@netmonger.net>
To: freebsd-chat@FreeBSD.ORG
Subject: Re: The FreeBSD Dictionary
Mail-Followup-To: freebsd-chat@FreeBSD.ORG
References: <19990303045313.B1500@net> <Pine.BSF.3.91.990303021508.5542A-100000@dsinw.com> <19990303181853.B12020@net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-Mailer: Mutt 0.91.1i
In-Reply-To: <19990303181853.B12020@net>; from Rob on Wed, Mar 03, 1999 at 06:18:53PM -0500
Sender: owner-freebsd-chat@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

This being 1999, one could write a trivial web crawler which produces
a very large list of words.  One could require that such words appear
on at least X web pages, to try to weed some portion of the spelling
errors and obscure terms.  One could then remove the ones which
already appear in the dictionary, sort the remainder by frequency, and
manually examine them for possible inclusion in a dictionary
supplement.

It should be possible to do this in a way that results in a large
proportion of the commonly-used-but-not-in-the-current-dictionary
words being up front, so they can be added early.  Eventually, I think
it would be too much boring work to have to look up words like
skrjabingylus for someone who isn't being paid to do it.
-- 
Christopher Masto        Director of Operations      NetMonger Communications
chris@netmonger.net        info@netmonger.net        http://www.netmonger.net

Free yourself, free your machine, free the daemon -- http://www.freebsd.org/


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-chat" in the body of the message