From owner-freebsd-chat Wed Mar 3 15:57:41 1999 Delivered-To: freebsd-chat@freebsd.org Received: from cheddar.netmonger.net (cheddar.netmonger.net [209.54.21.140]) by hub.freebsd.org (Postfix) with ESMTP id 3B49D14C21 for ; Wed, 3 Mar 1999 15:57:38 -0800 (PST) (envelope-from chris@cheddar.netmonger.net) Received: (from chris@localhost) by cheddar.netmonger.net (8.8.8/8.8.8) id SAA08217; Wed, 3 Mar 1999 18:57:21 -0500 (EST) Message-ID: <19990303185721.A6776@netmonger.net> Date: Wed, 3 Mar 1999 18:57:21 -0500 From: Christopher Masto To: freebsd-chat@FreeBSD.ORG Subject: Re: The FreeBSD Dictionary Mail-Followup-To: freebsd-chat@FreeBSD.ORG References: <19990303045313.B1500@net> <19990303181853.B12020@net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 0.91.1i In-Reply-To: <19990303181853.B12020@net>; from Rob on Wed, Mar 03, 1999 at 06:18:53PM -0500 Sender: owner-freebsd-chat@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org This being 1999, one could write a trivial web crawler which produces a very large list of words. One could require that such words appear on at least X web pages, to try to weed some portion of the spelling errors and obscure terms. One could then remove the ones which already appear in the dictionary, sort the remainder by frequency, and manually examine them for possible inclusion in a dictionary supplement. It should be possible to do this in a way that results in a large proportion of the commonly-used-but-not-in-the-current-dictionary words being up front, so they can be added early. Eventually, I think it would be too much boring work to have to look up words like skrjabingylus for someone who isn't being paid to do it. -- Christopher Masto Director of Operations NetMonger Communications chris@netmonger.net info@netmonger.net http://www.netmonger.net Free yourself, free your machine, free the daemon -- http://www.freebsd.org/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-chat" in the body of the message