From: Tom Ivar Helbekkmo
To: The Hermit Hacker
Cc: Andrew McNaughton, Iani Brankov, pgsql-hackers@postgreSQL.org, database@FreeBSD.ORG
Subject: Re: [HACKERS] Re: Mysql 321 - Mysql 322 - msql
Date: 27 Nov 1998 14:25:00 +0100
In-Reply-To: The Hermit Hacker's message of "Fri, 27 Nov 1998 08:54:57 -0400 (AST)"
Message-ID: <86ogptmfo3.fsf@athene.nhh.no>

The Hermit Hacker writes:

> What do you mean by "fulltext searching"?

He's talking about inverted text indices, where text is indexed such
that a word is the key, and the index returns pointers to all the
places where that word occurs.  Knowledge of word structure is usually
built in, so that "hacks", "hacker", "hackers", "hacking" and so on
are known to be derivatives of "hack", and can match it if requested.
Noise words such as "a", "the" and so forth are usually not indexed.

Inverted indexed text storage tends to take up much space, but there
are ways to reduce this, and the best implementations do it remarkably
well.
A simple example: it is not really necessary to actually store the
original text; it can instead be a sequence of links to the store of
all individual words in the text database.

See http://glimpse.cs.arizona.edu/ for a powerful inverted indexing
engine and various related software.

-tih
-- 
Popularity is the hallmark of mediocrity.  --Niles Crane, "Frasier"
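
For anyone who wants to see the idea in code, here is a minimal sketch
of what the message describes: an inverted index keyed on word stems,
with noise words skipped, and with each text stored only as a sequence
of links (integer ids) into a shared word store. Everything named here
(TextStore, stem, STOP_WORDS, SUFFIXES) is made up for illustration; the
suffix stripping is a crude stand-in for real knowledge of word
structure, and none of this reflects how Glimpse or any particular
engine actually works.

```python
import re
from collections import defaultdict

# Assumed noise-word list; a real engine would use a much longer one.
STOP_WORDS = {"a", "an", "the", "and", "of", "to", "is", "are"}

# Crude suffix stripping, tried in this order. Illustrative only: it is
# not a linguistically sound stemmer and will mangle many words.
SUFFIXES = ("ing", "ers", "er", "s")


def stem(word):
    """Strip the first matching suffix, keeping at least 3 characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


class TextStore:
    def __init__(self):
        self.words = []                # word id -> word
        self.word_ids = {}             # word -> word id
        self.docs = {}                 # doc id -> list of word ids
        self.index = defaultdict(set)  # stem -> {(doc id, position)}

    def add(self, doc_id, text):
        """Store a text as word ids and index every non-noise word."""
        ids = []
        tokens = re.findall(r"[a-z0-9]+", text.lower())
        for pos, word in enumerate(tokens):
            wid = self.word_ids.setdefault(word, len(self.words))
            if wid == len(self.words):  # first time this word is seen
                self.words.append(word)
            ids.append(wid)
            if word not in STOP_WORDS:
                self.index[stem(word)].add((doc_id, pos))
        self.docs[doc_id] = ids

    def search(self, word):
        """Return sorted (doc id, position) pairs matching the stem."""
        return sorted(self.index.get(stem(word.lower()), ()))

    def reconstruct(self, doc_id):
        """Rebuild the text from word ids (case and punctuation are lost)."""
        return " ".join(self.words[w] for w in self.docs[doc_id])


store = TextStore()
store.add("msg1", "The hackers were hacking a parser.")
store.add("msg2", "A hack is not a hacker.")
print(store.search("hack"))
print(store.reconstruct("msg2"))
```

Note that the original text is never kept: reconstruct() rebuilds a
normalized version from the word ids alone, which is the space saving
hinted at above. A real implementation would also have to preserve
case, punctuation and whitespace, which this sketch throws away.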