From owner-freebsd-isp Fri Oct 9 14:20:54 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id OAA23186 for freebsd-isp-outgoing; Fri, 9 Oct 1998 14:20:54 -0700 (PDT) (envelope-from owner-freebsd-isp@FreeBSD.ORG) Received: from homer.talcom.net ([209.5.1.4]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id OAA23159 for ; Fri, 9 Oct 1998 14:20:45 -0700 (PDT) (envelope-from leo@homer.talcom.net) Received: (from leo@localhost) by homer.talcom.net (8.8.8/8.8.5) id RAA20733; Fri, 9 Oct 1998 17:22:17 -0400 (EDT) Message-ID: <19981009172217.33716@talcom.net> Date: Fri, 9 Oct 1998 17:22:17 -0400 From: Leo Papandreou To: isp@FreeBSD.ORG Subject: Re: web site search engines for FreeBSD server? References: <361D1C8A.A8CCDE4E@partitur.se> <361DBFAD.4321@echidna.com> <19981009015100.A1730@mu.org> <361E3888.1303@echidna.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 0.89i In-Reply-To: <361E3888.1303@echidna.com>; from Graeme Tait on Fri, Oct 09, 1998 at 09:23:36AM -0700 X-No-Archive: Yes X-Organization: Not very, no. X-Wife: Forgotten but not gone. Sender: owner-freebsd-isp@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org On Fri, Oct 09, 1998 at 09:23:36AM -0700, Graeme Tait wrote: > Paul Saab wrote: > > > > have you looked at ht://dig? (http://www.htdig.org/) > > > No, and it may be interesting for some things. Does it run under FreeBSD? Yes, it runs well and is superior to anything i've seen for modest sites. Nowhere near as complicated as a Harvest system, either. > > However, some of my applications may not fit it. They involve indexing large numbers > of files (hundreds of thousands). All the files reside on the machine that does the > indexing. Glimpse does a good job of this, building compact indices from the Configured for "fuzzy" (spelling, synonyms, endings, soundex, metaphone) searches, htdig is considerably slower and less powerful than a harvest + glimpse combination for LARGE sites. If your site is as large as you say it is, people will routinely be hitting their browser's stop button. Those that dont will often get timed out. However, it's a snap to set up so you may want to try it for yourself. You can spread out multiple copies of it to service different sections of your site, perhaps. To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-isp" in the body of the message