Date: Tue, 26 Oct 2004 15:02:01 +0800 (CST) From: Cheng-Lung Sung <clsung@tw.freebsd.org> To: FreeBSD-gnats-submit@FreeBSD.org Subject: ports/73150: [NEW PORT] textproc/p5-Search-VectorSpace: A very basic vector-space search engine perl module Message-ID: <20041026070201.B2FDF106CC5@FreeBSD.csie.NCTU.edu.tw> Resent-Message-ID: <200410260710.i9Q7AScS007346@freefall.freebsd.org>
next in thread | raw e-mail | index | archive | help
>Number: 73150 >Category: ports >Synopsis: [NEW PORT] textproc/p5-Search-VectorSpace: A very basic vector-space search engine perl module >Confidential: no >Severity: non-critical >Priority: low >Responsible: freebsd-ports-bugs >State: open >Quarter: >Keywords: >Date-Required: >Class: change-request >Submitter-Id: current-users >Arrival-Date: Tue Oct 26 07:10:27 GMT 2004 >Closed-Date: >Last-Modified: >Originator: Cheng-Lung Sung >Release: FreeBSD 4.10-STABLE i386 >Organization: FreeBSD @ Taiwan >Environment: System: FreeBSD FreeBSD.csie.NCTU.edu.tw 4.10-STABLE FreeBSD 4.10-STABLE #0: Sat May 29 08:18:00 CST 2004 >Description: This module takes a list of documents (in English) and builds a simple in-memory search engine using a vector space model. Documents are stored as PDL objects, and after the initial indexing phase, the search should be very fast. This implementation applies a rudimentary stop list to filter out very common words, and uses a cosine measure to calculate document similarity. All documents above a user-configurable similarity threshold are returned. Author: Maciej Ceglowski <maciej AT ceglowski.com> WWW: http://search.cpan.org/dist/Search-VectorSpace/ Generated with FreeBSD Port Tools 0.63 >How-To-Repeat: >Fix: --- p5-Search-VectorSpace-0.02.shar begins here --- # This is a shell archive. Save it in a file, remove anything before # this line, and then unpack it by entering "sh file". Note, it may # create directories; files and directories will be owned by you and # have default permissions. # # This archive contains: # # p5-Search-VectorSpace # p5-Search-VectorSpace/pkg-descr # p5-Search-VectorSpace/Makefile # p5-Search-VectorSpace/pkg-plist # p5-Search-VectorSpace/distinfo # echo c - p5-Search-VectorSpace mkdir -p p5-Search-VectorSpace > /dev/null 2>&1 echo x - p5-Search-VectorSpace/pkg-descr sed 's/^X//' >p5-Search-VectorSpace/pkg-descr << 'END-of-p5-Search-VectorSpace/pkg-descr' XThis module takes a list of documents (in English) and Xbuilds a simple in-memory search engine using a vector Xspace model. Documents are stored as PDL objects, and Xafter the initial indexing phase, the search should be Xvery fast. This implementation applies a rudimentary Xstop list to filter out very common words, and uses a Xcosine measure to calculate document similarity. XAll documents above a user-configurable similarity Xthreshold are returned. X XAuthor: Maciej Ceglowski <maciej AT ceglowski.com> XWWW: http://search.cpan.org/dist/Search-VectorSpace/ END-of-p5-Search-VectorSpace/pkg-descr echo x - p5-Search-VectorSpace/Makefile sed 's/^X//' >p5-Search-VectorSpace/Makefile << 'END-of-p5-Search-VectorSpace/Makefile' X# New ports collection makefile for: p5-Search-VectorSpace X# Date created: 2004-10-26 X# Whom: clsung X# X# $FreeBSD$ X# X XPORTNAME= Search-VectorSpace XPORTVERSION= 0.02 XCATEGORIES= textproc perl5 XMASTER_SITES= ${MASTER_SITE_PERL_CPAN} XMASTER_SITE_SUBDIR= Search XPKGNAMEPREFIX= p5- X XMAINTAINER= clsung@FreeBSD.org XCOMMENT= A very basic vector-space search engine perl module X XBUILD_DEPENDS= ${SITE_PERL}/${PERL_ARCH}/PDL.pm:${PORTSDIR}/math/PDL \ X ${SITE_PERL}/Lingua/Stem.pm:${PORTSDIR}/textproc/p5-Lingua-Stem XRUN_DEPENDS= ${BUILD_DEPENDS} X XPERL_CONFIGURE= yes XMAN3= Search::VectorSpace.3 X X.include <bsd.port.pre.mk> X X.if ${PERL_LEVEL} < 500601 XIGNORE= This port requires perl 5.6.x or later. Install lang/perl5 then try again X.endif X X.include <bsd.port.post.mk> END-of-p5-Search-VectorSpace/Makefile echo x - p5-Search-VectorSpace/pkg-plist sed 's/^X//' >p5-Search-VectorSpace/pkg-plist << 'END-of-p5-Search-VectorSpace/pkg-plist' X@comment $FreeBSD$ X%%SITE_PERL%%/Search/VectorSpace.pm X%%SITE_PERL%%/%%PERL_ARCH%%/auto/Search/VectorSpace/.packlist X@dirrm %%SITE_PERL%%/%%PERL_ARCH%%/auto/Search/VectorSpace X@unexec rmdir %D/%%SITE_PERL%%/%%PERL_ARCH%%/auto/Search 2>/dev/null || true X@unexec rmdir %D/%%SITE_PERL%%/Search 2>/dev/null || true END-of-p5-Search-VectorSpace/pkg-plist echo x - p5-Search-VectorSpace/distinfo sed 's/^X//' >p5-Search-VectorSpace/distinfo << 'END-of-p5-Search-VectorSpace/distinfo' XMD5 (Search-VectorSpace-0.02.tar.gz) = adce4a8bded096c925226c5788888900 XSIZE (Search-VectorSpace-0.02.tar.gz) = 4251 END-of-p5-Search-VectorSpace/distinfo exit --- p5-Search-VectorSpace-0.02.shar ends here --- >Release-Note: >Audit-Trail: >Unformatted:
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20041026070201.B2FDF106CC5>