Date: Mon, 25 Sep 2006 02:04:52 +0900 (JST)
From: markun@onohara.to
To: FreeBSD-gnats-submit@FreeBSD.org
Cc: markun@onohara.to
Subject: ports/103571: [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a document's language
Message-ID: <20060924170452.2B4FA2A429@gate.onohara.to>
Resent-Message-ID: <200609241710.k8OHAQdN081869@freefall.freebsd.org>

>Number:         103571
>Category:       ports
>Synopsis:       [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a document's language
>Confidential:   no
>Severity:       non-critical
>Priority:       low
>Responsible:    freebsd-ports-bugs
>State:          open
>Quarter:
>Keywords:
>Date-Required:
>Class:          change-request
>Submitter-Id:   current-users
>Arrival-Date:   Sun Sep 24 17:10:25 GMT 2006
>Closed-Date:
>Last-Modified:
>Originator:     Masahiro Teramoto <markun@onohara.to>
>Release:        FreeBSD 4.11-RELEASE-p21 i386
>Organization:
>Environment:
System: FreeBSD gate.onohara.to 4.11-RELEASE-p21 FreeBSD 4.11-RELEASE-p21 #0: Thu Sep 7 22:37:27 JST 2006 root@gate.onohara.to:/usr/obj/usr/src/sys/RENTALv6 i386
>Description:
Text::Language::Guess guesses a document's language. Its implementation
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
it determines how many of the known stopwords the document contains for
each language supported by "Lingua::StopWords".

Each word in the document recognized as stopword of a particular
language scores one point for this language.

The "language_guess()" function takes a document as a parameter and
returns the abbreviation of the language that it is most likely written
in.

Author: Mike Schilli <cpan@perlmeister.com>
WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
>How-To-Repeat:
>Fix:

# This is a shell archive.  Save it in a file, remove anything before
# this line, and then unpack it by entering "sh file".  Note, it may
# create directories; files and directories will be owned by you and
# have default permissions.
#
# This archive contains:
#
#	p5-Text-Language-Guess
#	p5-Text-Language-Guess/Makefile
#	p5-Text-Language-Guess/distinfo
#	p5-Text-Language-Guess/pkg-descr
#	p5-Text-Language-Guess/pkg-plist
#
echo c - p5-Text-Language-Guess
mkdir -p p5-Text-Language-Guess > /dev/null 2>&1
echo x - p5-Text-Language-Guess/Makefile
sed 's/^X//' >p5-Text-Language-Guess/Makefile << 'END-of-p5-Text-Language-Guess/Makefile'
X# New ports collection makefile for: textproc/p5-Text-Language-Guess
X# Date created: 2006-09-25
X# Whom: Masahiro Teramoto <markun@onohara.to>
X#
X# $FreeBSD$
X#
X
XPORTNAME= Text-Language-Guess
XPORTVERSION= 0.02
XCATEGORIES= textproc perl5
XMASTER_SITES= ${MASTER_SITE_PERL_CPAN}
XMASTER_SITE_SUBDIR= ../../authors/id/M/MS/MSCHILLI
XPKGNAMEPREFIX= p5-
X
XMAINTAINER= markun@onohara.to
XCOMMENT= Trained module to guess a document's language
X
XBUILD_DEPENDS= p5-Log-Log4perl>=1.0:${PORTSDIR}/devel/p5-Log-Log4perl \
X	p5-Text-ExtractWords>=0:${PORTSDIR}/textproc/p5-Text-ExtractWords \
X	p5-Lingua-StopWords>=0:${PORTSDIR}/textproc/p5-Lingua-StopWords
XRUN_DEPENDS= ${BUILD_DEPENDS}
X
XPERL_CONFIGURE= yes
X
XMAN1= language-guess.1
XMAN3= Text::Language::Guess.3
X
X.include <bsd.port.pre.mk>
X.include <bsd.port.post.mk>
END-of-p5-Text-Language-Guess/Makefile
echo x - p5-Text-Language-Guess/distinfo
sed 's/^X//' >p5-Text-Language-Guess/distinfo << 'END-of-p5-Text-Language-Guess/distinfo'
XMD5 (Text-Language-Guess-0.02.tar.gz) = 66fbb68b17c3e62febbba633111f852e
XSHA256 (Text-Language-Guess-0.02.tar.gz) = 12ef612c1de0451367d403db73723446b836e2e10adeec5e9386b7baa8ede12f
XSIZE (Text-Language-Guess-0.02.tar.gz) = 5377
END-of-p5-Text-Language-Guess/distinfo
echo x - p5-Text-Language-Guess/pkg-descr
sed 's/^X//' >p5-Text-Language-Guess/pkg-descr << 'END-of-p5-Text-Language-Guess/pkg-descr'
XText::Language::Guess guesses a document's language. Its implementation
Xis simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
Xit determines how many of the known stopwords the document contains for
Xeach language supported by "Lingua::StopWords".
X
XEach word in the document recognized as stopword of a particular
Xlanguage scores one point for this language.
X
XThe "language_guess()" function takes a document as a parameter and
Xreturns the abbreviation of the language that it is most likely written
Xin.
X
XAuthor: Mike Schilli <cpan@perlmeister.com>
XWWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
END-of-p5-Text-Language-Guess/pkg-descr
echo x - p5-Text-Language-Guess/pkg-plist
sed 's/^X//' >p5-Text-Language-Guess/pkg-plist << 'END-of-p5-Text-Language-Guess/pkg-plist'
X@comment $FreeBSD$
Xbin/language-guess
X%%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language/Guess/.packlist
X%%SITE_PERL%%/Text/Language/Guess.pm
X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language/Guess
X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language
X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text
X@dirrmtry %%SITE_PERL%%/Text/Language
X@dirrmtry %%SITE_PERL%%/Text
END-of-p5-Text-Language-Guess/pkg-plist
exit

>Release-Note:
>Audit-Trail:
>Unformatted:
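
The scoring scheme given under >Description (one point per word that is a
known stopword of a language, highest score wins) can be illustrated with a
rough sketch. This is Python, not the Perl module itself, and it does not use
Text::ExtractWords or Lingua::StopWords. The STOPWORDS table and the function
names score_languages() and guess_language() are hypothetical and exist only
for this illustration. The stopword lists are tiny hand-picked samples.

```python
import re
from collections import Counter

# Hypothetical, heavily abbreviated stopword lists (illustration only;
# the real module takes its lists from Lingua::StopWords).
STOPWORDS = {
    "en": {"the", "and", "is", "of", "to", "in", "it"},
    "de": {"der", "die", "das", "und", "ist", "nicht", "ein"},
    "fr": {"le", "la", "les", "et", "est", "un", "une"},
}


def score_languages(text):
    """Return a Counter mapping language code -> number of stopword hits."""
    # Split the lowercased text into runs of letters.
    words = re.findall(r"[^\W\d_]+", text.lower())
    scores = Counter()
    for word in words:
        for lang, stops in STOPWORDS.items():
            if word in stops:
                # Each recognized stopword scores one point for its language.
                scores[lang] += 1
    return scores


def guess_language(text):
    """Return the code of the highest-scoring language, or None if no hits."""
    scores = score_languages(text)
    if not scores:
        return None
    return max(scores, key=scores.get)
```

With these sample lists, guess_language("the cat is in the house") picks "en"
because four of the six words are English stopwords. Very short documents or
ones with few stopwords may score zero everywhere, which this sketch reports
as None.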