From owner-freebsd-ports-bugs@FreeBSD.ORG Fri Jul 21 14:50:25 2006 Return-Path: X-Original-To: freebsd-ports-bugs@hub.freebsd.org Delivered-To: freebsd-ports-bugs@hub.freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 253D416A4DD for ; Fri, 21 Jul 2006 14:50:25 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [216.136.204.21]) by mx1.FreeBSD.org (Postfix) with ESMTP id CAB2043D70 for ; Fri, 21 Jul 2006 14:50:13 +0000 (GMT) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.13.4/8.13.4) with ESMTP id k6LEoDfY042746 for ; Fri, 21 Jul 2006 14:50:13 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.13.4/8.13.4/Submit) id k6LEoDvC042745; Fri, 21 Jul 2006 14:50:13 GMT (envelope-from gnats) Resent-Date: Fri, 21 Jul 2006 14:50:13 GMT Resent-Message-Id: <200607211450.k6LEoDvC042745@freefall.freebsd.org> Resent-From: FreeBSD-gnats-submit@FreeBSD.org (GNATS Filer) Resent-To: freebsd-ports-bugs@FreeBSD.org Resent-Reply-To: FreeBSD-gnats-submit@FreeBSD.org, Jin-Shan Tseng Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 925F416A4E1 for ; Fri, 21 Jul 2006 14:40:57 +0000 (UTC) (envelope-from tjs@Pluto.CDPA.nsysu.edu.tw) Received: from Pluto.CDPA.nsysu.edu.tw (Pluto.CDPA.nsysu.edu.tw [140.117.205.30]) by mx1.FreeBSD.org (Postfix) with ESMTP id A4DA043D45 for ; Fri, 21 Jul 2006 14:40:56 +0000 (GMT) (envelope-from tjs@Pluto.CDPA.nsysu.edu.tw) Received: from Pluto.CDPA.nsysu.edu.tw (localhost [127.0.0.1]) by Pluto.CDPA.nsysu.edu.tw (8.13.6/8.13.6) with ESMTP id k6LEeq8w036622 for ; Fri, 21 Jul 2006 22:40:52 +0800 (CST) (envelope-from tjs@Pluto.CDPA.nsysu.edu.tw) Received: (from root@localhost) by Pluto.CDPA.nsysu.edu.tw (8.13.6/8.13.6/Submit) id k6LEepoL036621; Fri, 21 Jul 2006 22:40:51 +0800 (CST) (envelope-from tjs) Message-Id: <200607211440.k6LEepoL036621@Pluto.CDPA.nsysu.edu.tw> Date: Fri, 21 Jul 2006 22:40:51 +0800 (CST) From: tjs To: FreeBSD-gnats-submit@FreeBSD.org X-Send-Pr-Version: 3.113 Cc: Subject: ports/100675: New port: textproc/p5-Lingua-EN-NamedEntity Basic Named Entity Extraction algorithm X-BeenThere: freebsd-ports-bugs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Jin-Shan Tseng List-Id: Ports bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 21 Jul 2006 14:50:25 -0000 >Number: 100675 >Category: ports >Synopsis: New port: textproc/p5-Lingua-EN-NamedEntity Basic Named Entity Extraction algorithm >Confidential: no >Severity: non-critical >Priority: low >Responsible: freebsd-ports-bugs >State: open >Quarter: >Keywords: >Date-Required: >Class: update >Submitter-Id: current-users >Arrival-Date: Fri Jul 21 14:50:13 GMT 2006 >Closed-Date: >Last-Modified: >Originator: Jin-Shan Tseng >Release: FreeBSD 6.1-STABLE i386 >Organization: NCHC >Environment: System: FreeBSD Pluto.CDPA.nsysu.edu.tw 6.1-STABLE FreeBSD 6.1-STABLE #0: Mon Jul 17 12:45:19 CST 2006 tjs@Pluto.CDPA.nsysu.edu.tw:/usr/obj/usr/src/sys/Pluto i386 >Description: "Named entities" is the NLP jargon for proper nouns which represent people, places, organisations, and so on. This module provides a very simple way of extracting these from a text. If we run the "extract_entities" routine on a piece of news coverage of recent UK political events, we should expect to see it return a list of hash references looking like this: { entity => 'Mr Howard', class => 'person', scores => { ... }, }, { entity => 'Ministry of Defence', class => 'organisation', ... }, { entity => 'Oxfordshire', class => 'place', ... }, The additional "scores" hash reference in there breaks down the various possible classes for this entity in an open-ended scale. WWW: http://search.cpan.org/dist/Lingua-EN-NamedEntity/ >How-To-Repeat: >Fix: --- p5-Lingua-EN-NamedEntity.shar begins here --- # This is a shell archive. Save it in a file, remove anything before # this line, and then unpack it by entering "sh file". Note, it may # create directories; files and directories will be owned by you and # have default permissions. # # This archive contains: # # p5-Lingua-EN-NamedEntity # p5-Lingua-EN-NamedEntity/Makefile # p5-Lingua-EN-NamedEntity/distinfo # p5-Lingua-EN-NamedEntity/pkg-descr # p5-Lingua-EN-NamedEntity/pkg-plist # echo c - p5-Lingua-EN-NamedEntity mkdir -p p5-Lingua-EN-NamedEntity > /dev/null 2>&1 echo x - p5-Lingua-EN-NamedEntity/Makefile sed 's/^X//' >p5-Lingua-EN-NamedEntity/Makefile << 'END-of-p5-Lingua-EN-NamedEntity/Makefile' X# New ports collection makefile for: p5-Lingua-EN-NamedEntity X# Date created: 2006-07-21 X# Whom: Jin-Shan Tseng X# X# $FreeBSD$ X# X XPORTNAME= Lingua-EN-NamedEntity XPORTVERSION= 1.7 XCATEGORIES= textproc perl5 XMASTER_SITES= ${MASTER_SITE_PERL_CPAN} XMASTER_SITE_SUBDIR= Lingua XPKGNAMEPREFIX= p5- X XMAINTAINER= tjs@cdpa.nsysu.edu.tw XCOMMENT= Basic Named Entity Extraction algorithm X XBUILD_DEPENDS= ${SITE_PERL}/${PERL_ARCH}/Compress/Zlib.pm:${PORTSDIR}/archivers/p5-Compress-Zlib \ X ${SITE_PERL}/Lingua/Stem/En.pm:${PORTSDIR}/textproc/p5-Lingua-Stem \ X ${SITE_PERL}/LWP/Simple.pm:${PORTSDIR}/www/p5-libwww XRUN_DEPENDS= ${BUILD_DEPENDS} X XPERL_CONFIGURE= yes X XMAN3= Lingua::EN::NamedEntity.3 X X.include X X.if ${PERL_LEVEL} < 500504 XIGNORE= requires perl 5.5.4 or later. Install lang/perl5.8 and try again X.endif X X.include END-of-p5-Lingua-EN-NamedEntity/Makefile echo x - p5-Lingua-EN-NamedEntity/distinfo sed 's/^X//' >p5-Lingua-EN-NamedEntity/distinfo << 'END-of-p5-Lingua-EN-NamedEntity/distinfo' XMD5 (Lingua-EN-NamedEntity-1.7.tar.gz) = 47cb8958c96e6d3d0394d8740d5617e0 XSHA256 (Lingua-EN-NamedEntity-1.7.tar.gz) = 9f561b59d5a2bd983abaefdd4d5081a40af2d0801ffa60c95a9df836aacf5a04 XSIZE (Lingua-EN-NamedEntity-1.7.tar.gz) = 21176 END-of-p5-Lingua-EN-NamedEntity/distinfo echo x - p5-Lingua-EN-NamedEntity/pkg-descr sed 's/^X//' >p5-Lingua-EN-NamedEntity/pkg-descr << 'END-of-p5-Lingua-EN-NamedEntity/pkg-descr' X"Named entities" is the NLP jargon for proper nouns which Xrepresent people, places, organisations, and so on. X XThis module provides a very simple way of extracting these from a text. XIf we run the "extract_entities" routine on a piece of news coverage of Xrecent UK political events, we should expect to see it return a list of Xhash references looking like this: X X{ entity => 'Mr Howard', class => 'person', scores => { ... }, }, X{ entity => 'Ministry of Defence', class => 'organisation', ... }, X{ entity => 'Oxfordshire', class => 'place', ... }, X XThe additional "scores" hash reference in there breaks down the various Xpossible classes for this entity in an open-ended scale. X X XWWW: http://search.cpan.org/dist/Lingua-EN-NamedEntity/ END-of-p5-Lingua-EN-NamedEntity/pkg-descr echo x - p5-Lingua-EN-NamedEntity/pkg-plist sed 's/^X//' >p5-Lingua-EN-NamedEntity/pkg-plist << 'END-of-p5-Lingua-EN-NamedEntity/pkg-plist' X%%SITE_PERL%%/Lingua/EN/NamedEntity.pm X%%SITE_PERL%%/Lingua/EN/NamedEntity/forename X%%SITE_PERL%%/Lingua/EN/NamedEntity/wordlist X%%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua/EN/NamedEntity/.packlist X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua/EN/NamedEntity X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua/EN X@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Lingua X@dirrmtry %%SITE_PERL%%/Lingua/EN/NamedEntity X@dirrmtry %%SITE_PERL%%/Lingua/EN X@dirrmtry %%SITE_PERL%%/Lingua END-of-p5-Lingua-EN-NamedEntity/pkg-plist exit --- p5-Lingua-EN-NamedEntity.shar ends here --- >Release-Note: >Audit-Trail: >Unformatted: