From owner-freebsd-hackers@FreeBSD.ORG Wed Jun 18 21:46:48 2008 Return-Path: Delivered-To: hackers@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BA9151065671; Wed, 18 Jun 2008 21:46:48 +0000 (UTC) (envelope-from scf@FreeBSD.org) Received: from mail.farley.org (farley.org [67.64.95.201]) by mx1.freebsd.org (Postfix) with ESMTP id 463FC8FC28; Wed, 18 Jun 2008 21:46:45 +0000 (UTC) (envelope-from scf@FreeBSD.org) Received: from thor.farley.org (HPooka@thor.farley.org [192.168.1.5]) by mail.farley.org (8.14.3/8.14.3) with ESMTP id m5IL8Mlr097004; Wed, 18 Jun 2008 16:08:22 -0500 (CDT) (envelope-from scf@FreeBSD.org) Date: Wed, 18 Jun 2008 16:08:22 -0500 (CDT) From: "Sean C. Farley" To: =?ISO-8859-15?Q?Dag-Erling_Sm=F8rgrav?= In-Reply-To: <86bq211rqw.fsf@ds4.des.no> Message-ID: References: <485453F2.60507@FreeBSD.org> <4854BC29.3060507@FreeBSD.org> <4854C96A.1080603@aueb.gr> <48556AAD.9010602@t-hosting.hu> <20080615212613.GA97326@nagual.pp.ru> <4855EDFE.3010708@FreeBSD.org> <86bq211rqw.fsf@ds4.des.no> User-Agent: Alpine 1.10 (BSF 962 2008-03-14) MIME-Version: 1.0 Content-Type: MULTIPART/MIXED; BOUNDARY="56599777-1701218234-1213823305=:70949" X-Spam-Status: No, score=-4.4 required=3.0 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on mail.farley.org Cc: Doug Barton , current@FreeBSD.org, Andrey Chernov , Diomidis Spinellis , hackers@FreeBSD.org, K?vesd?n G?bor , Max Khon Subject: Re: CFT: BSD-licensed grep [Fwd: cvs commit: ports/textproc/bsdgrep Makefile distinfo] X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 18 Jun 2008 21:46:48 -0000 This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. --56599777-1701218234-1213823305=:70949 Content-Type: TEXT/PLAIN; charset=utf-8; format=flowed Content-Transfer-Encoding: 8BIT On Mon, 16 Jun 2008, Dag-Erling Smørgrav wrote: > Doug Barton writes: >> Andrey Chernov writes: >>> Please note that BSD grep is not localized (and can't be per design) >>> and works only with standard C locale. It may not affect ports >>> system processing but shurely affects real texts handling. >> That is very troubling. In this day and age localization is a >> requirement. I cannot imagine being supportive of adding something to >> the base that does not have this capability. > > We don't have a locale-aware regex implementation. Henry Spencer wrote > one for Tcl 8, and it seems to be under an MIT-equivalent license, but > I'm not sure how hard it would be to extirpate. It might be easier to > lift it from PostgreSQL, which also uses it. Other BSD-license-friendly regex libraries: 1. PCRE (http://www.pcre.org/) (has a POSIX compliant interface too) 2. Oniguruma (http://www.geocities.jp/kosako3/oniguruma/) (from Ruby) 3. Lrexlib (http://lrexlib.luaforge.net/) (no apparent POSIX interface) Sean -- scf@FreeBSD.org --56599777-1701218234-1213823305=:70949--