From owner-freebsd-arch@FreeBSD.ORG Sat Sep 18 00:28:56 2004 Return-Path: Delivered-To: freebsd-arch@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 978FE16A4CE; Sat, 18 Sep 2004 00:28:56 +0000 (GMT) Received: from smtp02.syd.iprimus.net.au (smtp02.syd.iprimus.net.au [210.50.76.196]) by mx1.FreeBSD.org (Postfix) with ESMTP id 3D72B43D49; Sat, 18 Sep 2004 00:28:56 +0000 (GMT) (envelope-from tim@robbins.dropbear.id.au) Received: from robbins.dropbear.id.au (210.50.248.230) by smtp02.syd.iprimus.net.au (7.0.031.3) id 4148EDE5000C8E74; Sat, 18 Sep 2004 10:28:49 +1000 Received: by robbins.dropbear.id.au (Postfix, from userid 1000) id C45B04252; Sat, 18 Sep 2004 10:29:37 +1000 (EST) Date: Sat, 18 Sep 2004 10:29:37 +1000 From: Tim Robbins To: Brian Fundakowski Feldman Message-ID: <20040918002937.GA4944@cat.robbins.dropbear.id.au> References: <16715.4611.108597.354107@piglet.timing.com> <20040917.130549.22012205.imp@bsdimp.com> <20040917191240.GR36708@green.homeunix.org> <20040917200345.GT36708@green.homeunix.org> <2FE38EA4-08FE-11D9-A03E-000A95C705DC@speakeasy.net> <20040917233507.GW36708@green.homeunix.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20040917233507.GW36708@green.homeunix.org> User-Agent: Mutt/1.4.1i cc: Dag-Erling Sm?rgrav cc: Sean Chittenden cc: freebsd-arch@freebsd.org Subject: Re: diff(1) X-BeenThere: freebsd-arch@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussion related to FreeBSD architecture List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 18 Sep 2004 00:28:56 -0000 On Fri, Sep 17, 2004 at 07:35:07PM -0400, Brian Fundakowski Feldman wrote: > On Fri, Sep 17, 2004 at 04:06:20PM -0700, Sean Chittenden wrote: > > >I don't know how much they've changed it, but I do know that it still > > >uses whichever regexp engine you happen to have in libc. In our case, > > >that means good old Henry Spencer. Last I talked to him, he was going > > >to release a new, improved, and much faster regexp engine, but that > > >was years ago and I still haven't seen anything come out of it. > > > > Actually, that's not quite correct. Spencer's latest regexp(3) was > > integrated into PostgreSQL 7.4 to provide wide-byte regexp support > > (released on 2003-11-17). Last I checked, it was being used regularly > > and with good success as it's gotten a good ten months of production > > use. :) > > > > From the PostgreSQL 7.4 release notes: > > > > "Faster and more powerful regular expression code > > > > "The entire regular expression module has been replaced with a new > > version by Henry Spencer, originally written for Tcl. The code greatly > > improves performance and supports several flavors of regular > > expressions." > > I'll certainly have to try it out and see whether or not it's faster > than Oniguruma (http://www.geocities.jp/kosako1/oniguruma/). Either > one should be more than suitable for libc if it's got enough of a > performance improvement. Oniguruma is not suitable because it does not have proper locale support -- it uses its own character type tables instead of using , and its own multibyte character encoding functions instead of . Tim