Date: Sat, 18 Sep 2004 10:29:37 +1000 From: Tim Robbins <tjr@freebsd.org> To: Brian Fundakowski Feldman <green@freebsd.org> Cc: freebsd-arch@freebsd.org Subject: Re: diff(1) Message-ID: <20040918002937.GA4944@cat.robbins.dropbear.id.au> In-Reply-To: <20040917233507.GW36708@green.homeunix.org> References: <16715.4611.108597.354107@piglet.timing.com> <20040917.130549.22012205.imp@bsdimp.com> <20040917191240.GR36708@green.homeunix.org> <xzphdpwn04v.fsf@dwp.des.no> <20040917200345.GT36708@green.homeunix.org> <xzp1xh0a8wt.fsf@dwp.des.no> <2FE38EA4-08FE-11D9-A03E-000A95C705DC@speakeasy.net> <20040917233507.GW36708@green.homeunix.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Sep 17, 2004 at 07:35:07PM -0400, Brian Fundakowski Feldman wrote: > On Fri, Sep 17, 2004 at 04:06:20PM -0700, Sean Chittenden wrote: > > >I don't know how much they've changed it, but I do know that it still > > >uses whichever regexp engine you happen to have in libc. In our case, > > >that means good old Henry Spencer. Last I talked to him, he was going > > >to release a new, improved, and much faster regexp engine, but that > > >was years ago and I still haven't seen anything come out of it. > > > > Actually, that's not quite correct. Spencer's latest regexp(3) was > > integrated into PostgreSQL 7.4 to provide wide-byte regexp support > > (released on 2003-11-17). Last I checked, it was being used regularly > > and with good success as it's gotten a good ten months of production > > use. :) > > > > From the PostgreSQL 7.4 release notes: > > > > "Faster and more powerful regular expression code > > > > "The entire regular expression module has been replaced with a new > > version by Henry Spencer, originally written for Tcl. The code greatly > > improves performance and supports several flavors of regular > > expressions." > > I'll certainly have to try it out and see whether or not it's faster > than Oniguruma (http://www.geocities.jp/kosako1/oniguruma/). Either > one should be more than suitable for libc if it's got enough of a > performance improvement. Oniguruma is not suitable because it does not have proper locale support -- it uses its own character type tables instead of using <wctype.h>, and its own multibyte character encoding functions instead of <wchar.h>. Tim
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20040918002937.GA4944>
