From owner-freebsd-hackers@FreeBSD.ORG Wed Nov 4 23:03:54 2009 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A7086106568D for ; Wed, 4 Nov 2009 23:03:54 +0000 (UTC) (envelope-from mel.flynn+fbsd.hackers@mailing.thruhere.net) Received: from mailhub.rachie.is-a-geek.net (rachie.is-a-geek.net [66.230.99.27]) by mx1.freebsd.org (Postfix) with ESMTP id 6E6328FC20 for ; Wed, 4 Nov 2009 23:03:54 +0000 (UTC) Received: from smoochies.rachie.is-a-geek.net (mailhub.lan.rachie.is-a-geek.net [192.168.2.11]) by mailhub.rachie.is-a-geek.net (Postfix) with ESMTP id B8BDE7E853 for ; Wed, 4 Nov 2009 14:03:52 -0900 (AKST) From: Mel Flynn To: freebsd-hackers@freebsd.org Date: Thu, 5 Nov 2009 00:03:50 +0100 User-Agent: KMail/1.12.1 (FreeBSD/8.0-RC1; KDE/4.3.1; i386; ; ) References: <200911032122.28905.mel.flynn+fbsd.hackers@mailing.thruhere.net> <200911041349.54943.mel.flynn+fbsd.hackers@mailing.thruhere.net> In-Reply-To: <200911041349.54943.mel.flynn+fbsd.hackers@mailing.thruhere.net> MIME-Version: 1.0 Content-Type: Text/Plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Message-Id: <200911050003.51080.mel.flynn+fbsd.hackers@mailing.thruhere.net> Subject: Grep -i and UTF-8 (Was: Re: Issue with grep -i (on i386 only?)) X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 04 Nov 2009 23:03:54 -0000 On Wednesday 04 November 2009 13:49:54 Mel Flynn wrote: > Using env MALLOC_OPTIONS= also has no impact at all (just in case defaults > aren't that). Since fgrep is fast and basically seeds the cache for grep, > I'm ruling out disks/io reads. In fact, /tmp on this laptop is memory disk > (one reason I couldn't go up to 64M :)). I honestly can't figure out what > my 'local problem' could be or your optimization. It hit me. Rather then a local problem, it's a locale problem: =>>> 16777216 =>>> en_US.UTF-8 =>>> fgrep 0.04 real 0.04 user 0.00 sys 0.04 real 0.02 user 0.02 sys 0.02 real 0.01 user 0.00 sys =>>> grep 0.04 real 0.04 user 0.00 sys 3.74 real 3.55 user 0.02 sys 1.95 real 1.83 user 0.03 sys =>>> en_US.ISO8859-1 =>>> fgrep 0.04 real 0.04 user 0.00 sys 0.04 real 0.03 user 0.00 sys 0.02 real 0.01 user 0.01 sys =>>> grep 0.05 real 0.03 user 0.00 sys 0.05 real 0.04 user 0.00 sys 0.08 real 0.04 user 0.03 sys =>>> en_US.US-ASCII =>>> fgrep 0.04 real 0.01 user 0.02 sys 0.05 real 0.03 user 0.01 sys 0.02 real 0.00 user 0.02 sys =>>> grep 0.04 real 0.03 user 0.00 sys 0.05 real 0.03 user 0.00 sys 0.08 real 0.06 user 0.01 sys -- Mel