From owner-freebsd-hackers@FreeBSD.ORG Wed Jun 18 09:39:14 2008 Return-Path: Delivered-To: hackers@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 4FC19106568A; Wed, 18 Jun 2008 09:39:14 +0000 (UTC) (envelope-from des@des.no) Received: from tim.des.no (tim.des.no [194.63.250.121]) by mx1.freebsd.org (Postfix) with ESMTP id F2AD18FC22; Wed, 18 Jun 2008 09:39:13 +0000 (UTC) (envelope-from des@des.no) Received: from ds4.des.no (des.no [84.49.246.2]) by smtp.des.no (Postfix) with ESMTP id 4C9292087; Wed, 18 Jun 2008 11:39:11 +0200 (CEST) From: =?utf-8?Q?Dag-Erling_Sm=C3=B8rgrav?= To: Andrey Chernov References: <20080617002224.GA16122@nagual.pp.ru> <20080617002808.GB16122@nagual.pp.ru> <20080617004647.GA16546@nagual.pp.ru> <48576610.9080808@FreeBSD.org> <48577510.4020007@aueb.gr> <48577BD2.4070205@bluemedia.pl> <20080617102900.GA46479@nagual.pp.ru> <485798C4.2050605@FreeBSD.org> <20080618055851.GA85018@nagual.pp.ru> <86zlpjduew.fsf@ds4.des.no> <20080618083739.GA87100@nagual.pp.ru> Date: Wed, 18 Jun 2008 11:39:10 +0200 In-Reply-To: <20080618083739.GA87100@nagual.pp.ru> (Andrey Chernov's message of "Wed\, 18 Jun 2008 12\:37\:39 +0400") Message-ID: <867icndqv5.fsf@ds4.des.no> User-Agent: Gnus/5.110006 (No Gnus v0.6) Emacs/23.0.60 (berkeley-unix) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Mailman-Approved-At: Wed, 18 Jun 2008 10:30:05 +0000 Cc: Doug Barton , current@FreeBSD.org, Konrad Jankowski , Diomidis Spinellis , hackers@FreeBSD.org, Gabor Kovesdan , Max Khon , "Sean C. Farley" , K?vesd?n G?bor Subject: Re: CFT: BSD-licensed grep [Fwd: cvs commit: ports/textproc/bsdgrep Makefile distinfo] X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 18 Jun 2008 09:39:14 -0000 Andrey Chernov writes: > Single byte locales collation works through strcoll() via chains, i.e.=20 > seek all chains starting with given letter. Multibyte locales collation=20 > currently is not implemented and can't be properly implemented under=20 > existen single byte framework (it will consume resourses badly in that=20 > case). I know semi-hacking attempts to implement multibyte collattion via= =20 > single byte one, but all they are only for small ASCII + national alphabe= t=20 > subset, rest of Unicode left unsorted. Does that mean our wcsxfrm() doesn't work? IIUC, it should convert wide strings to strings that can be compared directly with strcmp()? In any case, this is a libc issue, right? As long as sort / grep uses the API correctly, they will work fine once libc is fixed? DES --=20 Dag-Erling Sm=C3=B8rgrav - des@des.no