From owner-freebsd-current@freebsd.org Sun Nov 1 21:30:32 2015 Return-Path: Delivered-To: freebsd-current@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 07CEAA2462B for ; Sun, 1 Nov 2015 21:30:32 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: from mailman.ysv.freebsd.org (mailman.ysv.freebsd.org [IPv6:2001:1900:2254:206a::50:5]) by mx1.freebsd.org (Postfix) with ESMTP id D6E3510BD for ; Sun, 1 Nov 2015 21:30:31 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: by mailman.ysv.freebsd.org (Postfix) id D45EDA2462A; Sun, 1 Nov 2015 21:30:31 +0000 (UTC) Delivered-To: current@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id BA2D2A24628 for ; Sun, 1 Nov 2015 21:30:31 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: from mail-wm0-x233.google.com (mail-wm0-x233.google.com [IPv6:2a00:1450:400c:c09::233]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4FF8A10BC for ; Sun, 1 Nov 2015 21:30:31 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: by wmeg8 with SMTP id g8so45230858wme.0 for ; Sun, 01 Nov 2015 13:30:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=sender:date:from:to:subject:message-id:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; bh=iuSYDx8wLnD/XJIWM17yBYBykuPN9b5LMSdrtiBF1P8=; b=xmGpgn70cv5hd8rB7Vc36BJAhncwe3gGp3KIS1Px/3peDJmYa0pbLk4oOccSHqlBrP fZvH1/yuhdarfciQIjuzrPNJiO+7R55XKS0PyET9Bvh33BaknvMy/MbuhMdK8LcXFbpX jh15B7DDA1cLEPxkJhQfW2Y/ZYiAJVDD+zXRMKXvvPrS7Y8AHSPUgA2ZMb9nSulB7BWi P8YpSvkX47Lx6/PmNivdLkGgUD3pBcgKQ+8VpNOlqQ3fr/0cQ+cy9cUmJCP+3vULjgc5 HSWtyJ6kpINku/VLctVBzBp3YZeWszOl9oAq16sIRDsCIuRVg1UC0BJlyrzbQbUxUDIv j3FQ== X-Received: by 10.28.12.11 with SMTP id 11mr9052193wmm.99.1446413428795; Sun, 01 Nov 2015 13:30:28 -0800 (PST) Received: from ivaldir.etoilebsd.net ([2001:41d0:8:db4c::1]) by smtp.gmail.com with ESMTPSA id r13sm14594198wmg.12.2015.11.01.13.30.27 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 01 Nov 2015 13:30:28 -0800 (PST) Sender: Baptiste Daroussin Date: Sun, 1 Nov 2015 22:30:26 +0100 From: Baptiste Daroussin To: current@FreeBSD.org Subject: Re: [CFT] Unicode collation string and reworked locale definitions Message-ID: <20151101213026.GC89277@ivaldir.etoilebsd.net> References: <20151013222306.GE55137@ivaldir.etoilebsd.net> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="ABTtc+pdwF7KHXCz" Content-Disposition: inline In-Reply-To: <20151013222306.GE55137@ivaldir.etoilebsd.net> User-Agent: Mutt/1.5.24 (2015-08-30) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 01 Nov 2015 21:30:32 -0000 --ABTtc+pdwF7KHXCz Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Wed, Oct 14, 2015 at 12:23:06AM +0200, Baptiste Daroussin wrote: > Hi all, >=20 > I have been working for a while on bringing in Unicode string collation > support by merging code from Illumos (by Garrett D'Amore who kindly made = sure > his work was made under BSD license) and Dragonfly (by John Marino), and = some > ancient work done on FreeBSD by edwin@ but never merged. >=20 > The result is available in the projects/collation branch. >=20 > As a result of this work, is: > - Locales are now generated with the new localedef(1) tool from CLDR POSI= X files > - The generated files are now identified as "BSD 1.0" format > - Only "BSD 1.0" locales files are now read, all other version will be se= t to > "C" > - The localedef(1) tool has been imported from Illumos and modidied to use > tree(3) instead of the CDDL avl(3) > - A set of tool created by edwin@ and extended by marino@ for dragonfly h= as been > added to be able to generate locales > - Given our regex(3) does not support multibyte yet (actually it does not > support some single-byte codeset) it has been forced to always use loca= le C > - Remove colldef(1) and mklocale(1) > - Finish implementing the numeric BSD extension for ctypes > - Add a bunch of new locales: some arabian locales, hebrew locales, some > regional locales, etc. > - Make a bunch of ISO-8859-1 locales simple aliase on ISO-8859-15 where i= t makes > sense > - Add short version of locales > - Add @euro aliases on the locales where that make sense >=20 > Please test the branch and report issues. >=20 > Note that yes that means the COLLATION_FIX patch on glib2 will not be nec= essary > anymore > and yes the icu patch on postgresql will not be necessary anymore >=20 > Best regards, > Bapt All issues reported has been fixed, except if more issues are reported, this will be merged into head next saturday: November 7th Bapt --ABTtc+pdwF7KHXCz Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iEYEARECAAYFAlY2hHIACgkQ8kTtMUmk6Eze8wCfTGe0SUqfD4TU7mLAvKIFhZO7 FysAnjRGb2HikYg5KIvhXBHa6LJijwn0 =PPhG -----END PGP SIGNATURE----- --ABTtc+pdwF7KHXCz--