Date: Wed, 12 Mar 2014 10:31:43 +0100 From: Gerhard Schmidt <estartu@ze.tum.de> To: stable@freebsd.org Subject: UTF-8 Sorting Message-ID: <5320297F.1080400@ze.tum.de>
next in thread | raw e-mail | index | archive | help
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi, I've a problem with FreeBSD, UTF-8 and Sorting. e.g. there is a file with the following content Meier Müller Öger Ofner Schmidt I have set my Terminal to ISO-8859-1 Encoding and call sort on this file I get the following output. Meier Müller Ofner Öger Schmidt Which is correctly sorted. When i change my Terminal to UTF-8 encoding and convert the file to UTF-8 and call sort again I get the following output. Meier Müller Ofner Schmidt Öger which is wrong. The problem seams to be that the LC_COLLATE file in the de_DE.UTF-8 locale is linked to ../la_LN.US-ASCII/LC_COLLATE (as are all LC_COLLATE Files in any UTF-8 locale). After some Research i found a Mail from Kuba Lida in December 2008 (yeah that's 5 Years ago) stating the same Problem and got no response. Why isn't there a UTF-8 LC_COLLATE file for any language. Kuba Lida believed there was a Problem with multibyte collate files in FreeBSD. Is this true and are there plans to fix this problem. The same test under Linux works without problem. Regards Estartu - -- - --------------------------------------------------------------------------- Gerhard Schmidt | http://www.augusta.de/~estartu | Fischbachweg 3 | | PGP Public Key 86856 Hiltenfingen | JabberID: estartu@augusta.de | on request Germany | | -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iQCVAwUBUyApfwzx22nOTJQRAQJIbgP+MMSPepEsyG8Kx+QRDGJlfyQKK+r98/e+ ZiNPRMNjBpT7qrElJLvYfAuix3pOyqL3mq1DQJvZmqQxfoxEdy6GUf42i1Yk5gEX T05YtaeVRoXK/TetFt0UEcC3bXuXheu63aBpO4FU2v8CPTAyBwU6DUvV/v3AzXr6 j+mwws5n7so= =J2tH -----END PGP SIGNATURE-----
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?5320297F.1080400>