Date: Sun, 19 Jun 2016 21:40:51 +0200 From: Baptiste Daroussin <bapt@FreeBSD.org> To: Andrey Chernov <ache@freebsd.org> Cc: svn-src-head@freebsd.org, svn-src-all@freebsd.org, src-committers@freebsd.org, Hajimu UMEMOTO <ume@FreeBSD.org> Subject: Re: svn commit: r302016 - in head: share/timedef tools/tools/locale/tools Message-ID: <20160619194051.w4dueeutrgtjcfuf@ivaldir.etoilebsd.net> In-Reply-To: <1f579a9a-3aa4-e99b-f991-b42a1672a9b4@freebsd.org> References: <201606190344.u5J3iWO6079652@repo.freebsd.org> <3387744f-7abd-417c-599f-163e78b4170e@freebsd.org> <20160619145233.kjnfyxjb75fdcucz@ivaldir.etoilebsd.net> <20160619154918.etycha7kvj4v7nrp@ivaldir.etoilebsd.net> <e1c4e728-5abd-3618-1cc1-a63ac64cf63a@freebsd.org> <eea90a85-3731-6870-ba83-107e23337b2d@freebsd.org> <77076d87-41ad-0cce-c703-ed9cef1bdb8b@freebsd.org> <20160619171021.7ps2te7wgoxfqznv@ivaldir.etoilebsd.net> <a7182802-f25e-ae08-769b-15fcc786e224@freebsd.org> <1f579a9a-3aa4-e99b-f991-b42a1672a9b4@freebsd.org>
next in thread | previous in thread | raw e-mail | index | archive | help
--q2c376tbb6u7dqvn Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Sun, Jun 19, 2016 at 10:02:26PM +0300, Andrey Chernov wrote: > On 19.06.2016 20:21, Andrey Chernov wrote: > > On 19.06.2016 20:10, Baptiste Daroussin wrote: > >> On Sun, Jun 19, 2016 at 08:05:30PM +0300, Andrey Chernov wrote: > >>> On 19.06.2016 19:52, Andrey Chernov wrote: > >>>> On 19.06.2016 19:47, Andrey Chernov wrote: > >>>>> On 19.06.2016 18:49, Baptiste Daroussin wrote: > >>>>>> On Sun, Jun 19, 2016 at 04:52:34PM +0200, Baptiste Daroussin wrote: > >>>>>>> On Sun, Jun 19, 2016 at 07:57:49AM +0300, Andrey Chernov wrote: > >>>>>>>> On 19.06.2016 6:44, Hajimu UMEMOTO wrote: > >>>>>>>>> Log: > >>>>>>>>> - Prefer to use %d over %e where the day of the month should = be zero > >>>>>>>>> filled. > >>>>>>>>> - Since %e means the day of the month as well, regard %e as s= ame as %d > >>>>>>>>> in md_order. > >>>>>>>> > >>>>>>>> Nonsense doubled formats in sr_*_RS locales and nonsense md_orde= r there too. > >>>>>>> > >>>>>>> Crap I'll dig into it, thanks for spotting it. > >>>>>>> > >>>>>>> Best regards, > >>>>>>> Bapt > >>>>>> > >>>>>> Should be fixed by: > >>>>>> https://people.freebsd.org/~bapt/locale-triple-components.diff > >>>>>> > >>>>>> The perl script generating the locales was badly handling locales = with 3 > >>>>>> components, fixed now. > >>>>>> > >>>>>> Sorry about that > >>>>>> > >>>>>> Best regards, > >>>>>> Bapt > >>>>>> > >>>>> > >>>>> sr locales are badly named before anything else, proper format is > >>>>> language[_COUNTRY][.encoding][@variant] > >>>>> I.e. > >>>>> sr_RS.UTF-8@Latn > >>>>> sr_RS.ISO8859-2@Latn > >>>>> sr_RS.ISO8859-t@Cyrl > >>>>> > >>>>> I don't know, if out utilities (f.e. locale(1)) can handle @variant= suffix. > >>>>> > >>>>> > >>>> =3Dsr_RS.ISO8859-5@Cyrl > >>>> =3Dour, and not utilities only but libc too > >>> > >>> Since right now we have only single @variant per each encoding, and no > >>> @variant for other locales, we can just drop it and use right names l= ike: > >>> sr_RS.UTF-8 > >> > >> We have 2 for the above the Cyrl version and the Latn version > >> > >>> sr_RS.ISO8859-2 > >>> sr_RS.ISO8859-5 > >> > >> We can make aliases easily for that is that is what people want, not t= he we have > >> the same for other locales: > >> > >> zh_Hans_CN.GB18030 > >> zh_Hans_CN.GB2312 > >> zh_Hans_CN.GBK > >> zh_Hans_CN.UTF-8 > >> zh_Hans_CN.eucCN > >> zh_Hant_HK.UTF-8 > >> zh_Hant_TW.Big5 > >> zh_Hant_TW.UTF-8 > >> > >> With some aliases for some of the previously existing ones: > >> zh_TW.Big5 > >> zh_TW.UTF-8 > >> zh_CN.GB18030 > >> zh_CN.GB2312 > >> zh_CN.GBK > >> zh_CN.UTF-8 > >> zh_CN.eucCN > >> zh_HK.UTF-8 > >> > >> Some of the new locales (that didn't exist before) I have only importe= d the new > >> name format: > >> kk_Cyrl_KZ.UTF-8 > >> mn_Cyrl_MN.UTF-8 > >> sr_Cyrl_RS.ISO8859-5 > >> sr_Cyrl_RS.UTF-8 > >> sr_Latn_RS.ISO8859-2 > >> sr_Latn_RS.UTF-8 > >> > >> I would prefer staying on the new syntax given it is allowed by some R= FC and it > >> is slowly being adopted everywhere else. > >> > >> Btw another RFC: https://www.ietf.org/rfc/rfc5646.txt > >> > >> Best regards, > >> Bapt > >> > >=20 > > Old POSIX don't use RFC 5646, there are ISO-639 two-letter codes. When > > encodings are different, there is no needs to specify variants, they > > have meaning only for the same encoding. > >=20 > >=20 >=20 > I found POSIX reference about @-syntax: > [language[_territory][.codeset][@modifier]] > http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap08.html > It says nothing about language, territory or codeset standards. >=20 > They comes from another standard, ISO 15897: > "8. Natural language, as specified in ISO 639 > 9. Territory, as two-letter form of ISO 3166" > I can't find there something about codeset/charmap standartization, but > perhaps I just overlook some reference. >=20 Thanks a lot for that reference I wonder how I failed at finding it, I read twice the locale definition in opengroup documents. I have found yet proof in the code, but from my testings (using nl_langinfo= ()) I can see that the @modifier is properly supported. I have 2 proposal then: 1/ Keep the current version and add aliases with "@" for the one in variants 2/ Rename all the variants we have now into the POSIX version with "@" What do you think? Best regards, Bapt --q2c376tbb6u7dqvn Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQIcBAEBCAAGBQJXZvUnAAoJEGOJi9zxtz5a2dcP/2+VFqGafUG6o7XnZ9rNSx+f DynRmTSbSgo0Uv/EEPOSkbTqvGg/lWgiL1+YxfX2aO++3rBQqLsS7McjBcUwYwPg cg1rkfZRhOPne3YT6YOlWs4nL3DWq5EUXiKX5R15mnQHxQQgkfF7aa5lhW746sgR K7ieDx6e1Jvt8KlZhvbgBpLiLYeaMbyxWqlyFNT5tEE2CYgwT9bhkRY7N6XkQ+le 74dU+UXW0gF1rAn5tDG9JwJvQ+cLH5x3cmf5gVaDiLPgOIOj3pZ/MtGrasWZDkua llqgVpGnQDBtgH7XLPhpIF/lH0227Di8e7OvYYFSRy5jQ3BCb6d9wOJEZJoPLi/K Stm5ejgLZfZu4l9u/pgAaCerjVv/iImHlHVXRL6IJwo4R+rMNxlkV1AmYPNrezZx f1EkRtDjRc7xLk7STg31GSMNoOgPgTl2lsZ85N/o3ttjSFjh2vFfMjwz8HEjqRL0 tf9weDPukIonqNBuQ5pCcLBp78I62kIxPg3+V+2oCdHr9zcbRKCieFocVLS7j1kt gqvDWxJWSbOKSfqJhXU/qJ1Aarp7CT82fIh7C1JBiCFd9FZA4qeQ5vKuytDkHNJw WGyikuET06nVcfcL2aopiW8Ykk0yu8jPxUkvpyUbfJSwmssO4JUSEmJrRmOGSXIe i8jb2c1QHy9RlWGsGt77 =X0qN -----END PGP SIGNATURE----- --q2c376tbb6u7dqvn--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20160619194051.w4dueeutrgtjcfuf>