Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 16 Nov 2015 22:06:59 +0100
From:      Baptiste Daroussin <bapt@freebsd.org>
To:        Andrey Chernov <ache@freebsd.org>
Cc:        Ed Schouten <ed@nuxi.nl>, arch@freebsd.org, marino@freebsd.org
Subject:   Re: Question about ASCII and nl_langinfo (locale work)
Message-ID:  <20151116210659.GB59189@ivaldir.etoilebsd.net>
In-Reply-To: <564A27CD.7090908@freebsd.org>
References:  <20151110222636.GN10134@ivaldir.etoilebsd.net> <CABh_MKkAJmfTrT5qMwvcOcFAviD9h8okOnsH7PJ2x7gxFvY5Yw@mail.gmail.com> <564A27CD.7090908@freebsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help

--1LKvkjL3sHcu1TtY
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Mon, Nov 16, 2015 at 10:00:29PM +0300, Andrey Chernov wrote:
> On 16.11.2015 20:35, Ed Schouten wrote:
> > I personally think it's a shame if we were to deviate from returning
> > "US-ASCII", for the reason that "US-ASCII" also happens to be the
> > preferred MIME name for the character set:
> >=20
> > http://www.iana.org/assignments/character-sets/character-sets.xhtml
> >=20
> > "ASCII" doesn't even seem to be an alias for this character set.
>=20
> Yes, I overlook it somehow. ASCII is not in the IANA, while both
> ANSI_X3.4-1968 and US-ASCII are.
>=20
> So, I reconsider the proposal. We can return ANSI_X3.4-1968 for POSIX/C
> (for Linux compatibility reasons) and left pure US-ASCII as it was
> (since it is used rarely).

To tell the truth, the locale change I made were painful enough (mostly my
fault)and I (for now) won't do anywork further beside fixing the fallouts i=
f any
are left. But I do support this proposal!
>=20
> > In my opinion a decent implementation of newlocale() should support
> > any of the character set names and aliases provided on the IANA page,
> > but let nl_langinfo(CODESET) return the preferred MIME name.
>=20
> BTW, we already have and return non-IANA codesets historically (inspired
> by X11). I.e. we have ISO8859-* instead of preferred names ISO-8859-*,
> moreover, ISO8859-* even not the aliases (!) and IANA knows nothing
> about them. Linux have IANA preferred names here, i.e. ISO-8859-*.
>=20
> So the question is: should we rename ISO8859-* to ISO-8859-* to be IANA
> and Linux compatible?
>=20
> We can strip first (or all) "_" and "-" from the environment names (as
> Linux does), to not violate POLA.

I would like to see that as well, lots of new comers I have seen setup the
locales the IANA way and are unhappy because that does not work. The first =
plan
in the collation branch was to introduce the IANA syntax via an alias but i=
n the
end I removed it, because there was already to many changes.

If one want to go further on the locale changes like the above proposal ple=
ase
proceed.

Best regards,
Bapt

--1LKvkjL3sHcu1TtY
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1

iEYEARECAAYFAlZKRXMACgkQ8kTtMUmk6EyWtwCeIVKFFZPT1lXelA5fxjt1AWx2
x0MAoIgm9AqE/5zXGaAWma2msbykkL0j
=ar0A
-----END PGP SIGNATURE-----

--1LKvkjL3sHcu1TtY--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20151116210659.GB59189>