Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 29 Jan 2015 19:55:33 +0100
From:      Roland Smith <rsmith@xs4all.nl>
To:        Robert Simmons <rsimmons0@gmail.com>
Cc:        freebsd-python@freebsd.org
Subject:   Re: Unicode Problem
Message-ID:  <20150129185533.GA38445@slackbox.erewhon.home>
In-Reply-To: <CA%2BQLa9BszmgY-ca-z3L7C9NhDo0GCzwHsuGUwMJ5Xr6jo8p_1Q@mail.gmail.com>
References:  <CA%2BQLa9DnD4mYSBc3B_SUZ6shH64v8Lej_CMMVaK=s%2BPrQp5Gvw@mail.gmail.com> <20150129072908.GA37127@slackbox.erewhon.home> <CA%2BQLa9BszmgY-ca-z3L7C9NhDo0GCzwHsuGUwMJ5Xr6jo8p_1Q@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help

--KsGdsel6WgEHnImy
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Thu, Jan 29, 2015 at 02:42:31AM -0500, Robert Simmons wrote:
> On Thu, Jan 29, 2015 at 2:29 AM, Roland Smith <rsmith@xs4all.nl> wrote:
> > On Thu, Jan 29, 2015 at 01:38:21AM -0500, Robert Simmons wrote:
> >> I'm having a unicode problem on FreeBSD lang/python34 that does not
> >> appear on MacOS X. I've condensed the problem to one single line to
> >> enter in the interpreter:
> >>
> >> FreeBSD:
> >> Python 3.4.2 (default, Jan 28 2015, 22:23:57)
> >> [GCC 4.2.1 Compatible FreeBSD Clang 3.4.1 (tags/RELEASE_34/dot1-final
> >> 208032)] on freebsd10
> >> Type "help", "copyright", "credits" or "license" for more information.
> >> >>> b'\xc3\xa2'.decode('utf-8')
> >> '\xe2'
> >>
> >> MacOS X:
> >> Python 3.4.2 (default, Oct 19 2014, 17:55:38)
> >> [GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.54)] on darwin
> >> Type "help", "copyright", "credits" or "license" for more information.
> >> >>> b'\xc3\xa2'.decode('utf-8')
> >> '=C3=A2'
> >>
> >> Why is Python on FreeBSD incorrectly decoding this?
> >
> > Works fine here (FreeBSD 10.1-STABLE #0 r276653 amd64):
> >
> >     Python 3.4.2 (default, Nov  4 2014, 19:34:48)
> >     [GCC 4.2.1 Compatible FreeBSD Clang 3.4.1 (tags/RELEASE_34/dot1-fin=
al 208032)] on freebsd10
> >     Type "help", "copyright", "credits" or "license" for more informati=
on.
> >     >>> b'\xc3\xa2'.decode('utf-8')
> >     '=C3=A2'

(please don't top-post)

> What is the output from print(sys.stdout.encoding) on your system?

    Python 3.4.2 (default, Nov  4 2014, 19:34:48)=20
    [GCC 4.2.1 Compatible FreeBSD Clang 3.4.1 (tags/RELEASE_34/dot1-final 2=
08032)] on freebsd10
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import sys
    >>> print(sys.stdout.encoding)
    UTF-8

> And, can you explain how to change that on mine so that it is UTF-8?
> Mine is a default fresh install, btw.

In /etc/login.conf, I set LC_ALL=3Den_US.UTF-8;

    default:\
            :passwd_format=3Dsha512:\
            :copyright=3D/etc/COPYRIGHT:\
            :welcome=3D/etc/motd:\
            :setenv=3DMAIL=3D/var/mail/$,BLOCKSIZE=3DK,LC_ALL=3Den_US.UTF-8=
:\
            :path=3D/sbin /bin /usr/sbin /usr/bin /usr/games /usr/local/sbi=
n /usr/local/bin

And I use a unicode aware X terminal (rxvt-unicode).

In case you're not using X11, the new vt(4) device uses UTF-8, but the old
sc(4) doesn't support it at all, AFAIK.

Roland
--=20
R.F.Smith                                   http://rsmith.home.xs4all.nl/
[plain text _non-HTML_ PGP/GnuPG encrypted/signed email much appreciated]
pgp: 5753 3324 1661 B0FE 8D93  FCED 40F6 D5DC A38A 33E0 (keyID: A38A33E0)

--KsGdsel6WgEHnImy
Content-Type: application/pgp-signature

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQIcBAABAgAGBQJUyoIlAAoJEED21dyjijPgeKkQALSYLDcpUFyzf2xMIDAMtpoN
2U7sZHEf+Afj/nMbOoj90fk2IjirziXB8LRUdnJgPHrvARecqsY/bi0rgrdv5jjt
W0/fYjuu17qKgSEGpijA9tqLfDyi1wAY91SmJlOhFiogJThiEZ/lFZRHRR4uhGo6
IcO2chbTO8ppV3Ch+mj7tC4MJofeUsdJaDDyhp8KIsakR/F5SEQ4kilwxAPMqllP
f2UrwJXLnSYvu8E4Ap0sNBz5k3K8DspNjK1HjzDa3twTmAwCJGdxVHjVlPQFEiqr
8Ps+oZX/3M5WMcP6yOWXun/fh3zDDWHTtC81B8oh9HVFCyqJTv/Q1nbANQCkgmA9
lpLhRrsVzWX87V0J8i8Hzf7jROVlT2PBuCxunWEXwvLyDU5ySterLz7jVFibwg0k
bLuVCfwXmgYyGLzV5iu2ldZxAPQUEv07Ef3vH3xuIDyWVS+TnFomdWgpAhdA8vio
9ZMqYD8cq4N6MRsVpDtyDw3EOONTh7sN2+8u0ztMPZfJf4cPKkDj+szC3WJexZT1
OCwFwS7pquG5wMzMIjs0F/0+logOqnWJCWFkeO+5NE3rDGpG08dTngtFMuTR6A6t
MDVTRtdyZ5f4M73UVRrh5LBmbypowC3aL7bqhsMfVutBv9VRj3bc7PSsR968i9Me
pRtmYwFy7QjzglUENTeC
=YSs+
-----END PGP SIGNATURE-----

--KsGdsel6WgEHnImy--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20150129185533.GA38445>