Date: Thu, 29 Jan 2015 17:03:21 +0100
From: Rainer Hurling <rhurlin@gwdg.de>
To: Konstantin Belousov <kostikbel@gmail.com>
Cc: Kubilay Kocak <koobs@FreeBSD.org>, Robert Simmons <rsimmons0@gmail.com>, freebsd-python@freebsd.org
Subject: Re: Unicode Problem
Message-ID: <54CA59C9.4090002@gwdg.de>
In-Reply-To: <20150129095328.GQ42409@kib.kiev.ua>
References: <CA%2BQLa9DnD4mYSBc3B_SUZ6shH64v8Lej_CMMVaK=s%2BPrQp5Gvw@mail.gmail.com> <CA%2BQLa9Amvu%2B1hg4Oo5J_CjoJNF2Ahs6LAkPrOAEncNU_iWXEtQ@mail.gmail.com> <54C9FE33.2070307@FreeBSD.org> <20150129095328.GQ42409@kib.kiev.ua>

On 29.01.2015 at 10:53, Konstantin Belousov wrote:
> On Thu, Jan 29, 2015 at 08:32:35PM +1100, Kubilay Kocak wrote:
>> On 29/01/2015 6:13 PM, Robert Simmons wrote:
>>> On further inspection I've found the following:
>>>
>>> FreeBSD
>>>>>> import sys
>>>>>> print(sys.getdefaultencoding())
>>> utf-8
>>>>>> print(sys.stdout.encoding)
>>> US-ASCII
>>>
>>> MacOS X:
>>>>>> import sys
>>>>>> print(sys.getdefaultencoding())
>>> utf-8
>>>>>> print(sys.stdout.encoding)
>>> UTF-8
>>>
>>> How do I modify stdout encoding to set it to UTF-8 in FreeBSD?
>>
>> Another data point from my 9-STABLE:
>>
>> Python 3.4.2 (default, Nov 3 2014, 13:38:18)
>> [GCC 4.2.1 Compatible FreeBSD Clang 3.4.1 (tags/RELEASE_34/dot1-final
>> 208032)] on freebsd9
>> Type "help", "copyright", "credits" or "license" for more information.
>>>>> b'\xc3\xa2'.decode('utf-8')
>> '??'
>>>>> import sys
>>>>> print(sys.getdefaultencoding())
>> utf-8
>>>>> print(sys.stdout.encoding)
>> UTF-8
>>>>>
>>
>> Python 2.7.9 (default, Jan 24 2015, 20:39:40)
>> [GCC 4.2.1 Compatible FreeBSD Clang 3.4.1 (tags/RELEASE_34/dot1-final
>> 208032)] on freebsd9
>> Type "help", "copyright", "credits" or "license" for more information.
>>>>> b'\xc3\xa2'.decode('utf-8')
>> u'\xe2'
>>>>> import sys
>>>>> print(sys.getdefaultencoding())
>> ascii
>>>>> print(sys.stdout.encoding)
>> UTF-8
>>>>>
On my box, running a recent HEAD (amd64), the results match Koobs's:
#locale
LANG=de_DE.UTF-8
LC_CTYPE="de_DE.UTF-8"
LC_COLLATE=C
LC_TIME="de_DE.UTF-8"
LC_NUMERIC="de_DE.UTF-8"
LC_MONETARY="de_DE.UTF-8"
LC_MESSAGES="de_DE.UTF-8"
LC_ALL=
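
To check what these locale settings resolve to from inside the interpreter, something like the following could be used. This is only a minimal sketch, assuming Python 3; the helper names describe_encodings and is_utf8 are made up for illustration and are not part of any library.

```python
import codecs
import locale
import sys


def describe_encodings():
    """Collect the encodings Python reports for this process."""
    return {
        "default": sys.getdefaultencoding(),
        # stdout may be replaced by an object without an encoding attribute
        "stdout": getattr(sys.stdout, "encoding", None),
        # False: report the locale as it is, without changing it
        "locale": locale.getpreferredencoding(False),
    }


def is_utf8(name):
    """True if an encoding name is an alias of UTF-8 (e.g. 'utf8', 'UTF-8')."""
    try:
        return codecs.lookup(name).name == "utf-8"
    except (LookupError, TypeError):
        return False


info = describe_encodings()
for key in sorted(info):
    print("%-8s %s  utf-8=%s" % (key, info[key], is_utf8(info[key])))
```

If the stdout line reports US-ASCII, as in Robert's FreeBSD output, starting the interpreter with a UTF-8 locale in LANG or LC_ALL, or with PYTHONIOENCODING=utf-8 set in the environment, should change it.
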
#python3
Python 3.4.2 (default, Jan 11 2015, 07:51:41)
[GCC 4.2.1 Compatible FreeBSD Clang 3.5.0 (tags/RELEASE_350/final
216957)] on freebsd11
Type "help", "copyright", "credits" or "license" for more information.
### b'\xc3\xa2'.decode('utf-8')
'â'
[For python27, the UCS4 option is enabled]
#python
Python 2.7.9 (default, Jan 24 2015, 10:35:50)
[GCC 4.2.1 Compatible FreeBSD Clang 3.5.1 (tags/RELEASE_351/final
225668)] on freebsd11
Type "help", "copyright", "credits" or "license" for more information.
### b'\xc3\xa2'.decode('utf-8')
u'\xe2'
So there is apparently a difference between the Python versions that is
independent of the locale settings?
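
One possible reading, as a minimal sketch assuming Python 3: both interpreters may well decode the two bytes to the same single code point, U+00E2, and differ only in how the interactive prompt displays it. The Python 2 repr escapes non-ASCII characters, while the Python 3 repr shows printable ones directly. The built-in ascii() gives the escaped form under Python 3, so the two can be compared without depending on the terminal encoding.

```python
# Decode the same two UTF-8 bytes used in the sessions above.
decoded = b'\xc3\xa2'.decode('utf-8')

# The result is a single code point, U+00E2.
print(len(decoded))
print(hex(ord(decoded)))

# ascii() escapes non-ASCII characters, much like the Python 2 repr does,
# and is safe to print even when stdout is US-ASCII.
print(ascii(decoded))

# The Python 3 result compares equal to the value Python 2 displayed.
print(decoded == u'\xe2')
```

If the last line prints True, the decoded value is the same in both versions and only its display form differs.
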
