From owner-freebsd-questions@FreeBSD.ORG Wed Nov 9 01:51:44 2011 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 13C3C106564A for ; Wed, 9 Nov 2011 01:51:44 +0000 (UTC) (envelope-from gmx@ross.cx) Received: from www81.your-server.de (www81.your-server.de [213.133.104.81]) by mx1.freebsd.org (Postfix) with ESMTP id C7D9D8FC0C for ; Wed, 9 Nov 2011 01:51:43 +0000 (UTC) Received: from [92.76.90.134] (helo=michael-think) by www81.your-server.de with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.72) (envelope-from ) id 1RNxKH-0005no-6e; Wed, 09 Nov 2011 02:51:41 +0100 Content-Type: text/plain; charset=iso-8859-15; format=flowed; delsp=yes To: freebsd-questions@freebsd.org, "Conrad J. Sabatier" References: <20111108184236.3a78ebf6@cox.net> Date: Wed, 09 Nov 2011 02:51:31 +0100 MIME-Version: 1.0 Content-Transfer-Encoding: Quoted-Printable From: "Michael Ross" Message-ID: In-Reply-To: <20111108184236.3a78ebf6@cox.net> User-Agent: Opera Mail/11.52 (Win32) X-Authenticated-Sender: gmx@ross.cx X-Virus-Scanned: Clear (ClamAV 0.97.3/13906/Tue Nov 8 17:25:05 2011) Cc: Subject: Re: "Unprintable" 8-bit characters X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 09 Nov 2011 01:51:44 -0000 Am 09.11.2011, 01:42 Uhr, schrieb Conrad J. Sabatier : > Pardon me if this may seem like a stupid question, but this is > something that's been bugging me for a long time, and none of my > research has turned up anything useful yet. > > I've been trying to understand what the deal is with regards to the > displaying of the "extended" 8-bit character set, i.e., 8-bit characte= rs > with the MSB set. > > More specifically, I'm trying to figure out how to get the "ls" comman= d > to properly display filenames containing characters in this extended > set. I have some MP3 files, for instance, whose names contain certain= > European characters, such as the lowercase "u" with umlaut (code 0xfc > in the Latin set, according to gucharmap), that I just can't get ls to= > display properly. These characters seem to be considered by ls as > "unprintable", and the best I've been able to produce in the ls > output is backslash interpretations of the characters using either the= > -B or -b options, otherwise the default "?" is displayed in their plac= e. Unsure if I understand you correctly. ("extended" 8-bit character set with MSB? utf-16?) I'm confused by this charset stuff in general. Assuming you want \0xfc displayed as "=FC", > cat test.py && python test.py && ls -l #!/usr/local/bin/python # -*- coding: utf-8 -*- f=3Dopen('\xfc','w') f.close() total 2 -rw-r--r-- 1 michael wheel 29 9 Nov 02:43 test.py -rw-r--r-- 1 michael wheel 0 9 Nov 02:44 =FC here is what works for me: in my login class in /etc/login.conf: :charset=3DISO-8859-1:\ :lang=3Dde_DE.ISO8859-1:\ ``cap_mkdb /etc/login.conf'' after changes in /etc/rc.conf: scrnmap=3D"iso-8859-1_to_cp437" font8x8=3D"cp850-8x8" font8x14=3D"cp850-8x14" font8x16=3D"cp850-8x16" and in /etc/ttys, console type is set to ``cons25l1'' Regards, Michael