From owner-freebsd-questions@FreeBSD.ORG Thu Feb 12 23:18:09 2009 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id BA09F1065673 for ; Thu, 12 Feb 2009 23:18:09 +0000 (UTC) (envelope-from cswiger@mac.com) Received: from asmtpout014.mac.com (asmtpout014.mac.com [17.148.16.89]) by mx1.freebsd.org (Postfix) with ESMTP id A4A248FC08 for ; Thu, 12 Feb 2009 23:18:09 +0000 (UTC) (envelope-from cswiger@mac.com) MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Received: from cswiger1.apple.com ([17.227.140.124]) by asmtp014.mac.com (Sun Java(tm) System Messaging Server 6.3-7.03 (built Aug 7 2008; 32bit)) with ESMTPSA id <0KEZ003DH7DN8H70@asmtp014.mac.com> for freebsd-questions@freebsd.org; Thu, 12 Feb 2009 15:17:47 -0800 (PST) Message-id: From: Chuck Swiger To: Wojciech Puchar In-reply-to: <20090212235015.U97916@wojtek.tensor.gdynia.pl> Date: Thu, 12 Feb 2009 15:17:47 -0800 References: <499498A4.4000103@webvolution.net> <9BDCF06E-713F-4627-B8E6-8D15D5F3F120@mac.com> <20090212235015.U97916@wojtek.tensor.gdynia.pl> X-Mailer: Apple Mail (2.930.3) Cc: Daniel Leal , freebsd-questions@freebsd.org Subject: Re: accents in file names X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Feb 2009 23:18:10 -0000 On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote: >>> accented letter to my freebsd box, the accented letter simply >>> disappear. >> >> UFS supports 8-bit characters except for "/" and "\0", but you also >> need to run a terminal with UTF8 support and use a correct font to >> view such things. > > why? i use ISO-8859-2 You've answered "why" when you state that you set up a locale which supports ISO Latin-X charset. If you are running in the default C/ POSIX locale, using the US-ASCII character set and a font that only knows about 7-bit ASCII glyphs, then you won't get accented characters. > UFS doesn't deal with encoding at all, just store what you give That's right, which means you need to use filenames encoded in UTF8 rather than in arbitrary Unicode. People in Asia tend to want UTF-16 or UTF-32 encoding (although historical encodings like Big5, Shift- JIS, and now GB18030 for China are still rather popular, and those are multibyte encodings), and things like gcc's implementation of widechars or Python are standardizing on UTF-32. Regards, -- -Chuck