From owner-freebsd-questions@FreeBSD.ORG Mon Feb 16 21:37:04 2009 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5703F106564A for ; Mon, 16 Feb 2009 21:37:04 +0000 (UTC) (envelope-from dleal@webvolution.net) Received: from front4.netvisao.pt (front4.netvisao.pt [213.228.128.37]) by mx1.freebsd.org (Postfix) with SMTP id CCFD08FC08 for ; Mon, 16 Feb 2009 21:37:02 +0000 (UTC) (envelope-from dleal@webvolution.net) Received: (qmail 4929 invoked from network); 16 Feb 2009 21:36:50 -0000 Received: from av-front1.netvisao.pt (213.228.128.152) by front4.netvisao.pt with SMTP; 16 Feb 2009 21:36:50 -0000 Received: (qmail 9754 invoked from network); 16 Feb 2009 21:37:53 -0000 Received: from ar-217-129-86-43.netvisao.pt (HELO [192.168.1.200]) (dleal@[217.129.86.43]) (envelope-sender ) by av-front1.netvisao.pt (qmail-ldap-1.03) with SMTP for ; 16 Feb 2009 21:37:53 -0000 Message-ID: <4999DCD2.3020807@webvolution.net> Date: Mon, 16 Feb 2009 21:38:26 +0000 From: Daniel Leal User-Agent: Thunderbird 2.0.0.18 (X11/20090102) MIME-Version: 1.0 To: =?UTF-8?B?TWloYWkgRG9uyJt1?= References: <499498A4.4000103@webvolution.net> <20090212235015.U97916@wojtek.tensor.gdynia.pl> <200902162205.17644.mihai.dontu@gmail.com> In-Reply-To: <200902162205.17644.mihai.dontu@gmail.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit X-TM-AS-Product-Ver: IMSS-7.0.0.3216-5.5.0.1026-16468.002 X-TM-AS-Result: No--8.362-5.0-31-1 X-imss-scan-details: No--8.362-5.0-31-1 Cc: Wojciech Puchar , freebsd-questions@freebsd.org Subject: Re: accents in file names X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 Feb 2009 21:37:04 -0000 Yes, that's right. I copied the files from win4bsd system. Mihai Donțu wrote: > On Friday 13 February 2009, Chuck Swiger wrote: > >> On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote: >> >>>>> accented letter to my freebsd box, the accented letter simply >>>>> disappear. >>>>> >>>> UFS supports 8-bit characters except for "/" and "\0", but you also >>>> need to run a terminal with UTF8 support and use a correct font to >>>> view such things. >>>> >>> why? i use ISO-8859-2 >>> >> You've answered "why" when you state that you set up a locale which >> supports ISO Latin-X charset. If you are running in the default C/ >> POSIX locale, using the US-ASCII character set and a font that only >> knows about 7-bit ASCII glyphs, then you won't get accented characters. >> >> >>> UFS doesn't deal with encoding at all, just store what you give >>> >> That's right, which means you need to use filenames encoded in UTF8 >> rather than in arbitrary Unicode. >> > > UTF-8 is what we prefer these days, but the filesystem can handle anything > that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.). > > Now, I assume Daniel was copying "filé.txt" from a non-UFS (Windows box, > FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think > of and in which such a problem might appear. > > >> People in Asia tend to want UTF-16 >> or UTF-32 encoding (although historical encodings like Big5, Shift- >> JIS, and now GB18030 for China are still rather popular, and those are >> multibyte encodings), and things like gcc's implementation of >> widechars or Python are standardizing on UTF-32. >> > >