From owner-freebsd-chat Thu Mar 29 21:32:43 2001 Delivered-To: freebsd-chat@freebsd.org Received: from mail.inka.de (quechua.inka.de [212.227.14.2]) by hub.freebsd.org (Postfix) with ESMTP id 3A1F737B71D for ; Thu, 29 Mar 2001 21:32:37 -0800 (PST) (envelope-from daemon@mips.inka.de) Received: from kemoauc.mips.inka.de (uucp@) by mail.inka.de with local-bsmtp id 14irWF-0001Xy-00; Fri, 30 Mar 2001 07:32:19 +0200 Received: (from daemon@localhost) by kemoauc.mips.inka.de (8.11.3/8.11.1) id f2TM6tX82494 for freebsd-chat@freebsd.org; Fri, 30 Mar 2001 00:06:55 +0200 (CEST) (envelope-from daemon) From: naddy@mips.inka.de (Christian Weisgerber) Subject: Re: Unicode, 8-bit cleanliness, etc. Date: Thu, 29 Mar 2001 22:06:55 +0000 (UTC) Message-ID: <9a0blv$2ghl$1@kemoauc.mips.inka.de> References: Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit Originator: naddy@mips.inka.de (Christian Weisgerber) To: freebsd-chat@freebsd.org Sender: owner-freebsd-chat@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org Rich Morin wrote: > I recently started playing with Mac OS X, which allows Unicode (UTF-8, > AFAIK) in its path names. Because I'm also using my trusty FreeBSD box, > I'm wondering if there's any reason to worry about compatibility. So, > is FreeBSD totally 8-bit clean or are there some tarpits I should avoid? 8-bit cleanness is a given, considering that Europeans have used various single-byte 8-bit character sets (ISO 8859, KOI, etc) on FreeBSD for many years. The filesystem of course doesn't care, apart from '/' (directory separator) and '\0' (string terminator) it doesn't assign any meaning to particular characters. UTF-8's multi-byte nature *will* produce many cosmetic problems (e.g. ls(1) not aligning columns properly) throughout the system. More work on this is needed. Note that there's a freebsd-i18n list that deals with such issues. -- Christian "naddy" Weisgerber naddy@mips.inka.de To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-chat" in the body of the message