From owner-freebsd-current Wed Apr 14 11:57:42 1999 Delivered-To: freebsd-current@freebsd.org Received: from work.mzaki.nom (173.pool15.tokyo.att.ne.jp [165.76.220.188]) by hub.freebsd.org (Postfix) with ESMTP id 9565214D2F; Wed, 14 Apr 1999 11:57:28 -0700 (PDT) (envelope-from mzaki@e-mail.ne.jp) Received: from localhost (localhost [127.0.0.1]) by work.mzaki.nom (8.9.2/8.9.2) with ESMTP id DAA05453; Thu, 15 Apr 1999 03:54:47 +0900 (JST) (envelope-from mzaki@e-mail.ne.jp) To: dcs@newsguy.com Cc: logix@foobar.franken.de, freebsd-bugs@FreeBSD.ORG, freebsd-current@FreeBSD.ORG Subject: Re: kern/5038: FreeBSD can't read MS Joliet CDs. In-Reply-To: <3714D32A.DF778838@newsguy.com> References: <199904140620.XAA24224@freefall.freebsd.org> <37149716.582342ED@newsguy.com> <19990414164548.A14876@foobar.franken.de> <19990415021736I.mzaki@e-mail.ne.jp> <3714D32A.DF778838@newsguy.com> X-Mailer: Mew version 1.94b7 on XEmacs 21.0 (20 minutes to Nikko) X-fingerprint: BD 86 CD D9 66 FD 75 B0 A8 43 4B C4 E2 1B F5 B5 Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-Id: <19990415035446A.mzaki@e-mail.ne.jp> Date: Thu, 15 Apr 1999 03:54:46 +0900 From: Motomichi Matsuzaki X-Dispatcher: imput version 990212(IM106) Lines: 49 Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG From: "Daniel C. Sobral" Subject: Re: kern/5038: FreeBSD can't read MS Joliet CDs. Date: Thu, 15 Apr 1999 02:40:58 +0900 dcs> > 2. Only Page 00 Unicode is shown (for Joliet CDs). dcs> ... dcs> > Note. dcs> > Byung's patch passes Unicode transparently. dcs> > So CJK filenames are shown if terminals do Unicodes well. dcs> dcs> Do you mean that your patch shows only page 00, but Byung's patch dcs> passes Unicode directly? My patch: * character in page 00 is converted to ASCII(+ISO-8859-1) * character in other pages is replaced by '?' Byung's patch: * character in page 00 is converted to ASCII(+ISO-8859-1) * character in other pages is passed as is For example, suppose to be a file on Joliet CD, which filename is encoded as [ 0041 0042 0043 2525 3030 3535 ] in Unicode. The kernel applied my patch shows it to userland via readdir(2) as (41)(42)(43)(3F)(3F)(3F) ... "abc???" The kernel applied Byung's patch shows it as (41)(42)(43)(25)(25)(30)(30)(35)(35) ... "abc%%0055" Byung reported, > With this patch, I was able to see long file names and > non-english characters(such as korean&chinese) so i believe it > does support unicode as long as your locale is set correctly. But with my locale 'ja_JP.EUC' (which is the major in Japan on *BSD box), these strings make no sense. To see filenames which contain Japanese characters, kernel should convert it to EUC(JP). Another possible answer is encoding to UTF-8, however, I don't know UTF-8 terminal compliant with Japanese. -- Motomichi Matsuzaki Dept. of Biological Science, Fuculty of Sciences, Univ. of Tokyo, Japan To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message