From owner-freebsd-hackers Wed Jun 10 19:51:53 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id TAA27662 for freebsd-hackers-outgoing; Wed, 10 Jun 1998 19:51:53 -0700 (PDT) (envelope-from owner-freebsd-hackers@FreeBSD.ORG) Received: from coconut.itojun.org (root@coconut.itojun.org [210.160.95.97]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id TAA27573 for ; Wed, 10 Jun 1998 19:51:28 -0700 (PDT) (envelope-from itojun@itojun.org) Received: from localhost (itojun@localhost.itojun.org [127.0.0.1]) by coconut.itojun.org (8.8.8+3.0Wbeta12/3.6W) with ESMTP id LAA07853; Thu, 11 Jun 1998 11:50:08 +0900 (JST) To: Gary Kline cc: tlambert@primenet.com, hackers@FreeBSD.ORG In-reply-to: kline's message of Wed, 10 Jun 1998 19:31:42 MST. <199806110231.TAA09494@tao.thought.org> X-Template-Reply-To: itojun@itojun.org X-Template-Return-Receipt-To: itojun@itojun.org X-PGP-Fingerprint: F8 24 B4 2C 8C 98 57 FD 90 5F B4 60 79 54 16 E2 Subject: Re: internationalization From: Jun-ichiro itojun Itoh Date: Thu, 11 Jun 1998 11:50:08 +0900 Message-ID: <7849.897533408@coconut.itojun.org> Sender: owner-freebsd-hackers@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG >> I've been working on iso-2022 encoding support for runelocale (xpg4) >> library. At this moment I'm working on some specific packages >> (for example, nvi or scheduler software called "sch") but will be >> able to merge the modification into xpg4 library part. > Wonderful! With the broadly international reach of > FreeBSD I was hoping that someone in China|Japan|Taiwan > would be into this. There may be a broader need for > wide character support--say Sanskrit and Thai. ... No problem under my framework, if proper multilingual renderer such as modified xterm (sanskritterm, or thaiter?) is there. I'm only working on internal string representation of multibyte, and multilingual string. For X11 modification for total multilingualization, people at Waseda Univ (japan). is doing a big project. I have never seen their code (I dunno if it is redistributed or not) but the screenshot is simply STUNNING. http://www.mling.waseda.ac.jp/ (text is in Japanese but you can view the screenshot) >> > This was my first leaning, but I'm increasingly >> > going toward the ISO families. >> Yes, iso-2022 families are quite important for supporting >> asian languages. Unicode is, for us Japanese, quite incomplete and >> unexpandable. > Is there a way of explaining (briefly :) how the > iso-2022 character set is displayed? This point > came up the other day and I guessed that it was > done by a ((large)) table-lookup under X. I don't see what you exactly mean, could you please explain? If you are talking about font bitmap, bitmap font for asian characters are, of course, very large. This is because we have multiple (3 for Japanese, almost 10 for mainland chinese) 96x96 character sets. (/usr/X11/lib/X11/fonts/misc may have some asian bitmap font, if you have installed those) There are several outline (vector) font there, but it needs CPU power to be rendered. If you are talking about visibility control in CUI (curses), scrwidth() should pass you the necessary screen width to render a wchar_t letter. I don't know which standard defines scrwidth(), but it is implemented in runelocale code found in BSDI. >> > :-( ! >> > How does the ISO2022 model work here? Isn't it the >> > same for Japanese and Chinese? >> Yes, for Japanese, Chinese and Korean iso-2022 based model (euc-xx >> falls into the category) is really important. However, I personally >> believe that filenames must be kept in C locale for simplicity... > I'll check out iso-2022 further; if you know of any > english-language docs on this, please sent me a > pointer. "Understanding Japanese Information Processing" by Ken Lunde, from O'reilly should be a best starting point. If you would like to start it cheaper, there are various webpages. itojun To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-hackers" in the body of the message