From owner-freebsd-i18n Sun Nov 17 12: 9:46 2002 Delivered-To: freebsd-i18n@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 8D7B437B401 for ; Sun, 17 Nov 2002 12:09:45 -0800 (PST) Received: from nuit.iteration.net (nuit.iteration.net [198.92.249.80]) by mx1.FreeBSD.org (Postfix) with ESMTP id 7006B43E97 for ; Sun, 17 Nov 2002 12:09:44 -0800 (PST) (envelope-from keichii@nuit.iteration.net) Received: by nuit.iteration.net (Postfix, from userid 1001) id B64C811B646; Sun, 17 Nov 2002 12:09:43 -0800 (PST) Date: Sun, 17 Nov 2002 12:09:43 -0800 From: "Michael C. Wu" To: Mike Makonnen Cc: freebsd-i18n@FreeBSD.ORG Subject: Re: Creating new locales Message-ID: <20021117200943.GB42136@nuit.iteration.net> Reply-To: "Michael C. Wu" Mail-Followup-To: "Michael C. Wu" , Mike Makonnen , freebsd-i18n@FreeBSD.ORG References: <20021117041857.GA34284@matrix.identd.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20021117041857.GA34284@matrix.identd.net> User-Agent: Mutt/1.3.28i X-PGP-Fingerprint: 5025 F691 F943 8128 48A8 5025 77CE 29C5 8FA1 2E20 X-PGP-Key-ID: 0x8FA12E20 Sender: owner-freebsd-i18n@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG On Sat, Nov 16, 2002 at 08:18:57PM -0800, Mike Makonnen scribbled: | I get a font converter? No idea regarding the font issues. | | The charset encoding is utf8, so I created an LC_TYPE with a UTF2 | encoding. And that's about as far as I have gone. | I suppose next thing I have to do is create a keyboard map and a screen map. | Keymap.5 was helpfull about the first, but I'm not too sure about | what a screen map is and what I have to do about it. What's the | relationship between a font file, a keyboard map, and a screen | map (my most important question)? | | The amharic character set contains something between 300-400 | characters, so I was wondering how some other languages with | more thatn 127 characters manage to work on the console. How do | you map 2 or more keystrokes to one character? A basic | outline or pointer would really help. | reference ports/chinese/zhcon or big5con | My initial goal is to be able to cat a utf8 encoded amharic | text file. | I don't think FreeBSD has UTF8 support in console yet. Your best route would be to use X to do so. | Cheers, | Mike. | -- | GPG Key: http://www.identd.net/~mtm/gpg.key | pub 1024D/7D39509A 2002-10-08 Mike Makonnen | Key fingerprint = 5491 488A 0445 2DCC 777B 1F03 F3AB F9F8 7D39 509A ------------------------- To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-i18n" in the body of the message From owner-freebsd-i18n Tue Nov 19 14: 5:34 2002 Delivered-To: freebsd-i18n@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id A65EC37B401 for ; Tue, 19 Nov 2002 14:05:32 -0800 (PST) Received: from mail.gmx.net (mail.gmx.net [213.165.64.20]) by mx1.FreeBSD.org (Postfix) with SMTP id 92A7843E8A for ; Tue, 19 Nov 2002 14:05:30 -0800 (PST) (envelope-from alexander.pohoyda@gmx.net) Received: (qmail 16947 invoked by uid 0); 19 Nov 2002 22:05:29 -0000 Received: from p508be3a0.dip.t-dialin.net (HELO oak.pohoyda.family) (80.139.227.160) by mail.gmx.net (mp020-rz3) with SMTP; 19 Nov 2002 22:05:29 -0000 Received: from oak.pohoyda.family (localhost [127.0.0.1]) by oak.pohoyda.family (8.12.3/8.12.3) with ESMTP id gAJM5FNi096445; Tue, 19 Nov 2002 23:05:16 +0100 (CET) Received: (from apog@localhost) by oak.pohoyda.family (8.12.3/8.12.3/Submit) id gAJM5Dtk096380; Tue, 19 Nov 2002 23:05:13 +0100 (CET) To: Mike Makonnen Cc: freebsd-i18n@freebsd.org, pascal@info.tsu.ru Subject: Re: Creating new locales References: <20021117041857.GA34284@matrix.identd.net> From: Alexander Pohoyda Date: 19 Nov 2002 23:05:12 +0100 In-Reply-To: Mike Makonnen's message of "Sat, 16 Nov 2002 20:18:57 -0800" Message-ID: <87ptt19pwn.fsf@oak.pohoyda.family> Lines: 62 X-Mailer: Gnus v5.7/Emacs 20.7 Sender: owner-freebsd-i18n@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Mike Makonnen writes: > The charset encoding is utf8, so I created an LC_TYPE with a UTF2 > encoding. And that's about as far as I have gone. UTF2 is a typo, isn't it? Did you mean LC_CTYPE ? A file in /usr/share/locale/xxx.UTF-8 directory? To quote Markus Kuhn (http://www.cl.cam.ac.uk/~mgk25/unicode.html): --start Please do not write UTF-8 in any documentation text in other ways (such as utf8 or UTF_8), unless of course you refer to a variable name and not the encoding itself. --end > I suppose next thing I have to do is create a keyboard map and a screen map. > Keymap.5 was helpfull about the first, but I'm not too sure about > what a screen map is and what I have to do about it. Let me translate a description made by Ivan Pascal (http://www.tsu.ru/~pascal/unix/syscons/screen.html#screenmap): --start I have mentioned earlier, that during the output of the symbol, syscons just sends its (symbol's) code to video controller. That's not completely true. First, syscons is [re-]mapping the symbol using some internal recoding table. This "recoding table" (screenmap) is just a 256 byte-long table, so that any symbol to be outputed may be changed to some other symbol. ... --end This is used, for example, if your video controller cannot load software fonts (or does not handle them well), but can display characters you want. In this case you create a screenmap to map bytes you have to bytes your hardware expects to produce the result you want. > What's the > relationship between a font file, a keyboard map, and a screen > map (my most important question)? As far as I understand: Font file is loaded into your video controller to display bytes as symbols. Keyboard map (keymap) is used to convert keystrokes into bytes. Screenmap may be used to automatically convert bytes to some other bytes. > The amharic character set contains something between 300-400 > characters, so I was wondering how some other languages with > more thatn 127 characters manage to work on the console. How do > you map 2 or more keystrokes to one character? A basic > outline or pointer would really help. Maybe Ivan can comment on this. I CC him. -- Alexander Pohoyda To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-i18n" in the body of the message From owner-freebsd-i18n Tue Nov 19 20: 7:54 2002 Delivered-To: freebsd-i18n@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 7B75537B401 for ; Tue, 19 Nov 2002 20:07:53 -0800 (PST) Received: from smtp02.iprimus.net.au (smtp02.iprimus.net.au [210.50.76.70]) by mx1.FreeBSD.org (Postfix) with ESMTP id 651FA43E42 for ; Tue, 19 Nov 2002 20:07:52 -0800 (PST) (envelope-from tim@robbins.dropbear.id.au) Received: from dilbert.robbins.dropbear.id.au ([210.50.219.18]) by smtp02.iprimus.net.au with Microsoft SMTPSVC(5.0.2195.5600); Wed, 20 Nov 2002 15:07:50 +1100 Received: from dilbert.robbins.dropbear.id.au (6kmk8fq8qgp14yup@localhost [127.0.0.1]) by dilbert.robbins.dropbear.id.au (8.12.6/8.12.6) with ESMTP id gAK47JEi014158; Wed, 20 Nov 2002 15:07:40 +1100 (EST) (envelope-from tim@dilbert.robbins.dropbear.id.au) Received: (from tim@localhost) by dilbert.robbins.dropbear.id.au (8.12.6/8.12.6/Submit) id gAK46pFv014130; Wed, 20 Nov 2002 15:06:51 +1100 (EST) (envelope-from tim) Date: Wed, 20 Nov 2002 15:06:51 +1100 From: Tim Robbins To: Alexander Pohoyda Cc: Mike Makonnen , freebsd-i18n@FreeBSD.org Subject: Re: Creating new locales Message-ID: <20021120150651.A13502@dilbert.robbins.dropbear.id.au> References: <20021117041857.GA34284@matrix.identd.net> <87ptt19pwn.fsf@oak.pohoyda.family> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.2.5.1i In-Reply-To: <87ptt19pwn.fsf@oak.pohoyda.family>; from alexander.pohoyda@gmx.net on Tue, Nov 19, 2002 at 11:05:12PM +0100 X-OriginalArrivalTime: 20 Nov 2002 04:07:50.0804 (UTC) FILETIME=[64825D40:01C2904A] Sender: owner-freebsd-i18n@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG On Tue, Nov 19, 2002 at 11:05:12PM +0100, Alexander Pohoyda wrote: > Mike Makonnen writes: > > > The charset encoding is utf8, so I created an LC_TYPE with a UTF2 > > encoding. And that's about as far as I have gone. > > UTF2 is a typo, isn't it? Did you mean LC_CTYPE ? A file in > /usr/share/locale/xxx.UTF-8 directory? UTF2 is an old name for what is now called UTF-8, and it is what you need to put in LC_CTYPE source files (/usr/src/share/mklocale/*.src) to get something which resembles UTF-8 on FreeBSD 4.7 and earlier. (-STABLE and -CURRENT have a proper UTF-8 encoding now, and UTF2 is being phased out) See this document: Rob Pike and Ken Thompson, "Hello World", Proceedings of the Winter 1993 USENIX Technical Conference, USENIX Association, January 1993. [http://www.cs.bell-labs.com/sys/doc/utf.pdf] "Our proposal, which was originally known informally as UTF-2 and FSS-UTF, is now referred to as UTF-8 and has been approved by ISO to become Annex P to ISO 10646." Tim To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-i18n" in the body of the message