From owner-freebsd-doc Sat Dec 16 01:05:45 2000
From owner-freebsd-doc@FreeBSD.ORG Sat Dec 16 01:05:42 2000
Return-Path:
Delivered-To: freebsd-doc@freebsd.org
Received: from peorth.iteration.net (peorth.iteration.net [208.190.180.178])
	by hub.freebsd.org (Postfix) with ESMTP id 0FD7437B400;
	Sat, 16 Dec 2000 01:05:42 -0800 (PST)
Received: by peorth.iteration.net (Postfix, from userid 1001)
	id 53DD357463; Sat, 16 Dec 2000 03:06:04 -0600 (CST)
Date: Sat, 16 Dec 2000 03:06:04 -0600
From: "Michael C . Wu"
To: doc@freebsd.org, i18n@freebsd.org
Subject: Docbook and CJK languages
Message-ID: <20001216030604.B46336@peorth.iteration.net>
Reply-To: "Michael C . Wu"
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
X-PGP-Fingerprint: 5025 F691 F943 8128 48A8 5025 77CE 29C5 8FA1 2E20
X-PGP-Key-ID: 0x8FA12E20
Sender: keichii@peorth.iteration.net
Sender: owner-freebsd-doc@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

While working on some freebsd-taiwan DocBook documents, we discovered
that DocBook/SGML does not handle 2-byte characters correctly.

For example, I have this line of text ("AA" and "BB" each stand for
one 2-byte character):

AABBAABBAABBAABB

When I compile this with the output format set to plain text, the
correct way to break it into two lines would be:

AABBAABBAABB\n
AABB\n

However, sometimes the output comes out looking like this:

AABBAABBAABBA\n
ABB\n

(Note the AA character split in two at the end of the first line.)

This leaves the whole document broken and unreadable, since all
subsequent encoding/decoding is off by one. The problem can also
occur several times within the same document.

Is there any way to fix this? Is there an SGML tag that I can
specify? Or is this a missing feature of DocBook?

-- 
+------------------------------------------------------------------+
| keichii@peorth.iteration.net         | keichii@bsdconspiracy.net |
| http://peorth.iteration.net/~keichii | Yes, BSD is a conspiracy. |
+------------------------------------------------------------------+
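
The line-breaking failure described in the message can be reproduced
outside DocBook. The following is only a minimal sketch in Python, not
what the DocBook toolchain actually does: it assumes the text is
encoded as Big5 (the message does not name the encoding), uses two
arbitrary characters in place of "AA" and "BB", and picks a line width
of 13 bytes purely to trigger the split. The helper names are
hypothetical.

```python
from typing import List

ENCODING = "big5"  # assumption: the original message does not name the encoding


def wrap_naive(data: bytes, width: int) -> List[bytes]:
    """Cut every `width` bytes, ignoring character boundaries (the failure mode)."""
    return [data[i:i + width] for i in range(0, len(data), width)]


def wrap_char_aware(text: str, width: int, encoding: str = ENCODING) -> List[bytes]:
    """Break into lines of at most `width` bytes, only between whole characters."""
    lines: List[bytes] = []
    current = b""
    for ch in text:
        encoded = ch.encode(encoding)
        # Start a new line if this character would not fit in full.
        if current and len(current) + len(encoded) > width:
            lines.append(current)
            current = b""
        current += encoded
    if current:
        lines.append(current)
    return lines


def decodes_cleanly(line: bytes, encoding: str = ENCODING) -> bool:
    """Return True if `line` is a valid byte sequence in `encoding`."""
    try:
        line.decode(encoding)
        return True
    except UnicodeDecodeError:
        return False


# Two 2-byte characters repeated four times stand in for AABBAABBAABBAABB.
text = "中文" * 4
data = text.encode(ENCODING)

naive = wrap_naive(data, 13)       # first line ends in half a character
aware = wrap_char_aware(text, 13)  # breaks after 12 bytes instead

for label, lines in (("naive", naive), ("aware", aware)):
    print(label, [(len(l), decodes_cleanly(l)) for l in lines])
```

The byte-oriented wrapper yields lines of 13 and 3 bytes, matching the
broken output in the message, while the character-aware wrapper yields
12 and 4 bytes. Any real fix would have to make the text backend break
lines on character boundaries in a similar way.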