From owner-freebsd-doc Mon Mar 6 2:15: 9 2000 Delivered-To: freebsd-doc@freebsd.org Received: from nagual.pp.ru (pobrecita.freebsd.ru [194.87.13.42]) by hub.freebsd.org (Postfix) with ESMTP id 793F437BCFE; Mon, 6 Mar 2000 02:15:03 -0800 (PST) (envelope-from ache@nagual.pp.ru) Received: (from ache@localhost) by nagual.pp.ru (8.9.3/8.9.3) id NAA92836; Mon, 6 Mar 2000 13:15:02 +0300 (MSK) (envelope-from ache) Date: Mon, 6 Mar 2000 13:14:58 +0300 From: "Andrey A. Chernov" To: Nik Clayton Cc: doc@freebsd.org, www@freebsd.org, phantom@freebsd.org Subject: Re: SGML->HTML: entities translation is broken for non-Latin1 charsets Message-ID: <20000306131457.B92757@nagual.pp.ru> References: <20000304134300.A24194@nagual.pp.ru> <20000306021454.A87062@catkin.nothing-going-on.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 1.0.1i In-Reply-To: <20000306021454.A87062@catkin.nothing-going-on.org>; from nik@freebsd.org on Mon, Mar 06, 2000 at 02:14:55AM +0000 Organization: Biomechanoid Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org On Mon, Mar 06, 2000 at 02:14:55AM +0000, Nik Clayton wrote: > This is done by sgmlnorm. Last time this issue came up I didn't have a > good fix for it either. . . It seems that Hiroki Sato give us good workaround, so source patching is not needed for HTMLs processing. > If you look in jade/style/sdata.h, you'll see an array that lists entity > numbers to entity names. This is the root cause of the problem, and a > typical line from that file is > > { 0x00A9, "copy" }, > > which is why "©" becomes "\a9" when a file is processed by sgmlnorm. This can cause problems with converting to non-HTML formats (TeX, etc.), but not so urgent as www pages so can be discussed with Jade maintainers later. -- Andrey A. Chernov http://nagual.pp.ru/~ache/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message