Date: Mon, 6 Mar 2000 01:17:12 +0300 From: "Andrey A. Chernov" <ache@nagual.pp.ru> To: Chris Costello <chris@calldei.com> Cc: doc@FreeBSD.ORG, www@FreeBSD.ORG, phantom@FreeBSD.ORG, ru@FreeBSD.ORG Subject: Re: SGML->HTML: entities translation is broken for non-Latin1 charsets Message-ID: <20000306011712.A90731@nagual.pp.ru> In-Reply-To: <20000305160800.E97199@holly.calldei.com>; from chris@calldei.com on Sun, Mar 05, 2000 at 04:08:00PM -0600 References: <20000304134300.A24194@nagual.pp.ru> <20000305160800.E97199@holly.calldei.com>
next in thread | previous in thread | raw e-mail | index | archive | help
On Sun, Mar 05, 2000 at 04:08:00PM -0600, Chris Costello wrote: > On Saturday, March 04, 2000, Andrey A. Chernov wrote: > > Right way is to not translate &...; entities from sgml source at all and > > leave them in place. Browser always know better substitution for them. > > No, you'd have to have a list of some sort of the HTML > standard entities and use them when translating to HTML from > SGML. There are other SGML entities that begin with &...; such > as &rel.current; which can't be preserved into HTML. Then > there's the issue of other various formats like TeX and RTF. > Will we handle those the same too? Or will this just be an HTML > issue? Yes, I agree to have a list of HTML standard entities which will be not converted in sgml->html procedure. Other formats must use its own symbolic names or hardcoded values not from latin1 but from native tables in this case. But for HTML better variant is to keep all symbolic HTML entities untouched. If adding all of them will be hard, we need to add at least all entities above ASCII and not latin1 letters (like and © are). -- Andrey A. Chernov <ache@nagual.pp.ru> http://nagual.pp.ru/~ache/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20000306011712.A90731>