From owner-freebsd-doc Sun Mar 5 14:17:30 2000 Delivered-To: freebsd-doc@freebsd.org Received: from nagual.pp.ru (pobrecita.freebsd.ru [194.87.13.42]) by hub.freebsd.org (Postfix) with ESMTP id A2B7737B881; Sun, 5 Mar 2000 14:17:25 -0800 (PST) (envelope-from ache@nagual.pp.ru) Received: (from ache@localhost) by nagual.pp.ru (8.9.3/8.9.3) id BAA90802; Mon, 6 Mar 2000 01:17:13 +0300 (MSK) (envelope-from ache) Date: Mon, 6 Mar 2000 01:17:12 +0300 From: "Andrey A. Chernov" To: Chris Costello Cc: doc@FreeBSD.ORG, www@FreeBSD.ORG, phantom@FreeBSD.ORG, ru@FreeBSD.ORG Subject: Re: SGML->HTML: entities translation is broken for non-Latin1 charsets Message-ID: <20000306011712.A90731@nagual.pp.ru> References: <20000304134300.A24194@nagual.pp.ru> <20000305160800.E97199@holly.calldei.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 1.0.1i In-Reply-To: <20000305160800.E97199@holly.calldei.com>; from chris@calldei.com on Sun, Mar 05, 2000 at 04:08:00PM -0600 Organization: Biomechanoid Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org On Sun, Mar 05, 2000 at 04:08:00PM -0600, Chris Costello wrote: > On Saturday, March 04, 2000, Andrey A. Chernov wrote: > > Right way is to not translate &...; entities from sgml source at all and > > leave them in place. Browser always know better substitution for them. > > No, you'd have to have a list of some sort of the HTML > standard entities and use them when translating to HTML from > SGML. There are other SGML entities that begin with &...; such > as &rel.current; which can't be preserved into HTML. Then > there's the issue of other various formats like TeX and RTF. > Will we handle those the same too? Or will this just be an HTML > issue? Yes, I agree to have a list of HTML standard entities which will be not converted in sgml->html procedure. Other formats must use its own symbolic names or hardcoded values not from latin1 but from native tables in this case. But for HTML better variant is to keep all symbolic HTML entities untouched. If adding all of them will be hard, we need to add at least all entities above ASCII and not latin1 letters (like   and © are). -- Andrey A. Chernov http://nagual.pp.ru/~ache/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message