From owner-freebsd-doc Mon Mar 6 1:36:42 2000 Delivered-To: freebsd-doc@freebsd.org Received: from nagual.pp.ru (pobrecita.freebsd.ru [194.87.13.42]) by hub.freebsd.org (Postfix) with ESMTP id E09F037BCF1 for ; Mon, 6 Mar 2000 01:36:39 -0800 (PST) (envelope-from ache@nagual.pp.ru) Received: (from ache@localhost) by nagual.pp.ru (8.9.3/8.9.3) id MAA92663; Mon, 6 Mar 2000 12:36:23 +0300 (MSK) (envelope-from ache) Date: Mon, 6 Mar 2000 12:36:21 +0300 From: "Andrey A. Chernov" To: Doug Barton Cc: Hiroki Sato , doc@FreeBSD.ORG Subject: Re: SGML->HTML: entities translation is broken for non-Latin1charsets Message-ID: <20000306123621.A92642@nagual.pp.ru> References: <20000305203633.A89852@nagual.pp.ru> <200003051959.EAA00142@mail.geocities.co.jp> <20000305230729.A90274@nagual.pp.ru> <200003052105.GAA05635@mail.geocities.co.jp> <38C348D2.C5505BFC@gorean.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 1.0.1i In-Reply-To: <38C348D2.C5505BFC@gorean.org>; from Doug@gorean.org on Sun, Mar 05, 2000 at 09:57:38PM -0800 Organization: Biomechanoid Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org On Sun, Mar 05, 2000 at 09:57:38PM -0800, Doug Barton wrote: > > In addition, there seems some " " in www/ru/index.sgml. > > They should be replaced with " ". > > We ran into this problem at work. The   type entities are > definitely the way to go, and should be preserved in the final HTML > output. You don't want the numerical ones because they may decode into > something entirely different in someone else's character set. Any decent > HTML reference has a list of them, I can dig mine out if no one can find > one. Speaking about strict standard conformance, all numerical entities decoded per Unicode according to HTML specs, not per local page charset. But some browsers implementations may not follow this. In any case symbolic names are most safe. -- Andrey A. Chernov http://nagual.pp.ru/~ache/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message