From owner-freebsd-doc  Sun Mar  5 21:58:10 2000
Delivered-To: freebsd-doc@freebsd.org
Received: from dt051n0b.san.rr.com (dt051n0b.san.rr.com [204.210.32.11])
	by hub.freebsd.org (Postfix) with ESMTP id 9B37437BC75
	for <doc@FreeBSD.ORG>; Sun,  5 Mar 2000 21:57:58 -0800 (PST)
	(envelope-from Doug@gorean.org)
Received: from gorean.org (doug@master [10.0.0.2])
	by dt051n0b.san.rr.com (8.9.3/8.9.3) with ESMTP id VAA07619;
	Sun, 5 Mar 2000 21:57:38 -0800 (PST)
	(envelope-from Doug@gorean.org)
Message-ID: <38C348D2.C5505BFC@gorean.org>
Date: Sun, 05 Mar 2000 21:57:38 -0800
From: Doug Barton <Doug@gorean.org>
Organization: Triborough Bridge & Tunnel Authority
X-Mailer: Mozilla 4.72 [en] (X11; U; FreeBSD 4.0-CURRENT-0302 i386)
X-Accept-Language: en
MIME-Version: 1.0
To: Hiroki Sato <hrs@geocities.co.jp>
Cc: ache@nagual.pp.ru, doc@FreeBSD.ORG
Subject: Re: SGML->HTML: entities translation is broken for non-Latin1 charsets
References: <20000305203633.A89852@nagual.pp.ru>
		<200003051959.EAA00142@mail.geocities.co.jp>
		<20000305230729.A90274@nagual.pp.ru> <200003052105.GAA05635@mail.geocities.co.jp>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-doc@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Hiroki Sato wrote:

>  In addition, there seem to be some "&#160;" in www/ru/index.sgml.
>  They should be replaced with "&nbsp;".

	We ran into this problem at work. Named entities such as &nbsp; are
definitely the way to go, and they should be preserved in the final HTML
output. You don't want the numeric ones because they may decode into
something entirely different in someone else's character set. Any decent
HTML reference has a list of the named entities; I can dig mine out if no
one can find one.
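
	For what it's worth, here is a rough sketch of how the numeric
references could be swapped for named ones in bulk. It is only an
illustration, not something from the doc build: the function name
numeric_to_named is made up, and it assumes Python's standard
html.entities.codepoint2name table (the HTML 4 entity names) is an
acceptable list. References with no named equivalent are left alone.

```python
import re
from html.entities import codepoint2name

# Matches decimal (&#160;) and hexadecimal (&#xA0;) character references.
_NUMERIC_REF = re.compile(r"&#(?:[xX]([0-9A-Fa-f]+)|([0-9]+));")


def numeric_to_named(text):
    """Replace numeric character references with named entities.

    Hypothetical helper: only code points listed in codepoint2name
    are rewritten; everything else is returned unchanged.
    """
    def swap(match):
        hex_digits, dec_digits = match.groups()
        codepoint = int(hex_digits, 16) if hex_digits else int(dec_digits)
        name = codepoint2name.get(codepoint)
        # Keep the original reference when no entity name is known.
        return "&%s;" % name if name else match.group(0)

    return _NUMERIC_REF.sub(swap, text)


if __name__ == "__main__":
    print(numeric_to_named("FreeBSD&#160;Project"))
```

	Run over something like www/ru/index.sgml, this would turn each
"&#160;" into "&nbsp;" without touching the rest of the markup.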

Doug
-- 
"Welcome to the desert of the real." 

    - Laurence Fishburne as Morpheus, "The Matrix"

