Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 6 Mar 2000 12:36:21 +0300
From:      "Andrey A. Chernov" <ache@nagual.pp.ru>
To:        Doug Barton <Doug@gorean.org>
Cc:        Hiroki Sato <hrs@geocities.co.jp>, doc@FreeBSD.ORG
Subject:   Re: SGML->HTML: entities translation is broken for non-Latin1charsets
Message-ID:  <20000306123621.A92642@nagual.pp.ru>
In-Reply-To: <38C348D2.C5505BFC@gorean.org>; from Doug@gorean.org on Sun, Mar 05, 2000 at 09:57:38PM -0800
References:  <20000305203633.A89852@nagual.pp.ru> <200003051959.EAA00142@mail.geocities.co.jp> <20000305230729.A90274@nagual.pp.ru> <200003052105.GAA05635@mail.geocities.co.jp> <38C348D2.C5505BFC@gorean.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sun, Mar 05, 2000 at 09:57:38PM -0800, Doug Barton wrote:
> >  In addition, there seems some "&#160;" in www/ru/index.sgml.
> >  They should be replaced with "&nbsp;".
> 
> 	We ran into this problem at work. The &nbsp; type entities are
> definitely the way to go, and should be preserved in the final HTML
> output. You don't want the numerical ones because they may decode into
> something entirely different in someone else's character set. Any decent
> HTML reference has a list of them, I can dig mine out if no one can find
> one. 

Speaking about strict standard conformance, all numerical entities decoded
per Unicode according to HTML specs, not per local page charset. But some
browsers implementations may not follow this. In any case symbolic names
are most safe.

-- 
Andrey A. Chernov
<ache@nagual.pp.ru>
http://nagual.pp.ru/~ache/


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-doc" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20000306123621.A92642>