Date: Mon, 6 Mar 2000 12:36:21 +0300
From: "Andrey A. Chernov" <ache@nagual.pp.ru>
To: Doug Barton <Doug@gorean.org>
Cc: Hiroki Sato <hrs@geocities.co.jp>, doc@FreeBSD.ORG
Subject: Re: SGML->HTML: entities translation is broken for non-Latin1 charsets
Message-ID: <20000306123621.A92642@nagual.pp.ru>
In-Reply-To: <38C348D2.C5505BFC@gorean.org>; from Doug@gorean.org on Sun, Mar 05, 2000 at 09:57:38PM -0800
References: <20000305203633.A89852@nagual.pp.ru> <200003051959.EAA00142@mail.geocities.co.jp> <20000305230729.A90274@nagual.pp.ru> <200003052105.GAA05635@mail.geocities.co.jp> <38C348D2.C5505BFC@gorean.org>
On Sun, Mar 05, 2000 at 09:57:38PM -0800, Doug Barton wrote:
> >  In addition, there seems some " " in www/ru/index.sgml.
> >  They should be replaced with " ".
>
> 	We ran into this problem at work. The type entities are
> definitely the way to go, and should be preserved in the final HTML
> output. You don't want the numerical ones because they may decode into
> something entirely different in someone else's character set. Any decent
> HTML reference has a list of them, I can dig mine out if no one can find
> one.

Speaking of strict standards conformance: according to the HTML specs, all
numerical entities are decoded per Unicode, not per the local page charset.
But some browser implementations may not follow this. In any case, symbolic
names are the safest.

-- 
Andrey A. Chernov
<ache@nagual.pp.ru>
http://nagual.pp.ru/~ache/

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-doc" in the body of the message
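
To illustrate the point about numerical entities, here is a minimal Python
sketch. It is only an illustration, not part of the FreeBSD doc toolchain.
It assumes the entity in question is the no-break space (&#160; / &nbsp;)
and that the page charset is KOI8-R; both are assumptions made for the
example. The "misdecoded" value imitates a hypothetical nonconforming
browser that treats the number as a raw byte in the page charset instead of
a Unicode code point.

```python
import html

# A conforming decoder resolves both the numeric and the symbolic reference
# to the same Unicode code point, U+00A0, regardless of the page charset.
numeric = html.unescape("&#160;")
symbolic = html.unescape("&nbsp;")

# Hypothetical nonconforming behaviour: interpret 160 as a byte value in the
# page's own charset (KOI8-R assumed here) instead of as a code point.
misdecoded = bytes([160]).decode("koi8_r")

print("numeric == symbolic:", numeric == symbolic)
print("conforming result:   U+%04X" % ord(numeric))
print("misdecoded result:   U+%04X" % ord(misdecoded))
```

If the two printed code points differ, that is the "something entirely
different" effect described in the quoted text, and it shows why symbolic
names are the safer choice when some browsers get the numeric form wrong.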