Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 8 Mar 2000 14:51:58 +0300
From:      "Andrey A. Chernov" <ache@nagual.pp.ru>
To:        Hiroki Sato <hrs@geocities.co.jp>
Cc:        phantom@FreeBSD.ORG, doc@FreeBSD.ORG
Subject:   Re: SGML->HTML: entities translation is broken for non-Latin1 charsets
Message-ID:  <20000308145158.A7844@nagual.pp.ru>
In-Reply-To: <200003081024.TAA24457@mail.geocities.co.jp>; from hrs@geocities.co.jp on Wed, Mar 08, 2000 at 07:20:36PM %2B0900
References:  <20000306003545.A90564@nagual.pp.ru> <20000305151810.A200@scorpion.crimea.ua> <20000306130945.A92757@nagual.pp.ru> <20000305203633.A89852@nagual.pp.ru> <20000306130945.A92757@nagual.pp.ru> <200003081024.TAA24457@mail.geocities.co.jp>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Mar 08, 2000 at 07:20:36PM +0900, Hiroki Sato wrote:
>  the last conversion of >127 characters, so &lt; is output as
>  an entity &lt;, but &nbsp; is output as raw code #160.

As I already write, &nbsp; -> &#160; conversion is still valid per HTML
specs since numeric entities interpreted using Unicode, not local charset.
But not all browsers implements it properly :-(

&nbsp; -> \xA0 is invalid per standard sice binary \xA0 interpreted
according to local charset.

>  This problem is unavoidable as long as we use the current version
>  of tidy.  We can build doc with NO_TIDY flag to avoid the problem
>  tentatively (actually do so now in Japanese-doc), but I personally
>  don't think this is a reasonable way.
> 
>  To tell the truth, this was pointed out and submitted a patch to
>  fix it by Kuriyama-san before.  It seemed that tidy developers
>  didn't think it an important issue.

We need to distinguish between short term and long term solutions. Short
term solution is any workaround to unbroke non-Latin1 www and docs right
now. Long term solution is either utilities local patches or contacting
with their maintainers.

So, if you think that only way to unbroke FAQ right now is NO_TIDY, it
must be applied regardless of possible service/features lost in this
step. Things must be not broken first or not builded at all. All other
enhancements are optional.

-- 
Andrey A. Chernov
<ache@nagual.pp.ru>
http://nagual.pp.ru/~ache/


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-doc" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20000308145158.A7844>