Date: Thu, 15 Nov 2001 21:40:17 +0900 (JST) From: Hiroki Sato <hrs@eos.ocn.ne.jp> To: horcicka@FreeBSD.cz Cc: freebsd-doc@FreeBSD.org Subject: Re: Why TIDY can never work correctly with ISO-8859-2 and others Message-ID: <20011115.214017.71143189.hrs@sekine00.ee.noda.sut.ac.jp> In-Reply-To: <20011115105650.W57038-100000@dual.ms.mff.cuni.cz> References: <20011115105650.W57038-100000@dual.ms.mff.cuni.cz>
next in thread | previous in thread | raw e-mail | index | archive | help
Martin Horcicka <horcicka@FreeBSD.cz> wrote in <20011115105650.W57038-100000@dual.ms.mff.cuni.cz>: horcicka> And if you use char-encoding: raw - character entities with values above 255 horcicka> are not printed as entities - this is really bad in 8-bit encodings. Yes, Japanese docs also suffer from it. The input routine of tidy expands any entities first, even if -raw flag is specified. horcicka> In my opinion Tidy cannot be used for encodings it does not natively support horcicka> (i.e. for Russian and Czech (- still not in main CVS) translations of pages horcicka> and docs). I think so, too. As a workaround, we can apply a patch and use the modified version of tidy that can suppress to interpret given entities as entities themselves, but I do not know if it will be a good solution. -- | Hiroki Sato <hrs@FreeBSD.org> | <hrs@eos.ocn.ne.jp> To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20011115.214017.71143189.hrs>