From owner-freebsd-doc@FreeBSD.ORG Thu Feb 5 08:50:15 2004 Return-Path: Delivered-To: freebsd-doc@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 9214C16A4CE; Thu, 5 Feb 2004 08:50:15 -0800 (PST) Received: from smtp.eos.ocn.ne.jp (eos.ocn.ne.jp [211.6.83.117]) by mx1.FreeBSD.org (Postfix) with ESMTP id 65E8043D45; Thu, 5 Feb 2004 08:50:07 -0800 (PST) (envelope-from hrs@FreeBSD.org) Received: from delta.allbsd.org (p37180-adsao12honb4-acca.tokyo.ocn.ne.jp [219.161.134.180]) by smtp.eos.ocn.ne.jp (Postfix) with ESMTP id CC71A4D2A; Fri, 6 Feb 2004 01:50:05 +0900 (JST) Received: from localhost (alph.allbsd.org [192.168.0.10]) by delta.allbsd.org (8.12.9p2/8.12.9) with ESMTP id i15GnsA2075692; Fri, 6 Feb 2004 01:49:54 +0900 (JST) (envelope-from hrs@FreeBSD.org) Date: Fri, 06 Feb 2004 01:49:40 +0900 (JST) Message-Id: <20040206.014940.23072599.hrs@eos.ocn.ne.jp> To: ale@FreeBSD.org From: Hiroki Sato In-Reply-To: <20040205063847.GA13136@phantom.cris.net> References: <20040204.171343.23008681.hrs@eos.ocn.ne.jp> <402171B7.7020205@FreeBSD.org> <20040205063847.GA13136@phantom.cris.net> X-PGPkey-fingerprint: BDB3 443F A5DD B3D0 A530 FFD7 4F2C D3D8 2793 CF2D X-Mailer: Mew version 4.0.62 on Emacs 21.3.1 / Mule 5.0 (SAKAKI) Mime-Version: 1.0 Content-Type: Multipart/Signed; protocol="application/pgp-signature"; micalg=pgp-sha1; boundary="--Security_Multipart(Fri_Feb__6_01_49_40_2004_155)--" Content-Transfer-Encoding: 7bit cc: freebsd-doc@FreeBSD.org cc: hrs@FreeBSD.org cc: phantom@FreeBSD.org.ua Subject: Re: tidy flag X-BeenThere: freebsd-doc@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Documentation project List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 05 Feb 2004 16:50:15 -0000 ----Security_Multipart(Fri_Feb__6_01_49_40_2004_155)-- Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit Alexey Zelkin wrote in <20040205063847.GA13136@phantom.cris.net>: phantom> On Wed, Feb 04, 2004 at 11:27:03PM +0100, Alex Dupre wrote: phantom> > Ok, the question then becomes: is it possible to replace the -preserve phantom> > tidy-stable flag with the -numeric tidy-devel flag? Otherwise can you phantom> > send me a pratical example where -preserve is needed? We (Thierry Thomas phantom> > and me) will try ourself. phantom> phantom> Well. Try below html code with -preserve and without. You'll see a phantom> difference. Actually most annoying things was a 'entity expansion', but phantom> there were also some problems with non-ASCII symbols processing under phantom> some conditions (but unfortunatelly i don't remember details). phantom> phantom> phantom> phantom> NBSP -   phantom> COPY - © phantom> phantom> The problem is that the result of the expansion should depend on the html doc's charset/encoding. For example, in euc-jp, © should be {0x8f, 0xa2, 0xed}, but tidy always think it as 0xa9. And many browsers interpret © as a raw character in the html doc's charset (euc-jp, in this case).  , ©, ·, and other >159 characters in euc-jp are different from iso-8859-*. While according to the XML specification it is unambiguous (&#xxx; is always interpreted as a Unicode character), I think it is better that entity is preserved as it is at the present moment. Tidy does not know the relationship between euc-jp and Unicode, so a lot of Japanese docs will be broken without -preserve. -- | Hiroki SATO ----Security_Multipart(Fri_Feb__6_01_49_40_2004_155)-- Content-Type: application/pgp-signature Content-Transfer-Encoding: 7bit -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.3 (FreeBSD) iD8DBQBAInQkTyzT2CeTzy0RAsMjAJ0QPmr4dVhCifRvH/K7p5nhzbduMgCglj57 tAWjiW04IIXrbV1+f+q108Y= =DDXG -----END PGP SIGNATURE----- ----Security_Multipart(Fri_Feb__6_01_49_40_2004_155)----