From owner-freebsd-doc Sun Mar 5 21:58:10 2000 Delivered-To: freebsd-doc@freebsd.org Received: from dt051n0b.san.rr.com (dt051n0b.san.rr.com [204.210.32.11]) by hub.freebsd.org (Postfix) with ESMTP id 9B37437BC75 for ; Sun, 5 Mar 2000 21:57:58 -0800 (PST) (envelope-from Doug@gorean.org) Received: from gorean.org (doug@master [10.0.0.2]) by dt051n0b.san.rr.com (8.9.3/8.9.3) with ESMTP id VAA07619; Sun, 5 Mar 2000 21:57:38 -0800 (PST) (envelope-from Doug@gorean.org) Message-ID: <38C348D2.C5505BFC@gorean.org> Date: Sun, 05 Mar 2000 21:57:38 -0800 From: Doug Barton Organization: Triborough Bridge & Tunnel Authority X-Mailer: Mozilla 4.72 [en] (X11; U; FreeBSD 4.0-CURRENT-0302 i386) X-Accept-Language: en MIME-Version: 1.0 To: Hiroki Sato Cc: ache@nagual.pp.ru, doc@FreeBSD.ORG Subject: Re: SGML->HTML: entities translation is broken for non-Latin1charsets References: <20000305203633.A89852@nagual.pp.ru> <200003051959.EAA00142@mail.geocities.co.jp> <20000305230729.A90274@nagual.pp.ru> <200003052105.GAA05635@mail.geocities.co.jp> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org Hiroki Sato wrote: > In addition, there seems some " " in www/ru/index.sgml. > They should be replaced with " ". We ran into this problem at work. The   type entities are definitely the way to go, and should be preserved in the final HTML output. You don't want the numerical ones because they may decode into something entirely different in someone else's character set. Any decent HTML reference has a list of them, I can dig mine out if no one can find one. Doug -- "Welcome to the desert of the real." - Laurence Fishburne as Morpheus, "The Matrix" To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message