From owner-freebsd-doc Sat Mar 4 2:43:14 2000 Delivered-To: freebsd-doc@freebsd.org Received: from nagual.pp.ru (pobrecita.freebsd.ru [194.87.13.42]) by hub.freebsd.org (Postfix) with ESMTP id 893E137B771; Sat, 4 Mar 2000 02:43:07 -0800 (PST) (envelope-from ache@nagual.pp.ru) Received: (from ache@localhost) by nagual.pp.ru (8.9.3/8.9.3) id NAA24226; Sat, 4 Mar 2000 13:43:04 +0300 (MSK) (envelope-from ache) Date: Sat, 4 Mar 2000 13:43:02 +0300 From: "Andrey A. Chernov" To: doc@freebsd.org, www@freebsd.org, phantom@freebsd.org, ru@freebsd.org Subject: SGML->HTML: entities translation is broken for non-Latin1 charsets Message-ID: <20000304134300.A24194@nagual.pp.ru> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 1.0.1i Organization: Biomechanoid Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org Looking at www.freebsd.org I found that sgml->html procedure replace things like   © etc. with their Latin1 8bit hardcoded values :-( Please fix it ASAP, non-Latin1 pages are very broken otherwise. F.e. both © and   have different 8bit codes in KOI8-R than in Latin1. Right way is to not translate &...; entities from sgml source at all and leave them in place. Browser always know better substitution for them. -- Andrey A. Chernov http://nagual.pp.ru/~ache/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message