From owner-freebsd-doc Sun Mar 5 13:36: 1 2000 Delivered-To: freebsd-doc@freebsd.org Received: from nagual.pp.ru (pobrecita.freebsd.ru [194.87.13.42]) by hub.freebsd.org (Postfix) with ESMTP id F1A5837BA65; Sun, 5 Mar 2000 13:35:55 -0800 (PST) (envelope-from ache@nagual.pp.ru) Received: (from ache@localhost) by nagual.pp.ru (8.9.3/8.9.3) id AAA90609; Mon, 6 Mar 2000 00:35:50 +0300 (MSK) (envelope-from ache) Date: Mon, 6 Mar 2000 00:35:47 +0300 From: "Andrey A. Chernov" To: Hiroki Sato , phantom@freebsd.org Cc: doc@freebsd.org Subject: Re: SGML->HTML: entities translation is broken for non-Latin1 charsets Message-ID: <20000306003545.A90564@nagual.pp.ru> References: <20000305203633.A89852@nagual.pp.ru> <200003051959.EAA00142@mail.geocities.co.jp> <20000305230729.A90274@nagual.pp.ru> <200003052105.GAA05635@mail.geocities.co.jp> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 1.0.1i In-Reply-To: <200003052105.GAA05635@mail.geocities.co.jp>; from hrs@geocities.co.jp on Mon, Mar 06, 2000 at 06:02:12AM +0900 Organization: Biomechanoid Sender: owner-freebsd-doc@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org On Mon, Mar 06, 2000 at 06:02:12AM +0900, Hiroki Sato wrote: > Perhaps it is impossible from command line. Alternatively, > try to add the following line to "includes.sgml" and > build the HTML files. > > > > This overrides the HTMLlat1 entity. I never try to build www pages from sgmls, perhaps Aleksey (phantom) can try. BTW, can it be replaced diretly in HTMLlat1 as a patch? I see no harm if   © etc. will appearse in latin1 docs too instead of hardcoded values. Browsers usually have manual code page switching ability, so user entered to FreeBSD www may have different code page manually selected. Since FreeBSD and mirrors Apache not instruct browser to switch to iso-8859-1, user can see FreeBSD code page in different encoding he pre-select. In such case all hardcoded values will be wrong, but all symbolic names still show properly. So I vote for symbolic names even for Latin1 pages. > # Actually, Japanese-doc(uses 8bit character code) has > # the same problem. Of course, all non-Latin1 pages are broken. > In addition, there seems some " " in www/ru/index.sgml. > They should be replaced with " ". Yes. -- Andrey A. Chernov http://nagual.pp.ru/~ache/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-doc" in the body of the message