From: Marc Fonvieille
To: Gabor Kovesdan
Cc: doc@FreeBSD.org
Date: Sat, 29 Dec 2012 17:31:33 +0100
Subject: Re: Please review, small SGML entity cleanup

On Sat, Dec 29, 2012 at 02:03:19PM +0100, Gabor Kovesdan wrote:
> On 2012.12.29. 13:48, Ulrich Spörlein wrote:
> > On Sat, 2012-12-29 at 12:08:46 +0100, Gabor Kovesdan wrote:
> > > On 2012.12.28. 18:14, Ulrich Spörlein wrote:
> > > > The DE and FR articles are a hodgepodge of SGML entities and direct,
> > > > 8bit chars, with the former being the majority. This patch cleans this
> > > > up a little, although we should eventually switch this all to UTF-8,
> > > > obviously.
> > > Don't they work with direct chars? Once we made a step to that direction
> > They probably will, and I have no clue why we used entities for German
> > and French, but the usual encodings for Russian and Japanese, etc.
> >
> > > so this one would be one step back. If possible, it would be better to
> > > convert the entities to direct chars instead of the opposite.
> > In the end, sure. But that's a larger project of moving from
> > de_DE.ISO8859-1 -> de_DE (with an implied UTF-8 encoding, as is required
> > by XML anyway, the implied part, not the exact encoding).
> It isn't required by XML. If you omit the encoding part of the XML
> declaration, the content is treated as UTF-8 but it is not a requirement
> at all.
> >
> > I don't think this commit is a step back, because the documents need to
> > be converted using a long series of s/&uuml;/ü/g, anyway. And the
> > current mish-mash is just weird.
> I don't see any reason why we cannot do this conversion right now in
> ISO-8859-1. (Actually, I did, but people kept introducing new redundant
> entities.) ISO-8859-1 isn't any harder to type than UTF-8. If you want
> consistency (which imho isn't that important at this point since there
> are lots of upcoming changes) then why not move to the right direction
> of consistency?
>

I'm currently working on a translation without entities and everything
is fine (see
http://people.freebsd.org/~blackend/doc/fr_FR.ISO8859-1/books/handbook/bsdinstall.html).
It's "pure" ISO-8859-1.

--
Marc
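
For illustration, the entity-to-direct-character conversion discussed in this thread (the "long series of s/&uuml;/ü/g") could be sketched roughly as below. This is only a sketch under stated assumptions, not the script anyone on the list used: the function name `entities_to_latin1` and the `KEEP` set are hypothetical, and it relies on Python's standard HTML entity table on the assumption that the named character entities in the DE and FR documents match it. Entities that are not in that table (for example project-specific ones) or that do not fit in ISO-8859-1 are left untouched.

```python
import re
from html.entities import name2codepoint

# Assumption: these stay escaped. amp/lt/gt/quot/apos are markup-significant,
# and nbsp would become invisible if written as a direct character.
KEEP = {"amp", "lt", "gt", "quot", "apos", "nbsp"}

ENTITY_RE = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def entities_to_latin1(text: str) -> str:
    """Replace named entities with direct characters that fit in ISO-8859-1."""

    def repl(match: "re.Match[str]") -> str:
        name = match.group(1)
        codepoint = name2codepoint.get(name)
        # Leave kept, unknown, or non-Latin-1 entities exactly as written.
        if name in KEEP or codepoint is None or not 0xA0 <= codepoint <= 0xFF:
            return match.group(0)
        return chr(codepoint)

    return ENTITY_RE.sub(repl, text)
```

To apply it to a file, one would read and write with `encoding="iso-8859-1"` so that the output stays "pure" ISO-8859-1, as in the translation mentioned above.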