Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 29 Dec 2012 17:31:33 +0100
From:      Marc Fonvieille <blackend@FreeBSD.org>
To:        Gabor Kovesdan <gabor@FreeBSD.org>
Cc:        doc@FreeBSD.org
Subject:   Re: Please review, small SGML entity cleanup
Message-ID:  <20121229163133.GA1567@emphyrio.blackend.org>
In-Reply-To: <50DEEA17.90802@FreeBSD.org>
References:  <20121228171424.GZ69724@acme.spoerlein.net> <50DECF3E.2020502@FreeBSD.org> <20121229124833.GC69724@acme.spoerlein.net> <50DEEA17.90802@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Sat, Dec 29, 2012 at 02:03:19PM +0100, Gabor Kovesdan wrote:
> On 2012.12.29. 13:48, Ulrich Spörlein wrote:
> > On Sat, 2012-12-29 at 12:08:46 +0100, Gabor Kovesdan wrote:
> >> >On 2012.12.28. 18:14, Ulrich Spörlein wrote:
> >>> > >The DE and FR articles are a hodgepodge of SGML entities and direct,
> >>> > >8bit chars, with the former being the majority. This patch cleans this
> >>> > >up a little, although we should eventually switch this all to UTF-8,
> >>> > >obviously.
> >> >Don't they work with direct chars? Once we made a step to that direction
> > They probably will, and I have no clue why we used entities for German
> > and French, but the usual encodings for Russian and Japanese, etc.
> >
> >> >so this one would be one step back. If possible, it would be better to
> >> >convert the entities to direct chars instead of the opposite.
> > In the end, sure. But that's a larger project of moving from
> > de_DE.ISO8859-1 -> de_DE (with an implied UTF-8 encoding, as is required
> > by XML anyway, the implied part, not the exact encoding).
> It isn't required by XML. If you omit the encoding part of the XML 
> declaration, the content is treated as UTF-8 but it is not a requirement 
> at all.
> >
> > I don't think this commit is a step back, because the documents need to
> > be converted using a long series of s/&uuml;/ü/g, anyway. And the
> > current mish-mash is just weird.
> I don't see any reason why we cannot do this conversion right now in 
> ISO-8859-1. (Actually, I did, but people kept introducing new redundant 
> entities.) ISO-8859-1 isn't any harder to type than UTF-8. If you want 
> consistency (which imho isn't that important at this point since there 
> are lots of upcoming changes) then why not move to the right direction 
> of consistency?
>

I'm currently working on a translation without entities and everything
is fine (see
http://people.freebsd.org/~blackend/doc/fr_FR.ISO8859-1/books/handbook/bsdinstall.html).
It's "pure" ISO-8859-1.

-- 
Marc



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20121229163133.GA1567>