From owner-freebsd-doc@FreeBSD.ORG Fri Aug 3 14:19:46 2012 Return-Path: Delivered-To: doc@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 4CFDB1065674 for ; Fri, 3 Aug 2012 14:19:46 +0000 (UTC) (envelope-from simon@qxnitro.org) Received: from mail-yw0-f54.google.com (mail-yw0-f54.google.com [209.85.213.54]) by mx1.freebsd.org (Postfix) with ESMTP id ED0238FC15 for ; Fri, 3 Aug 2012 14:19:45 +0000 (UTC) Received: by yhfs35 with SMTP id s35so1078795yhf.13 for ; Fri, 03 Aug 2012 07:19:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=qxnitro.org; s=google; h=mime-version:sender:x-originating-ip:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=lZofQR+UPmk/Hi9ZJQbyGeSTlK0T/soRSmCVtWAKiSE=; b=Lu194bnp7FXCM+7j9QX2vlhPx9bFgvEtdCH9x9cR1dq9k5zF+9zBVMb/psv8SiLts5 lF8bJZzNGFFE3ywTMdNTVffra0w7ocwBlGw+eUhXY/LEZBT7FAc4NiMN6ZCQ8m0bOvYC gWbQBtdFsbiCQgi5YDaATMqZixZ5b1LPNy0EE= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:sender:x-originating-ip:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding:x-gm-message-state; bh=lZofQR+UPmk/Hi9ZJQbyGeSTlK0T/soRSmCVtWAKiSE=; b=Q4MaPy2Xfqyk0WFo6UFSKAeAtgJTU4Jv2YIhrynIjjHjWCXWpS3JjwuxxYdlxJGw2w mbkleiTMZbzcY/IGQlUj03ut+4mbaxlO9YDIhmXkve/B9dTOkyMCJJ382m82RRTd/M6j WYKz9GPUob3q6HW2cqp8zdgPuvPg4Vnytq/ODd78xExr5tUTe9FPdvFbPxNIUPWzHbbC ykupB1UYS0ONzra6768rteLCBpaM6z2cvK/L+ERTAa+RNAtgUmMLNx4kO72uwmlkil4R LqW461lGZ/OJE6k2FSfTO6eQt6GE6D9M99TfJ1xNiouzkANgcn2sUmOY/qrUD1R9FQM/ 4YFw== MIME-Version: 1.0 Received: by 10.50.47.196 with SMTP id f4mr3646650ign.21.1344003584690; Fri, 03 Aug 2012 07:19:44 -0700 (PDT) Sender: simon@qxnitro.org Received: by 10.64.18.74 with HTTP; Fri, 3 Aug 2012 07:19:44 -0700 (PDT) X-Originating-IP: [2620:0:1040:201:5991:b1e1:4b0b:1df0] In-Reply-To: <20120803141538.GG1202@acme.spoerlein.net> References: <501BAFBD.3010008@FreeBSD.org> <20120803141538.GG1202@acme.spoerlein.net> Date: Fri, 3 Aug 2012 15:19:44 +0100 X-Google-Sender-Auth: Z_sHHNIl_gqhHaVUdXVom5uXbZI Message-ID: From: "Simon L. B. Nielsen" To: =?UTF-8?Q?Ulrich_Sp=C3=B6rlein?= Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Gm-Message-State: ALoCoQkodo0iNezU8E59NEW8rj05BWh+WwhY2h2VMyUBifh6rKgBe81A0Ph3S2nZxRGpxDFdFMiu Cc: doc@freebsd.org, Gabor Kovesdan , www@freebsd.org Subject: Re: RFC: doc/www cleanup X-BeenThere: freebsd-doc@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Documentation project List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 03 Aug 2012 14:19:46 -0000 On Fri, Aug 3, 2012 at 3:15 PM, Ulrich Sp=C3=B6rlein wrot= e: > On Fri, 2012-08-03 at 14:33:04 +0100, Simon L. B. Nielsen wrote: >> On Fri, Aug 3, 2012 at 12:02 PM, Gabor Kovesdan wrot= e: >> > 2, Relaxing character entity usage: To be able to read non-ASCII chara= cters >> > on ASCII-only systems, we have been using character entities, like &aa= cute;. >> > But in CJK languages, Greek and Russian every character is non-ASCII s= o >> > practically they cannot be used nor were they used. So they are only u= sed in >> > ISO-8859 encodings (except Greek, which is also from this family). In = fact, >> > displaying these Latin-based characters nowadays isn't that problemati= c any >> > more. Furthermore, if you edit text in a given language then we can su= ppose >> > that you understand the language so you know what you should see and y= ou >> > know how to configure your system if you don't see the desired result.= As a >> > result, these entities nowadays don't have any real advantage any more= but >> > they highly "pollute" the text and make it much harder to edit and rea= d. One >> >> I agree that the entities should generally not be used. I think we >> should just switch to UTF-8 and charecterset wherever possible to >> simplify it even more. >> >> And on that note, kill the useless character-set part of all our >> language directories which generate horrible paths with no additional >> value. >> >> > exception is using characters in a specific language that aren't prese= nt >> > there, e.g. a non-English developer name in the English documentation,= etc. >> >> UTF-8 would fix that. > > Last time I brought this up (trying to get rid of silly entities and > the bogus charset name of the directories), I was told that our > toolchain didn't fully grok UTF-8 yet, which was the reason we still had > this de_DE.ISO8859-1 nonsense. Ah, ok. > The move to XML should really, really convert all files to UTF-8, drop > that from the directories, and get rid of entities like ä or > é, etc.o Unfortunately I can only agree 100% ;-). --=20 Simon L. B. Nielsen