From owner-freebsd-hackers@FreeBSD.ORG Tue Nov 12 20:17:52 2013 Return-Path: Delivered-To: hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id CEA521E0; Tue, 12 Nov 2013 20:17:52 +0000 (UTC) Received: from mail-in-04.arcor-online.net (mail-in-04.arcor-online.net [151.189.21.44]) (using TLSv1 with cipher ADH-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 6DDF022D4; Tue, 12 Nov 2013 20:17:52 +0000 (UTC) Received: from mail-in-20-z2.arcor-online.net (mail-in-20-z2.arcor-online.net [151.189.8.85]) by mx.arcor.de (Postfix) with ESMTP id B7E0DAA979; Tue, 12 Nov 2013 21:17:38 +0100 (CET) Received: from mail-in-07.arcor-online.net (mail-in-07.arcor-online.net [151.189.21.47]) by mail-in-20-z2.arcor-online.net (Postfix) with ESMTP id BCC246FAE27; Tue, 12 Nov 2013 21:17:38 +0100 (CET) X-Greylist: Passed host: 188.105.81.237 X-DKIM: Sendmail DKIM Filter v2.8.2 mail-in-07.arcor-online.net 5EF6A107AFB X-Greylist: Passed host: 188.105.81.237 X-Greylist: Passed host: 188.105.81.237 X-Greylist: Passed host: 188.105.81.237 X-Greylist: Passed host: 188.105.81.237 Received: from lorvorc.mips.inka.de (dslb-188-105-081-237.pools.arcor-ip.net [188.105.81.237]) by mail-in-07.arcor-online.net (Postfix) with ESMTPS id 5EF6A107AFB; Tue, 12 Nov 2013 21:17:38 +0100 (CET) Received: from lorvorc.mips.inka.de (localhost [127.0.0.1]) by lorvorc.mips.inka.de (8.14.7/8.14.7) with ESMTP id rACKHbeR053605; Tue, 12 Nov 2013 21:17:37 +0100 (CET) (envelope-from naddy@lorvorc.mips.inka.de) Received: (from naddy@localhost) by lorvorc.mips.inka.de (8.14.7/8.14.7/Submit) id rACKHb2P053604; Tue, 12 Nov 2013 21:17:37 +0100 (CET) (envelope-from naddy) Date: Tue, 12 Nov 2013 21:17:37 +0100 From: Christian Weisgerber To: "Julian H. Stacey" Subject: Re: patch for /usr/src/usr.bin/fmt/ (not 8 bit clean) for German & French Message-ID: <20131112201737.GA52200@lorvorc.mips.inka.de> References: <201311120110.rAC1A1jc066753@fire.js.berklix.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <201311120110.rAC1A1jc066753@fire.js.berklix.net> User-Agent: Mutt/1.5.22 (2013-10-16) Cc: hackers@freebsd.org, FreeBSD-gnats-submit@freebsd.org, "Bernhard Riedel \(Work\)" , Astrid Jekat X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.16 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 Nov 2013 20:17:52 -0000 Julian H. Stacey: > I don't know about ISO 8859-1 and UTF-8, (I dislike & avoid > national char set stuff as much as possible), but I want That is your problem right there. > to be able to edit files that simultaneously contain eg all > of English German & French etc, so setting some var to eg > just German would be inappropriate. 8 bit clean would be ideal, > next best would be my patches I suppose. You MUST define a character set for this. "8-bit clean" is meaningless for a tool that deals with runs of characters. Without a defined character set, you have no idea what those bytes mean. Is 0x90 a printable character? Is it a control character? Is it part of a multibyte character? And setting, for example, LC_CTYPE=de_DE.ISO8859-1 does in no way limit you to German. For LC_CTYPE purposes, the language/country part of the locale specification isn't used. This is definitely a PEBKAC. -- Christian "naddy" Weisgerber naddy@mips.inka.de