From: Ed Maste
To: Garrett Wollman
Cc: freebsd-current@FreeBSD.ORG
Date: Mon, 26 Sep 2005 20:16:50 -0400
Subject: Re: Bsdtar and archive torture tests
Message-ID: <20050927001650.GA9994@sandvine.com>
In-Reply-To: <17208.30606.117170.36398@khavrinen.csail.mit.edu>

On Mon, Sep 26, 2005 at 06:34:54PM -0400, Garrett Wollman wrote:

> What is your locale set to?
>
> POSIX pax interchange format, the default for bsdtar, requires file
> names in archives to be represented in UTF-8.
> , when
> interpreted as UTF-8, is the character U+00E0 (LATIN SMALL LETTER A
> WITH GRAVE ACCENT).

Hmm, good point. I haven't set it to anything; locale(1) shows that the
LC_ variables are set to "C".

So then I can see how this happens, but it's still surprising (to me)
behaviour. I guess that, with my locale set to "C", does not represent
valid characters, and it's not possible to convert to UTF-8.

I wonder if bsdtar should emit a warning or similar in this case?

-- 
Ed Maste, Sandvine Incorporated
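
The point about locales can be illustrated with a small Python sketch. It is only an analogy, not how bsdtar or libarchive performs the conversion. The byte sequence from the original report did not survive in this copy of the message, so the sketch assumes the bytes 0xC3 0xA0, which are the UTF-8 encoding of U+00E0. The helper name `describe` is hypothetical, and the "ascii" codec stands in for the "C" locale's character set.

```python
import unicodedata
from typing import Optional


def describe(raw: bytes, charset: str) -> Optional[str]:
    """Return 'U+XXXX NAME' for each decoded character,
    or None if raw is not valid in the given charset."""
    try:
        text = raw.decode(charset)
    except UnicodeDecodeError:
        return None
    return ", ".join(
        f"U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}" for ch in text
    )


# Assumed example bytes: the UTF-8 encoding of U+00E0.
raw = b"\xc3\xa0"

for charset in ("utf-8", "ascii"):
    result = describe(raw, charset)
    if result is None:
        # The kind of diagnostic the message suggests bsdtar might emit.
        print(f"warning: {raw!r} cannot be interpreted as {charset}")
    else:
        print(f"{charset}: {result}")
```

Under these assumptions the same two bytes are one valid character in UTF-8 and not valid text at all in a 7-bit character set, which is the situation described above for the "C" locale.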