From owner-freebsd-hackers@FreeBSD.ORG Mon Apr 27 19:48:42 2009 Return-Path: Delivered-To: freebsd-hackers@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id EAE5B106566B for ; Mon, 27 Apr 2009 19:48:42 +0000 (UTC) (envelope-from das@FreeBSD.ORG) Received: from zim.MIT.EDU (ZIM.MIT.EDU [18.95.3.101]) by mx1.freebsd.org (Postfix) with ESMTP id A9DD38FC19 for ; Mon, 27 Apr 2009 19:48:42 +0000 (UTC) (envelope-from das@FreeBSD.ORG) Received: from zim.MIT.EDU (localhost [127.0.0.1]) by zim.MIT.EDU (8.14.3/8.14.2) with ESMTP id n3RJn4hO011170 for ; Mon, 27 Apr 2009 15:49:04 -0400 (EDT) (envelope-from das@FreeBSD.ORG) Received: (from das@localhost) by zim.MIT.EDU (8.14.3/8.14.2/Submit) id n3RJn4Vk011169 for freebsd-hackers@freebsd.org; Mon, 27 Apr 2009 15:49:04 -0400 (EDT) (envelope-from das@FreeBSD.ORG) Date: Mon, 27 Apr 2009 15:49:04 -0400 From: David Schultz To: freebsd-hackers@FreeBSD.ORG Message-ID: <20090427194904.GA11137@zim.MIT.EDU> Mail-Followup-To: freebsd-hackers@freebsd.org References: <20090427183836.GA10793@zim.MIT.EDU> <49F5FE45.2090101@freebsd.org> <20090427193326.GA7654@britannica.bec.de> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090427193326.GA7654@britannica.bec.de> Cc: Subject: Re: SoC 2009: BSD-licensed libiconv in base system X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 27 Apr 2009 19:48:43 -0000 On Mon, Apr 27, 2009, Joerg Sonnenberger wrote: > On Mon, Apr 27, 2009 at 11:49:41AM -0700, Tim Kientzle wrote: > > David Schultz wrote: > >> ... whether it would make more sense to standardize on something like > >> UCS-4 for the internal representation. > > > > YES. Without this, wchar_t is useless. > > I strongly disagree. Everything can be represented as UCS-4 is a bad > assumption, but something Americans and Europeans naturally don't have > to care about. ...but isn't this moot at present because there are no widely-accepted encodings that include characters that aren't supported by UCS-4? Citrus doesn't seem to support any such encodings in any case. If this ever really becomes an issue, we could always stuff locale-dependent encodings into unused UCS-4 code pages. However, it doesn't seem worthwhile to deliberately burden programmers over concerns that are presently, and for the foreseeable future, hypothetical.