From owner-freebsd-arch Fri Sep 1 8:22:10 2000 Delivered-To: freebsd-arch@freebsd.org Received: from relay.butya.kz (butya-gw.butya.kz [212.154.129.94]) by hub.freebsd.org (Postfix) with ESMTP id E7FBD37B422; Fri, 1 Sep 2000 08:22:03 -0700 (PDT) Received: by relay.butya.kz (Postfix, from userid 1000) id 2CF9928775; Fri, 1 Sep 2000 22:22:01 +0700 (ALMST) Received: from localhost (localhost [127.0.0.1]) by relay.butya.kz (Postfix) with ESMTP id 2313F28677; Fri, 1 Sep 2000 22:22:01 +0700 (ALMST) Date: Fri, 1 Sep 2000 22:22:01 +0700 (ALMST) From: Boris Popov To: "Andrey A. Chernov" Cc: freebsd-arch@FreeBSD.ORG, freebsd-i18n@FreeBSD.ORG, Konstantin Chuguev Subject: Re: Proposal to include iconv library in the base system. In-Reply-To: <20000901185945.A29804@nagual.pp.ru> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-freebsd-arch@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Fri, 1 Sep 2000, Andrey A. Chernov wrote: > On Thu, Aug 24, 2000 at 05:39:39PM +0700, Boris Popov wrote: > > FreeBSD already contains a few character conversion schemes for > > msdosfs, nwfs, cd9660fs and syscon mapping tables. However, the usage > > We need XLAT converters for them, not Unicode one, as I understand Unicode > data loaded into kernel will be too big. Correct, for 8 bit charsets XLAT table dynamically built in the user space and then uploaded to the kernel space. This can't be done for multibyte charsets, though. > > The questionable part is a which set of character sets should be > > included in the base system and which should be supplied as packages. > > We need to include all charsets we have locale support in the base system. Noted. > > Secondly, where should the functions be placed? Initially, the iconv > > /usr/libdata/iconv Ok. > What I am not understand at this moment: how iconv handles non-convertable > characters? I don't see any way to set fill character in described > interface. I'll cite iconv(3) man page: If a sequence of input bytes does not form a valid charac- ter in the specified charset, conversion stops after the previous successfully converted character. If the input buffer ends with an incomplete character or shift sequence, conversion stops after the previous successfully converted bytes. -- Boris Popov http://www.butya.kz/~bp/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-arch" in the body of the message