From owner-freebsd-bugs@FreeBSD.ORG Fri Jan 21 17:10:21 2005
Date: Fri, 21 Jan 2005 17:10:20 GMT
Message-Id: <200501211710.j0LHAKKa044027@freefall.freebsd.org>
To: freebsd-bugs@FreeBSD.org
From: David Yu
Subject: Re: kern/76520: Add new kernel-side libiconv converter for mounting NTFS under UTF-8
Reply-To: David Yu
List-Id: Bug reports

The following reply was made to PR kern/76520; it has been noted by GNATS.

From: David Yu
To: Andrey Chernov
Cc: freebsd-gnats-submit@FreeBSD.ORG
Subject: Re: kern/76520: Add new kernel-side libiconv converter for mounting NTFS under UTF-8
Date: Fri, 21 Jan 2005 09:07:51 -0800

The encodings this converter is intended to support are only the general
encodings, i.e., encodings that can be computed directly from and to
Unicode without mapping tables. Please don't confuse them with other
encodings. The current situation is that we still use a translation table
even for encodings that can be computed efficiently (in terms of space).
Besides, those patches from R.
Imura for UTF-8 just increased the character size from 2 to 4 bytes.
However, a character in UTF-8 can be as long as 6 bytes.

Andrey Chernov wrote:
> On Fri, Jan 21, 2005 at 01:40:45AM +0000, David Yu wrote:
>
>>>Description:
>>
>>The kernel-side libiconv currently in use cannot convert Unicode characters to UTF-8 when the result is longer than 2 bytes, because it lacks a general encoding
>>converter. This patch adds a new converter for UCS-2 and UTF-8, and it can easily be extended to cover all general encodings.
>
>
> There is no need to add something to the kernel, or we will end up with tons of
> unneeded encodings embedded; encoding modules must be made as klds. I
> even remember I saw some code in the freebsd-i18n archive.
>
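
For readers following the "computed without mapping tables" argument in the reply above, here is a minimal sketch of what such a conversion between UCS-2 and UTF-8 might look like. It is not the code from the PR's patch: the function names ucs2_to_utf8 and utf8_to_ucs2 are hypothetical, and the sketch runs in userland rather than inside the kernel iconv framework. A UCS-2 value never needs more than 3 UTF-8 bytes; the 6-byte figure mentioned in the reply comes from the original UTF-8 definition (RFC 2279), which covered 31-bit values, while RFC 3629 later limited UTF-8 to 4 bytes.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Encode one UCS-2 code unit as UTF-8 using only shifts and masks.
 * Hypothetical helper for illustration; not taken from the PR's patch.
 * Surrogate values get no special treatment here.
 * Writes 1 to 3 bytes into out and returns the byte count.
 */
static size_t
ucs2_to_utf8(uint16_t c, uint8_t out[3])
{
	if (c < 0x80) {			/* 0xxxxxxx */
		out[0] = (uint8_t)c;
		return (1);
	}
	if (c < 0x800) {		/* 110xxxxx 10xxxxxx */
		out[0] = (uint8_t)(0xC0 | (c >> 6));
		out[1] = (uint8_t)(0x80 | (c & 0x3F));
		return (2);
	}
	/* 1110xxxx 10xxxxxx 10xxxxxx */
	out[0] = (uint8_t)(0xE0 | (c >> 12));
	out[1] = (uint8_t)(0x80 | ((c >> 6) & 0x3F));
	out[2] = (uint8_t)(0x80 | (c & 0x3F));
	return (3);
}

/*
 * Decode one UTF-8 sequence of at most 3 bytes into a UCS-2 code unit.
 * Returns the number of bytes consumed, or 0 if the input is truncated,
 * malformed, overlong, or would not fit in 16 bits.
 */
static size_t
utf8_to_ucs2(const uint8_t *in, size_t len, uint16_t *c)
{
	if (len == 0)
		return (0);
	if (in[0] < 0x80) {
		*c = in[0];
		return (1);
	}
	if ((in[0] & 0xE0) == 0xC0) {
		if (len < 2 || (in[1] & 0xC0) != 0x80)
			return (0);
		*c = (uint16_t)(((in[0] & 0x1F) << 6) | (in[1] & 0x3F));
		return (*c >= 0x80 ? 2 : 0);	/* reject overlong forms */
	}
	if ((in[0] & 0xF0) == 0xE0) {
		if (len < 3 || (in[1] & 0xC0) != 0x80 ||
		    (in[2] & 0xC0) != 0x80)
			return (0);
		*c = (uint16_t)(((in[0] & 0x0F) << 12) |
		    ((in[1] & 0x3F) << 6) | (in[2] & 0x3F));
		return (*c >= 0x800 ? 3 : 0);	/* reject overlong forms */
	}
	return (0);	/* 4-byte and longer forms do not fit in UCS-2 */
}
```

Neither direction needs a per-encoding table, which is the space argument made in the reply. Whether such a routine belongs in the base kernel or in a kld, as the quoted objection suggests, is a separate question that this sketch does not settle.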