From: David Yu
To: Christopher Vance
Cc: freebsd-fs@freebsd.org, freebsd-current@freebsd.org
Subject: Re: NTFS unicode converting problem
Date: Sun, 03 Jul 2005 01:03:13 -0000
X-Original-Date: Sun, 16 Jan 2005 18:25:52 -0800
In-Reply-To: <20050117020039.GB630@nu.org>
References: <20050117020039.GB630@nu.org>

In my case, those Chinese filenames are still in UCS-2LE, and all
characters are in plane 0. I tried to modify the code so that it stores
the conversion result directly in the dirent structure, but the
convchr() function failed every time for Chinese characters, while there
was no problem with ASCII characters. I thought converting from UCS-2 to
UTF-8 should be very easy?

On Mon, 17 Jan 2005 13:00:39 +1100, Christopher Vance wrote:
> On Sun, Jan 16, 2005 at 01:44:04PM -0800, David Yu wrote:
> >Hi, it seems that NTFS in FreeBSD uses a 16-bit wchar to store
> >filenames. When I wanted to convert some Chinese filenames into UTF-8,
> >the conversion failed because a single Chinese character needs 3
> >bytes in UTF-8. Is anyone already working on this problem? If not, I
> >would like to do something about it. Any suggestions?
>
> From memory, old Windows used UCS-2, while newer Windows uses UTF-16.
> Was the bad character in plane 0 or higher?
>
> --
> Christopher Vance
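
For reference, here is a minimal userland sketch of the conversion being discussed: 16-bit little-endian units in, UTF-8 out. It is not the NTFS code and not convchr(); the names utf8_encode() and utf16le_to_utf8() are made up for illustration. It assumes the input is UCS-2LE or UTF-16LE, so plane 0 characters take at most 3 output bytes per unit, and a surrogate pair (the "plane 0 or higher" case) becomes a single 4-byte sequence.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/*
 * Hypothetical helper: encode one code point as UTF-8.
 * Returns the number of bytes written (1 to 4), or 0 if cp is a
 * surrogate or lies beyond U+10FFFF.
 */
size_t
utf8_encode(uint32_t cp, unsigned char out[4])
{
	if (cp < 0x80) {			/* ASCII: 1 byte */
		out[0] = (unsigned char)cp;
		return (1);
	}
	if (cp < 0x800) {			/* 2 bytes */
		out[0] = (unsigned char)(0xC0 | (cp >> 6));
		out[1] = (unsigned char)(0x80 | (cp & 0x3F));
		return (2);
	}
	if (cp < 0x10000) {			/* rest of plane 0: 3 bytes */
		if (cp >= 0xD800 && cp <= 0xDFFF)
			return (0);		/* unpaired surrogate */
		out[0] = (unsigned char)(0xE0 | (cp >> 12));
		out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
		out[2] = (unsigned char)(0x80 | (cp & 0x3F));
		return (3);
	}
	if (cp <= 0x10FFFF) {			/* planes 1 to 16: 4 bytes */
		out[0] = (unsigned char)(0xF0 | (cp >> 18));
		out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
		out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
		out[3] = (unsigned char)(0x80 | (cp & 0x3F));
		return (4);
	}
	return (0);
}

/*
 * Hypothetical helper: convert nunits 16-bit little-endian units at src
 * into a NUL-terminated UTF-8 string in dst (capacity dstlen bytes).
 * Returns the number of bytes written, not counting the NUL, or
 * (size_t)-1 on malformed input or if dst is too small.
 */
size_t
utf16le_to_utf8(const unsigned char *src, size_t nunits, char *dst,
    size_t dstlen)
{
	size_t i, n, o = 0;
	unsigned char buf[4];
	uint32_t cp, lo;

	if (dstlen == 0)
		return ((size_t)-1);
	for (i = 0; i < nunits; i++) {
		/* Assemble one unit, low byte first. */
		cp = src[2 * i] | ((uint32_t)src[2 * i + 1] << 8);
		if (cp >= 0xD800 && cp <= 0xDBFF) {
			/* High surrogate: a low surrogate must follow. */
			if (i + 1 >= nunits)
				return ((size_t)-1);
			lo = src[2 * i + 2] | ((uint32_t)src[2 * i + 3] << 8);
			if (lo < 0xDC00 || lo > 0xDFFF)
				return ((size_t)-1);
			cp = 0x10000 + ((cp - 0xD800) << 10) + (lo - 0xDC00);
			i++;
		}
		n = utf8_encode(cp, buf);
		/* Keep one byte in reserve for the terminating NUL. */
		if (n == 0 || o + n >= dstlen)
			return ((size_t)-1);
		memcpy(dst + o, buf, n);
		o += n;
	}
	dst[o] = '\0';
	return (o);
}
```

With the two bytes 0x2D 0x4E (U+4E2D in UCS-2LE) as input, this writes E4 B8 AD. Note that a caller handling only plane 0 names needs an output buffer of up to 3 bytes per 16-bit unit, plus one for the NUL.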