From: bugzilla-noreply@freebsd.org
To: freebsd-bugs@FreeBSD.org
Subject: [Bug 191540] the FAT32 implementation bugs out on Unicode file names
Date: Wed, 02 Jul 2014 04:50:22 +0000
X-Bugzilla-Product: Base System
X-Bugzilla-Component: kern
X-Bugzilla-Version: 11.0-CURRENT

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=191540

Jamie
Landeg-Jones changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jamie@dyslexicfish.net

--- Comment #3 from Jamie Landeg-Jones ---
With Unix filesystems, you can put any 'binary' characters in a filename and
they will be recorded 'as is'. So, when you do an 'ls', the filename data is
preserved: if it was originally a UTF-8 encoded name, it will still be a UTF-8
name, and will be displayed correctly on a UTF-8 terminal.

msdos file systems don't work this way; they translate the filenames before
storing them. If the msdos filesystem doesn't know what the original character
set is, the name won't be encoded correctly, and subsequently won't be
displayed correctly.

You therefore need to tell it at mount time what character set you are using,
with the -L option:

     -L locale
             Specify locale name used for file name conversions for DOS and
             Win'95 names.  By default ISO 8859-1 assumed as local character
             set.

Your test filenames are in UTF-8 format, so if you repeat your exercise, but
instead mount the partition with:

    mount_msdosfs -L en_GB.UTF-8 /dev/md7 fat

then everything will work as expected.

....

You are attempting to

-- 
You are receiving this mail because:
You are the assignee for the bug.
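
As a rough illustration of the character-set point made in the comment, the Python sketch below shows what happens when the bytes of a UTF-8 filename are interpreted as ISO 8859-1, the default local character set when -L is not given. This is a simplified model of the mismatch only, not the actual msdosfs conversion code, and the filename used is a made-up example.

```python
# Simplified model of the charset mismatch; NOT the real msdosfs code path.
# The filename below is a hypothetical example.
name = "caf\u00e9.txt"             # "café.txt"
utf8_bytes = name.encode("utf-8")  # the bytes a UTF-8 terminal hands to the kernel

# Unix filesystems store the bytes untouched, so the name round-trips.
unix_view = utf8_bytes.decode("utf-8")

# Assumption: without -L, each byte is treated as one ISO 8859-1 character,
# so the two-byte UTF-8 sequence for "é" turns into two separate characters.
misread = utf8_bytes.decode("iso-8859-1")

print(repr(unix_view))
print(repr(misread))
print(len(name), len(misread))
```

In this model the misread name is one character longer than the original, because the two UTF-8 bytes of the accented letter are each taken as a character of their own. Telling the mount which locale is in use avoids that misinterpretation.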