Date: Mon, 20 Sep 2010 20:31:33 +0100 From: Pete French <pete@twisted.org.uk> To: freebsd-bugs@FreeBSD.org, jh@FreeBSD.org Subject: Re: bin/150727: diff on UTF-8 text files thinks they are binary - regression from 7.X Message-ID: <E1Oxm5N-000D1p-C3@toybox.twisted.org.uk> In-Reply-To: <201009201332.o8KDWmlo074276@freefall.freebsd.org>
next in thread | previous in thread | raw e-mail | index | archive | help
> I couldn't reproduce this with simple UTF-8 files: I just looked through my example files in detail, and it turns out the problem is not with UTF-8 after all, but with NULL characters which are also in the file. This is what trips up 'diff' - and though it it a charge from 7.X I am not sure that it is really a bug. Sorry for the noise - the code I used to verify that the file was a valid UTF-8 file accepts the zero bytes quite happily and says that it is a text file.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?E1Oxm5N-000D1p-C3>