Date: Mon, 08 Oct 2018 11:05:06 +0000 From: bugzilla-noreply@freebsd.org To: bugs@FreeBSD.org Subject: [Bug 225692] iswprint() wrong for some FULL WIDTH characters in UTF-8 locale Message-ID: <bug-225692-227-NmLaChSL2W@https.bugs.freebsd.org/bugzilla/> In-Reply-To: <bug-225692-227@https.bugs.freebsd.org/bugzilla/>
index | next in thread | previous in thread | raw e-mail
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=225692 Yuri Pankov <yuripv@yuripv.net> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |yuripv@yuripv.net --- Comment #14 from Yuri Pankov <yuripv@yuripv.net> --- There are several issues described in this bug report, let's take a look at U+2002 first, it is <EN_SPACE>, included in "space" character class, which localedef(1) does NOT currently identify as "print" character class. The POSIX locale definition says the following: -------------------------------------------------------------------------------- print Define characters to be classified as printable characters, including the <space>. In the POSIX locale, all characters in class graph shall be included; no characters in class cntrl shall be included. In a locale definition file, characters specified for the keywords upper, lower, alpha, digit, xdigit, punct, graph, and the <space> are automatically included in this class. No character specified for the keyword cntrl shall be specified. -------------------------------------------------------------------------------- So it doesn't explicitly say anything about "space" here, though it doesn't NOT explicitly forbid including "space" in the "print" class, and I *think* we can extend the <space> character being included to the entire "space" character class. See review D17467. -- You are receiving this mail because: You are the assignee for the bug.help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-225692-227-NmLaChSL2W>
