Date: Wed, 05 Jul 2023 14:27:34 +0000 From: bugzilla-noreply@freebsd.org To: bugs@FreeBSD.org Subject: [Bug 272384] The iconv converter from GB18030 to UTF-8 is broken Message-ID: <bug-272384-227@https.bugs.freebsd.org/bugzilla/>
next in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D272384 Bug ID: 272384 Summary: The iconv converter from GB18030 to UTF-8 is broken Product: Base System Version: 13.2-RELEASE Hardware: Any OS: Any Status: New Severity: Affects Some People Priority: --- Component: bin Assignee: bugs@FreeBSD.org Reporter: bruno@clisp.org The iconv converter from GB18030 to UTF-8 is broken: It maps only 63486 characters. It should map 1112064 characters. All valid Unicode code points (U+0000..U+D7FF, U+E000..U+10FFFF) are representable in GB18030. See https://en.wikipedia.org/wiki/GB_18030#Mapping for details. How to reproduce: $ cc -Wall -o table-from table-from.c $ ./table-from GB18030 > GB18030.TXT Actual output: see actual-GB18030.TXT Expected output: one of expected-GB18030-2005.TXT (for a GB18030:2005 compl= iant converter) or expected-GB18030-2022.TXT (for a GB18030:2022 compliant converter). --=20 You are receiving this mail because: You are the assignee for the bug.=
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-272384-227>