Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 05 Jul 2023 14:27:34 +0000
From:      bugzilla-noreply@freebsd.org
To:        bugs@FreeBSD.org
Subject:   [Bug 272384] The iconv converter from GB18030 to UTF-8 is broken
Message-ID:  <bug-272384-227@https.bugs.freebsd.org/bugzilla/>

next in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D272384

            Bug ID: 272384
           Summary: The iconv converter from GB18030 to UTF-8 is broken
           Product: Base System
           Version: 13.2-RELEASE
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Some People
          Priority: ---
         Component: bin
          Assignee: bugs@FreeBSD.org
          Reporter: bruno@clisp.org

The iconv converter from GB18030 to UTF-8 is broken: It maps only 63486
characters. It should map 1112064 characters. All valid Unicode code points
(U+0000..U+D7FF, U+E000..U+10FFFF) are representable in GB18030. See
https://en.wikipedia.org/wiki/GB_18030#Mapping for details.

How to reproduce:
$ cc -Wall -o table-from table-from.c
$ ./table-from GB18030 > GB18030.TXT

Actual output: see actual-GB18030.TXT

Expected output: one of expected-GB18030-2005.TXT (for a GB18030:2005 compl=
iant
converter) or expected-GB18030-2022.TXT (for a GB18030:2022 compliant
converter).

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-272384-227>