[Webkit-unassigned] [Bug 258251] New: REGRESSION(264918 at main): GB18030 encoding isn't hooked up correctly

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Sat Jun 17 16:04:59 PDT 2023


https://bugs.webkit.org/show_bug.cgi?id=258251

            Bug ID: 258251
           Summary: REGRESSION(264918 at main): GB18030 encoding isn't hooked
                    up correctly
           Product: WebKit
           Version: WebKit Nightly Build
          Hardware: Unspecified
                OS: Unspecified
            Status: NEW
          Severity: Normal
          Priority: P2
         Component: Text
          Assignee: webkit-unassigned at lists.webkit.org
          Reporter: mmaxfield at apple.com
                CC: mmaxfield at apple.com

We're encoding U+E78D to 0x83 0x36 0xCB 0x32 which seems totally wrong. That is neither 0xA6 0xD9 nor 0x84 0x31 0x82 0x36 (which are the sequences on L2/23-003R[1]). If we round-trip our byte sequence back to a code point, it decodes to U+E82E which is just a totally different PUA character.

[1] https://www.unicode.org/L2/L2023/23003r-gb18030-recommendations.pdf

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.webkit.org/pipermail/webkit-unassigned/attachments/20230617/9eb372b8/attachment.htm>


More information about the webkit-unassigned mailing list