[webkit-changes] [WebKit/WebKit] 4659e8: Adopt GB18030-2022 <-> Unicode mapping recommendation
Vitor Roriz
noreply at github.com
Mon Sep 16 13:46:20 PDT 2024
Branch: refs/heads/main
Home: https://github.com/WebKit/WebKit
Commit: 4659e88a8afd048b77069f7aca297ef4b50b1200
https://github.com/WebKit/WebKit/commit/4659e88a8afd048b77069f7aca297ef4b50b1200
Author: Vitor Roriz <vitor.roriz at apple.com>
Date: 2024-09-16 (Mon, 16 Sep 2024)
Changed paths:
M LayoutTests/imported/w3c/web-platform-tests/encoding/legacy-mb-schinese/gb18030/gb18030-decoder.any.js
M LayoutTests/imported/w3c/web-platform-tests/encoding/legacy-mb-schinese/gb18030/gb18030-encoder.html
M Source/WebCore/PAL/pal/text/TextCodecCJK.cpp
Log Message:
-----------
Adopt GB18030-2022 <-> Unicode mapping recommendation
https://bugs.webkit.org/show_bug.cgi?id=279761
rdar://124367525
Reviewed by Alex Christensen.
As per [1] the Unicode Technical Committee recommends an updated mapping between GB18030 and Unicode
which we follow here.
Related tests need to be updated and exported to WPT. The PDF [1] contains a text
file attachment with the recommended mapping, which is:
Transcoding From Unicode Standard to GB 18030:
U+E78D -> 0xA6D9
U+E78E -> 0xA6DA
U+E78F -> 0xA6DB
U+E790 -> 0xA6DC
U+E791 -> 0xA6DD
U+E792 -> 0xA6DE
U+E793 -> 0xA6DF
U+E794 -> 0xA6EC
U+E795 -> 0xA6ED
U+E796 -> 0xA6F3
U+E81E -> 0xFE59
U+E826 -> 0xFE61
U+E82B -> 0xFE66
U+E82C -> 0xFE67
U+E832 -> 0xFE6D
U+E843 -> 0xFE7E
U+E854 -> 0xFE90
U+E864 -> 0xFEA0
Transcoding From GB 18030 to Unicode Standard:
0x82359037 -> U+9FB4
0x82359038 -> U+9FB5
0x82359039 -> U+9FB6
0x82359130 -> U+9FB7
0x82359131 -> U+9FB8
0x82359132 -> U+9FB9
0x82359133 -> U+9FBA
0x82359134 -> U+9FBB
0x84318236 -> U+FE10
0x84318237 -> U+FE11
0x84318238 -> U+FE12
0x84318239 -> U+FE13
0x84318330 -> U+FE14
0x84318331 -> U+FE15
0x84318332 -> U+FE16
0x84318333 -> U+FE17
0x84318334 -> U+FE18
0x84318335 -> U+FE19
[1] https://www.unicode.org/L2/L2023/23003r-gb18030-recommendations.pdf brings new mapping recommendations between GB18030 and Unicode.
* LayoutTests/imported/w3c/web-platform-tests/encoding/legacy-mb-schinese/gb18030/gb18030-decoder.any.js:
* LayoutTests/imported/w3c/web-platform-tests/encoding/legacy-mb-schinese/gb18030/gb18030-encoder.html:
* Source/WebCore/PAL/pal/text/TextCodecCJK.cpp:
(PAL::gb180302022Encode):
(PAL::gb180302022Decode):
Canonical link: https://commits.webkit.org/283724@main
To unsubscribe from these emails, change your notification settings at https://github.com/WebKit/WebKit/settings/notifications
More information about the webkit-changes
mailing list