[Webkit-unassigned] [Bug 235307] TextCodec should treat lone surrogates as the replacement character

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Wed Nov 30 05:23:11 PST 2022


https://bugs.webkit.org/show_bug.cgi?id=235307

--- Comment #3 from Anne van Kesteren <annevk at annevk.nl> ---
That test doesn't seem to test windows-1252 (due to JSFiddle forcing UTF-8), but when actually testing windows-1252 all browsers seem to agree as well: https://github.com/web-platform-tests/wpt/pull/37250.

However,

1. Comment 0 also describes a problem on Windows that might still exist.
2. Code inspection shows that https://github.com/WebKit/WebKit/blob/5e81d33ff5c0150dbabbebbe2e96fb08ff4d6ad3/Source/WebCore/PAL/pal/text/TextCodecUTF8.cpp#L461-L472 does not do surrogate handling.

(Also, if as comment 0 suggests this is somehow intentional, which I suspect it's not, it shouldn't be called UTF-8.)

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.webkit.org/pipermail/webkit-unassigned/attachments/20221130/37be47b7/attachment.htm>


More information about the webkit-unassigned mailing list