[Webkit-unassigned] [Bug 193339] Either StringView::UpconvertedCharacters::UpconvertedCharacters() or StringImpl::createCFString() is using the wrong encoding

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Mon Jan 14 06:35:46 PST 2019


https://bugs.webkit.org/show_bug.cgi?id=193339

--- Comment #9 from Darin Adler <darin at apple.com> ---
(In reply to Myles C. Maxfield from comment #6)
> I thought it wasn't true the Latin1 is just the first 255 characters of
> Unicode, but I'm checking now.

Here’s one reason you might be confused:

When a website specifies Latin-1 as its character encoding, compatible web browsers treat the content of the website as windows-1252 instead, which is like Latin-1 but uses most of the 32 bytes in the range 0x80-0x9F for other characters, such as U+20AC (the euro sign) at 0x80, rather than for U+0080 through U+009F.
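
To make the difference concrete, here is a small sketch in Python (not WebKit code) that decodes the same bytes both ways. It assumes Python's built-in "latin-1" and "cp1252" codecs, which stand in for true Latin-1 and windows-1252; the sample bytes are arbitrary picks for illustration.

```python
# The same four bytes, decoded as true Latin-1 and as windows-1252.
# 0x80, 0x93 and 0x94 fall in the 0x80-0x9F range; 0xE9 does not.
raw = bytes([0x80, 0x93, 0x94, 0xE9])

as_latin1 = raw.decode("latin-1")  # ISO-8859-1: byte 0xNN becomes U+00NN
as_cp1252 = raw.decode("cp1252")   # windows-1252 remaps 0x80, 0x93, 0x94

latin1_points = [f"U+{ord(c):04X}" for c in as_latin1]
cp1252_points = [f"U+{ord(c):04X}" for c in as_cp1252]

print(latin1_points)  # ['U+0080', 'U+0093', 'U+0094', 'U+00E9']
print(cp1252_points)  # ['U+20AC', 'U+201C', 'U+201D', 'U+00E9']
```

Outside 0x80-0x9F the two encodings agree, which is why 0xE9 decodes to U+00E9 in both cases.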

You can see this in the WHATWG Encoding Standard, where the labels for windows-1252 include strings like "l1", "latin1", and even "ascii".
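
As a rough illustration of that label mapping, here is a simplified sketch in Python. The names WINDOWS_1252_LABELS and resolve_label are hypothetical, the label set is only a subset of what the specification lists, and the lookup is a stand-in for the specification's matching rules, not WebKit's implementation.

```python
# Hypothetical subset of the labels that resolve to windows-1252.
WINDOWS_1252_LABELS = {
    "ascii", "us-ascii", "l1", "latin1", "iso-8859-1", "cp1252", "windows-1252",
}

def ascii_lower(text):
    # Lowercase only A-Z, leaving every other character untouched.
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in text)

def resolve_label(label):
    # Trim ASCII whitespace, then compare ASCII case-insensitively.
    normalized = ascii_lower(label.strip(" \t\n\f\r"))
    return "windows-1252" if normalized in WINDOWS_1252_LABELS else None

print(resolve_label(" Latin1 "))  # windows-1252
print(resolve_label("ASCII"))     # windows-1252
print(resolve_label("utf-8"))     # None
```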

That encoding is what TextCodecLatin1.h/cpp implements. TextCodecLatin1.h/cpp could be renamed to avoid confusion with actual Latin-1.
