[Webkit-unassigned] [Bug 166322] [GTK] Consider probabilistically guessing text encoding if unspecified by document

bugzilla-daemon at webkit.org
Wed Dec 21 10:13:26 PST 2016


https://bugs.webkit.org/show_bug.cgi?id=166322

Michael Catanzaro <mcatanzaro at igalia.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |gns at gnome.org

--- Comment #4 from Michael Catanzaro <mcatanzaro at igalia.com> ---
Also Alexey, could you tell me what encoding Safari uses if it's unspecified? I presume it's either ISO 8859-1 or UTF-8? In WebKitGTK+ we use ISO 8859-1, and I think we can't change it because using UTF-8 breaks some websites (e.g. some Brazilian sites; I think Gustavo can provide an example link).

(In reply to comment #3)
> In Firefox, the user has to manually choose which language to detect
> encoding for, and also, there are heuristics based on browsing history.

Hmmm. [1] is displayed properly by Firefox, at least for me, but not by Epiphany. (Scroll down a bit to see broken names.) There is no initial byte order mark (U+FEFF) to indicate the right Unicode encoding, so either Firefox is able to detect the right encoding probabilistically, or it must assume UTF-8 by default.

[1] http://ftp-nyc.osuosl.org/pub/gnome/core/3.23/3.23.3/NEWS
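
To make the two possibilities above concrete, here is a minimal sketch in Python of one simple heuristic: honor a byte order mark if present, otherwise accept UTF-8 only if the bytes decode strictly, otherwise fall back to ISO 8859-1. This is an illustration under stated assumptions, not what Firefox, Safari, or WebKitGTK+ actually do; the function name guess_encoding and its fallback parameter are hypothetical, and UTF-32 is ignored for brevity.

```python
import codecs

# Hypothetical helper for illustration only; not WebKit or Firefox code.
# BOMs checked in order; UTF-32 is deliberately left out of this sketch.
BOMS = [
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def guess_encoding(data: bytes, fallback: str = "iso-8859-1") -> str:
    """Guess the encoding of a document that declares none."""
    # 1. A byte order mark (U+FEFF), if present, settles the question.
    for bom, name in BOMS:
        if data.startswith(bom):
            return name
    # 2. Non-ASCII text in a legacy 8-bit encoding almost never forms
    #    valid UTF-8 sequences, so a strict decode is a cheap test.
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        # 3. Not valid UTF-8: use the legacy default (an assumption here,
        #    matching the ISO 8859-1 default mentioned above).
        return fallback
    return "utf-8"
```

With this approach, a BOM-less UTF-8 file such as [1] would be treated as UTF-8, while a page whose bytes are not valid UTF-8 would still get the legacy default. It is not a probabilistic detector: it cannot tell one 8-bit legacy encoding from another.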
