<html>
<head>
<base href="https://bugs.webkit.org/" />
</head>
<body><span class="vcard"><a class="email" href="mailto:mcatanzaro@igalia.com" title="Michael Catanzaro <mcatanzaro@igalia.com>"> <span class="fn">Michael Catanzaro</span></a>
</span> changed
<a class="bz_bug_link
bz_status_UNCONFIRMED "
title="UNCONFIRMED - [GTK] Consider probabilistically guessing text encoding if unspecified by document"
href="https://bugs.webkit.org/show_bug.cgi?id=166322">bug 166322</a>
<br>
<table border="1" cellspacing="0" cellpadding="8">
<tr>
<th>What</th>
<th>Removed</th>
<th>Added</th>
</tr>
<tr>
<td style="text-align:right;">CC</td>
<td>
</td>
<td>gns@gnome.org
</td>
</tr></table>
<p>
<div>
<b><a class="bz_bug_link
bz_status_UNCONFIRMED "
title="UNCONFIRMED - [GTK] Consider probabilistically guessing text encoding if unspecified by document"
href="https://bugs.webkit.org/show_bug.cgi?id=166322#c4">Comment # 4</a>
on <a class="bz_bug_link
bz_status_UNCONFIRMED "
title="UNCONFIRMED - [GTK] Consider probabilistically guessing text encoding if unspecified by document"
href="https://bugs.webkit.org/show_bug.cgi?id=166322">bug 166322</a>
from <span class="vcard"><a class="email" href="mailto:mcatanzaro@igalia.com" title="Michael Catanzaro <mcatanzaro@igalia.com>"> <span class="fn">Michael Catanzaro</span></a>
</span></b>
<pre>Also Alexey, could you tell me what encoding Safari uses if it's unspecified? I presume it's either ISO 8859-1 or UTF-8? In WebKitGTK+ we use ISO 8859-1, and think we can't change it because using UTF-8 breaks some websites (e.g. some Brazillian sites, I think Gustavo can provide an example link).
(In reply to <a href="show_bug.cgi?id=166322#c3">comment #3</a>)
<span class="quote">> In Firefox, the user has to manually choose which language to detect
> encoding for, and also, there are heuristics based on browsing history.</span >
Hmmm. [1] is displayed properly by Firefox at least for me, but not in Epiphany. (Scroll down a bit to see broken names.) There is no initial FEFF byte to indicate the right Unicode encoding, so either Firefox is able to detect the right encoding probabilistically, or it must assume UTF-8 by default.
[1] <a href="http://ftp-nyc.osuosl.org/pub/gnome/core/3.23/3.23.3/NEWS">http://ftp-nyc.osuosl.org/pub/gnome/core/3.23/3.23.3/NEWS</a></pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>