<html>
<head>
<base href="https://bugs.webkit.org/" />
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW - [encoding] Support for GB18030"
href="https://bugs.webkit.org/show_bug.cgi?id=159891#c8">Comment # 8</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW - [encoding] Support for GB18030"
href="https://bugs.webkit.org/show_bug.cgi?id=159891">bug 159891</a>
from <span class="vcard"><a class="email" href="mailto:ishida@w3.org" title="r12a <ishida@w3.org>"> <span class="fn">r12a</span></a>
</span></b>
<pre>For me, interoperability means that users get identical results when they perform identical steps regardless of the platform or browser they are using.
This particular issue is about what happens when a Unicode code point is converted to a GB18030 sequence of bytes. For example, U+FA5C 臭 is converted by Firefox, Chrome and Edge to the GB18030 byte sequence %84%30%A1%38. Safari, however, converts that Unicode character to the byte sequence %B3%F4 (which is the GB18030 character that corresponds to U+81ED 臭.
In other words, suppose i'm writing the following:
"There are two similar han ideographs which represent the sound xiù, 臭 and 臭. In Unicode these are compatilibity equivalents."
If that document gets converted to the GB18030 encoding by any of the other major browser engines it will look the same, since GB18030 has code points for both forms. If however the conversion happens in Webkit the sentence will be changed from the original and become rather confusing for the reader, since both ideographs shown are now the same. The text no longer says what the writer intended.
"There are two similar han ideographs which represent the sound xiù, 臭 and 臭. In Unicode these are compatilibity equivalents."</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>