[Webkit-unassigned] [Bug 66056] The XML parser doesn't ignore user's encoding choice for XML files
bugzilla-daemon at webkit.org
bugzilla-daemon at webkit.org
Sat Aug 13 12:53:02 PDT 2011
https://bugs.webkit.org/show_bug.cgi?id=66056
--- Comment #13 from Leif Halvard Silli <xn--mlform-iua at xn--mlform-iua.no> 2011-08-13 12:53:02 PST ---
(In reply to comment #8)
> That's a good point. Another thing is that an XML document may be in a subframe inside an HTML one.
Perhaps I misunderstood what you meant by this? At first I interpreted this as support of the solution I suggested - namely, to adhere to XML 1.0 encoding rules. But perhaps not?
So, to test, I produced the page you were talking about - this HTML page is WINDOWS-1252 labelled (originally it is UTF-8 encoded) with a polyglot XHTML subframe (hence it is UTF-8 encoded and it also includes the HTML charset declaration) :
http://malform.no/testing/html5/bom/frame
BROWSER RESULTS:
* IE9 and Firefox: they treat the subframe as UTF-8 - thus respecting the encoding default, over both mother page's encoding and over the user's choice as well.
* Webkit: it lets the subframe inherit the encoding from the HTML page. (At the very least it respects the UTF-8 encoding if there is BOM.)
* Opera: behaves like Webkit (except that it doesn't even respect the BOM)
* IE8 and below sniffs it as HTML, and also respect the HTML encoding declaration. (If I drop the encoding declaration, then it defaults to WINDOWS-1251.
If you ask me, this is a big failure for Webkit and Opera, in every way. Completely illogical behaviour.
--
Configure bugmail: https://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
More information about the webkit-unassigned
mailing list