[Webkit-unassigned] [Bug 16621] New: Safari ignores encoding description in invalid HTML if it's too far from the start

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Thu Dec 27 00:36:02 PST 2007


http://bugs.webkit.org/show_bug.cgi?id=16621

           Summary: Safari ignores encoding description in invalid HTML if
                    it's too far from the start
           Product: WebKit
           Version: 525+ (Nightly build)
          Platform: Macintosh
        OS/Version: Mac OS X 10.4
            Status: NEW
          Severity: Normal
          Priority: P2
         Component: Page Loading
        AssignedTo: webkit-unassigned at lists.webkit.org
        ReportedBy: ap at webkit.org


>From bug 12526 comment 3.

Our heuristic for <meta> charset declarations differs from what Firefox does,
and what is documented in HTML5. Namely, we do not check for <meta> during
normal parsing and re-start parsing if the charset changes late in the game. We
only pre-parse the first 512 bytes of the document, or the whole <head>,
whichever is larger. This is usually enough, but we know of pages that aren't
decoded correctly because of this difference.

The following two pages have a very long script (~ 10kB) at the beginning, and
charset declaration in <meta> is not honored. 

http://db66.vnet.cn/
http://www.ddm.com/event/event84.asp?code=-548

Restarting parsing at any point is a big can of worms though - e.g., some
scripts with side effects may run twice because of that.


-- 
Configure bugmail: http://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.



More information about the webkit-unassigned mailing list