[Webkit-unassigned] [Bug 7346] Parsing DOCTYPE with missing end quote includes part of the HTML document in the systemId

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Wed Mar 7 03:12:58 PST 2007


ddkilzer at webkit.org changed:

           What    |Removed                     |Added
            Summary|Web Archive fails to save   |Parsing DOCTYPE with missing
                   |frameset page properly      |end quote includes part of
                   |                            |the HTML document in the
                   |                            |systemId

------- Comment #7 from ddkilzer at webkit.org  2007-03-07 03:12 PDT -------
The initial problem is that the DOCTYPE tag is missing an ending quote on its
systemId.  (It also is missing a space between the publicId and the systemId,
but that seems to be handled properly.)


The result is that instead of the systemId being this URL:


It's now everything from that URL to the first double quote in the HTML

<!-- Generated by javadoc on Tue Nov 08 14:06:10 EET 2005-->

Firefox manages to parse the systemId correctly (as seen when loading the
example URL and then saving it as "Web Page, Complete").  WebKit should be able
to recognize the end of the DOCTYPE tag by the ">" character and stop consuming
the rest of the HTML document until it finds an ending double quote.

I have confirmed that adding the missing double quote to the DOCTYPE tag causes
the webarchive file to be saved properly.

Configure bugmail: http://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

More information about the webkit-unassigned mailing list