[webkit-qt] Webkit as javascript+css aware html parser

Balazs Kelemen kb at inf.u-szeged.hu
Thu Mar 25 12:08:04 PDT 2010


>
> Let me clarify this- a lot of web pages generate their content (that
> the user see) via javascript and / or CSS. So if we simply get the
> page source and try to construct the DOM of HTML Tags, we won't get
> any useful information. So there is a need of a parser that will not
> only construct DOM from HTML tags but will also run javascripts and
> use style information to modify the DOM.
>
> In nut shell, something that a browser does but without the actual
> rendering. Webkit has all the required components, I just need to trim
> it down to bare minimum HTML parsing, Javascript + CSS execution =>
> Final DOM.
>
> I would like to run this trim down version of webkit with a crawler,
> so there arise another requirement- it should be able to run without
> XServer.
>
> -Tarandeep
>
>

What is the final purpose of this staff?

balazs


More information about the webkit-qt mailing list