[Webkit-unassigned] [Bug 253358] New: Optimize HTML Parser to remove 'semi-colon' in entity array

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Fri Mar 3 16:42:44 PST 2023


            Bug ID: 253358
           Summary: Optimize HTML Parser to remove 'semi-colon' in entity
           Product: WebKit
           Version: Safari Technology Preview
          Hardware: Unspecified
                OS: Unspecified
            Status: NEW
          Severity: Normal
          Priority: P2
         Component: DOM
          Assignee: webkit-unassigned at lists.webkit.org
          Reporter: ahmad.saleem792 at gmail.com

Hi Team,

While merging blink commit in bug 250640 as PR below:


Darin's suggested that we should look for future optimization as Blink's comment mentioned to remove 'semi-colon' storage in array:

 # Reuse substrings from earlier entries. This saves 1-2000
    # characters, but it's O(n^2) and not very smart. The optimal
    # solution has to solve the "Shortest Common Superstring" problem
    # and that is NP-Complete or worse.
    # This would be even more efficient if we didn't store the
    # semi-colon in the array but as a bit in the entry.


I am just creating this to explore it in future to fix this bug as well.

If someone can guide, I am happy to look into it and if someone else can grab and do it quickly. I am happy as well.


Darin's comment for safe-keeping:

'''This is a very good point. It would be useful to continue with this optimization and remove the semicolons from the array. I am almost certain this could be done with no performance impact.'''


Just wanted to raise this bug.


You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.webkit.org/pipermail/webkit-unassigned/attachments/20230304/ccd09b49/attachment.htm>

More information about the webkit-unassigned mailing list