[Webkit-unassigned] [Bug 166322] New: Probabilistically guess text encoding if unspecified by document

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Wed Dec 21 08:07:00 PST 2016


https://bugs.webkit.org/show_bug.cgi?id=166322

            Bug ID: 166322
           Summary: Probabilistically guess text encoding if unspecified
                    by document
    Classification: Unclassified
           Product: WebKit
           Version: WebKit Nightly Build
          Hardware: PC
                OS: Linux
            Status: NEW
          Severity: Normal
          Priority: P2
         Component: WebCore Misc.
          Assignee: webkit-unassigned at lists.webkit.org
          Reporter: mcatanzaro at igalia.com

If neither an HTML document nor its HTTP headers specify a text encoding, we should try to detect the encoding probabilistically, as Firefox does, instead of just getting it wrong. Unlike Firefox, we probably don't want to maintain our own encoding detectors. We already depend on ICU, so let's use ICU's character set detection API. [1]

[1] http://userguide.icu-project.org/conversion/detection
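
A rough sketch of the fallback order proposed above, not WebKit code. The names chooseEncoding, EncodingGuess and Detector are hypothetical, as are the confidence threshold of 50 and the "windows-1252" last-resort default. The detector is injected as a callback so the sketch stays self-contained; in a real build it could plausibly wrap ICU's ucsdet_open, ucsdet_setText, ucsdet_detect, ucsdet_getName and ucsdet_getConfidence, which report a confidence from 0 to 100.

```cpp
#include <functional>
#include <optional>
#include <string>

// Hypothetical result type: a charset name plus a 0-100 confidence,
// the same range ICU reports for its matches.
struct EncodingGuess {
    std::string name;
    int confidence;
};

// Hypothetical detector signature. A real implementation might wrap ICU's
// ucsdet_* functions; here it is passed in so no ICU headers are required.
using Detector =
    std::function<std::optional<EncodingGuess>(const std::string& bytes)>;

// Explicit declarations always win. Detection runs only when neither the
// HTTP headers nor the document name an encoding, and its answer is used
// only if it is confident enough. Threshold and fallback are assumptions.
std::string chooseEncoding(const std::optional<std::string>& httpCharset,
                           const std::optional<std::string>& documentCharset,
                           const std::string& bytes,
                           const Detector& detect,
                           int minConfidence = 50,
                           const std::string& fallback = "windows-1252")
{
    if (httpCharset && !httpCharset->empty())
        return *httpCharset;
    if (documentCharset && !documentCharset->empty())
        return *documentCharset;

    std::optional<EncodingGuess> guess = detect(bytes);
    if (guess && guess->confidence >= minConfidence)
        return guess->name;

    return fallback;
}
```

Keeping the detector behind a callback means the guess never overrides an encoding that the server or the document states explicitly, and a low-confidence guess falls through to the existing default instead of replacing it.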

-- 
You are receiving this mail because:
You are the assignee for the bug.