[Webkit-unassigned] [Bug 73519] New: [Qt] QtWebKit does not apply correct on some pages with CJK characters

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Wed Nov 30 21:19:18 PST 2011


https://bugs.webkit.org/show_bug.cgi?id=73519

           Summary: [Qt] QtWebKit does not apply correct on some pages
                    with CJK characters
           Product: WebKit
           Version: 528+ (Nightly build)
          Platform: PC
        OS/Version: Linux
            Status: UNCONFIRMED
          Severity: Normal
          Priority: P3
         Component: WebKit Qt
        AssignedTo: webkit-unassigned at lists.webkit.org
        ReportedBy: adawit at kde.org


The following bug was reported downstream against the kwebkitpart, but was validated to be an upstream issue using QtTestBrowser:

https://bugs.kde.org/show_bug.cgi?id=287690

          Summary: KWebkitPart does not apply correct locale encoding
                        settings on some pages with CJK characters.
          Product: kwebkitpart
          Version: unspecified
         Platform: Gentoo Packages
       OS/Version: Linux
           Status: UNCONFIRMED
         Severity: normal
         Priority: NOR
        Component: general
       AssignedTo: webkit-devel at kde.org
       ReportedBy: moriramar at gmail.com


Version:           unspecified (using KDE 4.7.2)
OS:                Linux

When I open some pages with both simplified Chinese characters and traditional Chinese characters, some characters are not displayed correctly. Pages
containing both Chinese characters and Japanese characters might cause this problem as well.

Personal guess:
These pages might be encoded in zh_CN.GBK or zh_CN.GB18030 (which contains more character encodings), while KWebkitPart might apply zh_CN.GB2312 (which is
generally considered as a subset of GBK).

Reproducible: Always

Steps to Reproduce:
1. Install a font covering CJK characters. Bitstream Cyberbit, WenQuanYi Zen Hei, WenQuanYi Microhei or Droid is OK.
2. Make sure zh_CN.GBK, zh_CN.GB2312, zh_CN.GB18030, zh_CN.UTF-8 locales are available on the system.
3. Open Konqueror 4.7.2 and enable Webkit mode.
4. Go to http://www.acfun.tv/v/ac265957/ , which might be a little slow.

Actual Results:
In the top bold title line of the page content, a black box with white question mark appears. In the next line, there are two black boxes seperated by a "W" character, followed by a "o" character. Trying "View >> Encoding >> Simplified Chinese >>" any GB* locales does not solve the problem. Opening this kind of pages has a chance to crash Konqueror.

Expected Results:
No these black boxes and "W" or "o" characters in these two line. KHTML can show this page well when encoding is set to "Simplified Chinese >> GBK" or "Simplified Chinese >> GB18030", which can be referred to.

-- 
Configure bugmail: https://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the webkit-unassigned mailing list