[Webkit-unassigned] [Bug 18270] New: treating x-user-defined different from windows-1255 breaks some Indian web sites
bugzilla-daemon at webkit.org
bugzilla-daemon at webkit.org
Tue Apr 1 18:07:55 PDT 2008
http://bugs.webkit.org/show_bug.cgi?id=18270
Summary: treating x-user-defined different from windows-1255
breaks some Indian web sites
Product: WebKit
Version: 525.x (Safari 3.1)
Platform: All
URL: http://loksatta.com/
OS/Version: All
Status: NEW
Severity: Normal
Priority: P2
Component: Page Loading
AssignedTo: webkit-unassigned at lists.webkit.org
ReportedBy: jungshik.shin at gmail.com
1. Download MillenniumVarun font from http://loksatta.com/MillenniumVarun.zip
2. Go to http://loksatta.com/
3. Half of characters are rendered with empty boxes or question marks.
4. Set the encoding manually to 'Western European'
5. All the characters are rendered correctly.
The page has 3 meta charset declarations. The first is 'x-user-defined' and
Safari (as well as FF) maps [0x80-0xff] to a PUA block ( U+F780-U+F7FF ?), but
MillenniumVarun font installed above does not have any glyph in that range.
It's their fault. They should have done either of the following:
- use 'Symbol' cmap (rather than Microsoft Windows 1255 cmap) in their font
- specify their page encoding as windows-1255 (or iso-8859-1) rather than
x-user-defined. webkit treats them synonymously.
Nonetheless, MS IE does not have a problem with the above page (and pages like
that). I didn't
enable webfont for IE so that it's 'thanks to' IE's handling of
'x-user-defined'.
Perhaps, webkit also has to treat x-user-defined as an alias to windows-1255.
That would save some code, too (TextCodecUserDefined.cpp).
There are still a lot of Indian web pages with this problem, I believe although
Unicode becomes more and more widespread.
--
Configure bugmail: http://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
More information about the webkit-unassigned
mailing list