[Webkit-unassigned] [Bug 22339] Support Indian web sites with EOT by on-the-fly transcoding to Unicode

bugzilla-daemon at webkit.org
Wed Feb 25 16:47:21 PST 2009


https://bugs.webkit.org/show_bug.cgi?id=22339





------- Comment #16 from jshin at chromium.org  2009-02-25 16:47 PDT -------
BTW, I'm afraid Tamil script is not such a good target script for a
proof-of-concept implementation because it lacks some of the complexities
that other Indic scripts have. Devanagari script (for Hindi) would be better,
IMHO. I suspect the interface defined for FontEncoder is not sufficient for
scripts more complex than Tamil.

Well, I realize that exactly the same argument can be used to advocate for
Tamil as the target for this pilot implementation: it's simpler and easier to
implement. And because the code was ported from Padma, which can handle other
Indic scripts, perhaps I should not be worried. :-)

As for TSCII (comment #3 and comment #4), it's different from other Indic font
encodings in that it does not 'infringe' upon the ASCII range. In a
TSCII-encoded html file, the bytes for '<', '>', 'd', 'i', 'v' have the same
meaning as in ASCII. So, we can convert a TSCII-encoded html document as **a
whole** to Unicode. And, Tamil being simpler than other Indic scripts, this may
be the case for other Tamil font encodings as well.
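
To make the point above concrete, here is a minimal Python sketch of a
document-wide decode for an ASCII-compatible encoding. The names
TOY_HIGH_BYTES and decode_ascii_compatible are hypothetical, and the table is
a toy stand-in, not the real TSCII mapping. A real decoder would also have to
handle bytes that map to several code points and reorder vowel signs, which
this sketch ignores.

```python
# Toy table for illustration only: this is NOT the real TSCII mapping.
TOY_HIGH_BYTES = {0xAB: "\u0B85", 0xAC: "\u0B86"}


def decode_ascii_compatible(data: bytes, high_table: dict) -> str:
    """Decode a whole document in one pass.

    Bytes below 0x80 keep their ASCII meaning, so markup such as
    '<div>' survives untouched; only high bytes go through the table.
    """
    out = []
    for b in data:
        if b < 0x80:
            out.append(chr(b))  # ASCII range is not 'infringed' upon
        else:
            # Unknown high bytes become U+FFFD (replacement character).
            out.append(high_table.get(b, "\uFFFD"))
    return "".join(out)
```

Because the markup bytes pass through unchanged, the result can be handed to
the HTML parser as ordinary Unicode text, with no knowledge of which font is
in effect.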

In most other font encodings, document-wide conversion does not work
because what 0x61, 0x62, 0x41 mean varies within a single document depending on
which font is in effect. When they're in text nodes that are enclosed by '<font
face="fontHindiFoo">' or styled with 'font-family: fontHindiFoo', they
represent fragments of Indic syllables/grapheme clusters. So, transcoding
has to be limited to those text nodes.
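
A rough Python sketch of that font-scoped approach follows, assuming a
hypothetical table FONT_TABLES keyed by lowercased font name. The class and
function names are made up for illustration, the mapping is a toy, and only
'<font face=...>' is handled (not CSS 'font-family'). Comments, doctypes and
self-closing tags are not round-tripped faithfully; this is not how the patch
in this bug is implemented.

```python
from html.parser import HTMLParser

# Hypothetical per-font table; real font encodings map glyph fragments
# and need reordering, which this toy mapping does not attempt.
FONT_TABLES = {"fonthindifoo": {"a": "\u0915", "b": "\u0916"}}


class FontScopedTranscoder(HTMLParser):
    def __init__(self, tables):
        super().__init__(convert_charrefs=False)
        self.tables = tables
        self.font_stack = []  # one entry per open <font>; None if unknown face
        self.out = []

    def handle_starttag(self, tag, attrs):
        if tag == "font":
            face = (dict(attrs).get("face") or "").lower()
            self.font_stack.append(face if face in self.tables else None)
        self.out.append(self.get_starttag_text())  # keep markup verbatim

    def handle_endtag(self, tag):
        if tag == "font" and self.font_stack:
            self.font_stack.pop()
        self.out.append(f"</{tag}>")

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_data(self, data):
        face = self.font_stack[-1] if self.font_stack else None
        if face is not None:
            # Only text under a known legacy font is transcoded.
            table = self.tables[face]
            data = "".join(table.get(ch, ch) for ch in data)
        self.out.append(data)


def transcode_font_scoped(html, tables=FONT_TABLES):
    parser = FontScopedTranscoder(tables)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

Note that the same characters 'ab' are left alone outside the font element and
transcoded inside it, which is exactly why a document-wide conversion cannot
work for such encodings.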

In practice, I admit that this distinction is not that important, except that
having a TSCII text decoder would speed up the rendering of TSCII-encoded
documents, because with it they would not have to go through the transcoding
being added in this bug.

In conclusion, I'm taking back what I wrote about the need to implement TSCII
in a separate bug. 

