[Webkit-unassigned] [Bug 55441] EUC-JP implementation doesn't fully match CP51932

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Tue Mar 1 18:36:54 PST 2011


https://bugs.webkit.org/show_bug.cgi?id=55441





--- Comment #2 from NARUSE, Yui <naruse at airemix.jp>  2011-03-01 18:36:54 PST ---
(In reply to comment #1)
> Are any Web sites known to be affected by this? It would be good to have some URLs for real life testing.

For example,
http://d.hatena.ne.jp/eggmoon/20061004/p1
http://blog.livedoor.jp/blog_ch/archives/50992738.html
http://d.hatena.ne.jp/nsjisc/20100605/1275745170

People on business know NEC special characters and NEC selected IBM extended characters
are Vender depended, and don't use. But casual users don't know it and post such characters to blog
or other CGM applications.

The content of this missing characters on WebKit are following.
You can imagine casual users use circled characters and Roman numbers
http://legacy-encoding.sourceforge.jp/wiki/index.php?NEC%C6%C3%BC%EC%CA%B8%BB%FA%28cp51932%29
http://legacy-encoding.sourceforge.jp/wiki/index.php?NEC%C1%AA%C4%EAIBM%B3%C8%C4%A5%CA%B8%BB%FA%28cp51932%29

> > Current Webkit seems to use ICU's ibm-33722_P12A_P12A-2004_U2.
> > It consists 
> > * US-ASCII
> > * JIS X 0201 Katakana
> > * JIS X 0208
> > * IBM extended characters (IBM's mapping)
> > http://demo.icu-project.org/icu-bin/convexp?conv=ibm-33722_P12A_P12A-2004_U2&s=ALL
> 
> The same ICU converter explorer page says that windows-51932 is an alias name for this encoding. Is it a mistake in ICU that windows-51932 is different from what it should be?

Encoding aliasing depends the converter's policy; especially ICU includes historical reasons from AIX or other IBM products.
What I can say is the mapping is different from original Microsoft Codepage 51932, and is not suitable for Web.
Because its decoder can't see some characters and its encoder sends strange characters which aren't available on other than WebKit.

> Has an ICU bug been filed about that?

I added http://bugs.icu-project.org/trac/ticket/8390

-- 
Configure bugmail: https://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.



More information about the webkit-unassigned mailing list