[Webkit-unassigned] [Bug 24906] 0x5C of EUC-JP is not Yen Sign but U+005C

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Sun Mar 29 09:26:49 PDT 2009


https://bugs.webkit.org/show_bug.cgi?id=24906





------- Comment #10 from naruse at airemix.jp  2009-03-29 09:26 PDT -------
(In reply to comment #9)
> I see, interesting. So, this change will fix some pages, but break others
> (those that use e.g. MS Gothic), correct?

The glyph of 0x5c of EUC-JP can be both backslash and yen sign.
So they are still correct, even if the glyph may differ on various
environments.
# this is specified CP51932 and original eucJP definition.

But 0x5C must be logically backslash: U+005C.
Current implementation breaks this and this affect copy from the page and paste
to something.

> It is a tough choice to make - perhaps we'll need to apply font-specific
> transcoding to text using those fonts.

Web Browser is not only for seeing, but for machine.
Hacks like this --- replace characters logically --- affect to machine.
For example, copy texts from the page, scripting on the page, and so on.

We must consider both rendered glyphs and logically displayed characters.


For your information in English document, CP932 (base encoding of CP51932)
definition.
http://msdn.microsoft.com/ja-jp/goglobal/cc305152(en-us).aspx
5C = U+005C : REVERSE SOLIDUS (YEN SIGN)
This shows 0x5C of CP932 (Shift-JIS variant) is logically U+005C : REVERSE
SOLIDUS,
but uses YEN SIGN glyph.


-- 
Configure bugmail: https://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.



More information about the webkit-unassigned mailing list