[webkit-reviews] review requested: [Bug 8769]
TextEncoding::fromUnicode() - support non-BMP characters and
convert to NFC : [Attachment 8164] patch for ICU branch
bugzilla-request-daemon at opendarwin.org
bugzilla-request-daemon at opendarwin.org
Mon May 8 01:51:06 PDT 2006
Alexey Proskuryakov <ap at nypop.com> has asked for review:
Bug 8769: TextEncoding::fromUnicode() - support non-BMP characters and convert
to NFC
http://bugzilla.opendarwin.org/show_bug.cgi?id=8769
Attachment 8164: patch for ICU branch
http://bugzilla.opendarwin.org/attachment.cgi?id=8164&action=edit
------- Additional Comments from Alexey Proskuryakov <ap at nypop.com>
The ICU branch already had non-BMP entities properly supported, but the
normalization behavior was even worse - it didn't try to convert to legacy
encodings such as Latin-1 nearly as hard as CFString tries. E.g., decomposed "c
with cedilla" was encoded as c followed with an entity for cedilla, even though
Latin-1 has a code for this (composed) character.
More information about the webkit-reviews
mailing list