[webkit-reviews] review requested: [Bug 8769] TextEncoding::fromUnicode() - support non-BMP characters and convert to NFC : [Attachment 8164] patch for ICU branch

bugzilla-request-daemon at opendarwin.org bugzilla-request-daemon at opendarwin.org
Mon May 8 01:51:06 PDT 2006


Alexey Proskuryakov <ap at nypop.com> has asked  for review:
Bug 8769: TextEncoding::fromUnicode() - support non-BMP characters and convert
to NFC
http://bugzilla.opendarwin.org/show_bug.cgi?id=8769

Attachment 8164: patch for ICU branch
http://bugzilla.opendarwin.org/attachment.cgi?id=8164&action=edit

------- Additional Comments from Alexey Proskuryakov <ap at nypop.com>
The ICU branch already had non-BMP entities properly supported, but the
normalization behavior was even worse - it didn't try to convert to legacy
encodings such as Latin-1 nearly as hard as CFString tries. E.g., decomposed "c
with cedilla" was encoded as c followed with an entity for cedilla, even though
Latin-1 has a code for this (composed) character.



More information about the webkit-reviews mailing list