[Webkit-unassigned] [Bug 8769] New: TextEncoding::fromUnicode() - support non-BMP characters and convert to NFC

bugzilla-daemon at opendarwin.org bugzilla-daemon at opendarwin.org
Sun May 7 04:05:29 PDT 2006


           Summary: TextEncoding::fromUnicode() - support non-BMP characters
                    and convert to NFC
           Product: WebKit
           Version: 420+ (nightly)
          Platform: Macintosh
               URL: http://www.w3.org/TR/charmod-norm
        OS/Version: Mac OS X 10.4
            Status: NEW
          Severity: enhancement
          Priority: P2
         Component: WebCore Misc.
        AssignedTo: webkit-unassigned at opendarwin.org
        ReportedBy: ap at nypop.com

Since the CFString branch seems to stay for a while, here are some improvements
to its fromUnicode().

- properly combine surrogate pairs coming from CFString;
- normalize the string to NFC (canonically composed form).

I don't think any browser actually does the latter, but this is obviously more
important on Mac OS X than on Windows. If Mail.app uses this code, this may
correct problems with sending files having accented characters in their names
to Windows recipients.

form-data-encoding.html has been modified to cover more Latin-1 cases, and
another case has been added for UTF-8.

Configure bugmail: http://bugzilla.opendarwin.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

More information about the webkit-unassigned mailing list