[Webkit-unassigned] [Bug 14776] New: UTF-8 encoding different, umlaut in path breaks URLs

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Thu Jul 26 23:51:36 PDT 2007


           Summary: UTF-8 encoding different, umlaut in path breaks URLs
           Product: WebKit
           Version: 522+ (nightly)
          Platform: Macintosh
        OS/Version: Mac OS X 10.4
            Status: UNCONFIRMED
          Severity: Normal
          Priority: P2
         Component: New Bugs
        AssignedTo: webkit-unassigned at lists.webkit.org
        ReportedBy: court3nay at gmail.com

I have a URL that looks like


That's a "u" with an umlaut.  This works as expected on 419.2.1, but fails on
522 (nightly)

The webserver reports this fetch on 522 (it 404s)
  GET /image_assets/8/Circuit_N%C3%BCrburgring_tiny.png HTTP/1.1 404

The webserver reports success on 419 with a different encoding -- it uses the
"u" with the combining diaresis
  GET /image_assets/8/Circuit_Nu%CC%88rburgring_tiny.png HTTP/1.1 200

%C3%BC is the utf-8 character for ├╝ as a single umlaut character.  
u%CC%88 is a u with modification, the combining diaresis (CC88 as the umlaut)

When you type Nu%CC%88 in the URL and hit enter, it converts the "u" to the
umlauted u automatically, and the file can be found by the web server.  The
request shows Nu%CC%88.

If I type it manually -- hit alt-u to get the dots, then hit "u" -- it requests
the %C3%BC version which fails on 522.

It seems like the default character/rendering has changed between versions. 
The new version is not the same as Firefox on the mac (it works)

Configure bugmail: http://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

More information about the webkit-unassigned mailing list