[Webkit-unassigned] [Bug 178174] New: Add support in named capture group identifiers for direct surrogate pairs

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Wed Oct 11 11:24:28 PDT 2017


https://bugs.webkit.org/show_bug.cgi?id=178174

            Bug ID: 178174
           Summary: Add support in named capture group identifiers for
                    direct surrogate pairs
           Product: WebKit
           Version: WebKit Nightly Build
          Hardware: Unspecified
                OS: Unspecified
            Status: NEW
          Severity: Normal
          Priority: P2
         Component: JavaScriptCore
          Assignee: webkit-unassigned at lists.webkit.org
          Reporter: msaboff at apple.com

Currently, a named capture group can have a backslash escaped RegExpUnicodeEscapeSequence (e.g. \{12345}).  It also looks like it should take approriate unicode characters directly, including those made up of surrogate pairs.

Failing Test262 -
JSTests/test262/test/built-ins/RegExp/named-groups/unicode-property-names.js
Exception: SyntaxError: Invalid regular expression: invalid group specifier name
at JSTests/test262/test/built-ins/RegExp/named-groups/unicode-property-names.js:16

This test is a UTF-8 source file with embedded non-BMP Unicode characters as both named group identifiers and property names.  I made a JS source file with a couple of the Unicode properties from the original test file, and the jsc command throws 
    SyntaxError: Invalid character '\ud801'
See attached test source file.

I think we need the jsc command to handle UTF-8 source files before we can fix this bug.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.webkit.org/pipermail/webkit-unassigned/attachments/20171011/5e272f64/attachment.html>


More information about the webkit-unassigned mailing list