[Webkit-unassigned] [Bug 108347] CSS3 'word-break: break-all' spoils CJK line-break restrictions

bugzilla-daemon at webkit.org bugzilla-daemon at webkit.org
Thu Mar 14 11:14:09 PDT 2013


https://bugs.webkit.org/show_bug.cgi?id=108347





--- Comment #22 from murakami at antenna.co.jp  2013-03-14 11:16:35 PST ---
For 'word-break: normal', the current WebKit behavior, "a(b)c" unbreakable, 
is good and should not be changed.

You should read the Unicode Standard Annex #14, Unicode Line Breaking Algorithm:
http://www.unicode.org/reports/tr14/

    LB30  Do not break between letters, numbers, or ordinary symbols 
    and opening or closing parentheses.

        (AL | HL | NU) × OP
        CP × (AL | HL | NU)

Here, AL=letters and ordinary symbols (non-CJK), HL=Hebrew letters, 
NU=numbers, OP=Open Punctuation, CP=Close Parenthesis, ×=No break allowed.

In 'word-break: break-all' mode, this LB30 rule should be ignored.
The expected behavior for break-all mode can be obtained by treating 
all AL, HL, NU and AI(Ambiguous) characters same as ID (Ideographic).
The following rules will be also affected:

    LB23 Do not break within ‘a9’, ‘3a’, or ‘H%’
        ...
        (AL | HL) × NU
        NU × (AL | HL)

    LB25  Do not break between the following pairs of classes relevant to numbers:
        ...
        IS × NU
        NU × NU
        SY × NU
        ...

    LB28  Do not break between alphabetics (“at”).
        (AL | HL) × (AL | HL)

    LB29  Do not break between numeric punctuation and alphabetics (“e.g.”).
        IS × (AL | HL)

-- 
Configure bugmail: https://bugs.webkit.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


More information about the webkit-unassigned mailing list