author | Timothy Flynn <trflynn89@pm.me> | 2022-02-05 10:43:13 -0500 |
---|---|---|
committer | Ali Mohammad Pur <Ali.mpfard@gmail.com> | 2022-02-05 22:30:10 +0330 |
commit | 2212aa2388c4f6abae577daa2cbb27f8796939d4 (patch) | |
tree | 7ae0b0f248e7e2c537a892a5b6bbaea8b15a6ea1 /Toolchain | |
parent | 54845c4bf2fe9db0ff9e661f18f883dc3e7f89ba (diff) | |
download | serenity-2212aa2388c4f6abae577daa2cbb27f8796939d4.zip |
LibRegex: Support non-ASCII whitespace characters when matching \s or \S

ECMA-262 defines \s as:

    Return the CharSet containing all characters corresponding to a code
    point on the right-hand side of the WhiteSpace or LineTerminator
    productions.

The LineTerminator production is simply: U+000A, U+000D, U+2028, or
U+2029. Unfortunately there isn't a Unicode property that covers just
those code points.

The WhiteSpace production is: U+0009, U+000B, U+000C, U+FEFF, or any
code point with the Space_Separator general category.

If the Unicode generators are disabled, this will fall back to ASCII
space code points.
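
For illustration only, the set described above can be sketched as a standalone predicate. This is not the code from this commit: the function names are hypothetical, and the Space_Separator range is hard-coded from the Unicode Character Database as an assumption, whereas the real change would query the generated Unicode data. The ASCII fallback path is not modeled here.

```cpp
// Hypothetical sketch, not LibRegex's implementation.

// LineTerminator production: U+000A, U+000D, U+2028, U+2029.
constexpr bool is_line_terminator(char32_t cp)
{
    return cp == 0x000A || cp == 0x000D || cp == 0x2028 || cp == 0x2029;
}

// Assumption: code points with general category Space_Separator (Zs),
// hard-coded here instead of being looked up in generated Unicode data.
constexpr bool is_space_separator(char32_t cp)
{
    return cp == 0x0020 || cp == 0x00A0 || cp == 0x1680
        || (cp >= 0x2000 && cp <= 0x200A)
        || cp == 0x202F || cp == 0x205F || cp == 0x3000;
}

// WhiteSpace production: U+0009, U+000B, U+000C, U+FEFF, or any Zs code point.
constexpr bool is_white_space(char32_t cp)
{
    return cp == 0x0009 || cp == 0x000B || cp == 0x000C || cp == 0xFEFF
        || is_space_separator(cp);
}

// \s matches the union of WhiteSpace and LineTerminator; \S is its complement.
constexpr bool is_ecma262_whitespace(char32_t cp)
{
    return is_white_space(cp) || is_line_terminator(cp);
}
```

Under this sketch, \S for a given code point is simply `!is_ecma262_whitespace(cp)`. Code points such as U+0085 or U+200B are not matched by \s, since they are in neither production.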
Diffstat (limited to 'Toolchain')
0 files changed, 0 insertions, 0 deletions