2023 Jul 25
Bug in perl=TRUE regexp matching?
...h \d, \s, or \w, and always match \D, \S, and \W, although this may
be different for characters in the range 128-255 when locale-specific
matching is happening. These escape sequences retain their original
meanings from before Unicode support was available, mainly for
efficiency reasons. If the PCRE2_UCP option is set, the behaviour is
changed so that Unicode properties are used to determine character
types, as follows...
So this doesn't seem like a bug to me.
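For anyone who wants to see the ASCII-versus-Unicode distinction from the quoted documentation in action, here is a rough sketch. It is only an analogy: it uses Python's built-in re module, not PCRE2 and not R's perl=TRUE code path, and the helper name matches() is made up for illustration. Python's re.ASCII flag restricts \d and \w to ASCII, and leaving it off uses Unicode properties, which is roughly the kind of switch the PCRE2_UCP option is described as making.

```python
import re

# Analogy only: Python's `re` is not PCRE2. For str patterns it uses Unicode
# character classes by default; re.ASCII restricts \d, \s and \w to ASCII.
ARABIC_INDIC_THREE = "\u0663"  # a Unicode decimal digit outside ASCII
E_ACUTE = "\u00e9"             # a letter in the 128-255 range


def matches(pattern: str, text: str, ascii_only: bool) -> bool:
    """Hypothetical helper: does `pattern` match the whole of `text`?"""
    flags = re.ASCII if ascii_only else 0
    return re.fullmatch(pattern, text, flags) is not None


if __name__ == "__main__":
    for label, ascii_only in (("ASCII-only classes", True),
                              ("Unicode classes", False)):
        print(label,
              "| \\d on U+0663:", matches(r"\d", ARABIC_INDIC_THREE, ascii_only),
              "| \\w on U+00E9:", matches(r"\w", E_ACUTE, ascii_only))
```

Whether R ends up in the ASCII-only or the Unicode-property behaviour for a given call is exactly what is being asked in this thread, so the sketch says nothing about that.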
Does that mean that the following is incorrect?
> one UTF-8 string will force all the matching to be done in Unicode
It depend...