search for: surr_bad

Displaying 1 result from an estimated 1 matches for "surr_bad".

2020 Apr 04
0
Possible Bug In Validation of UTF-8 Sequences
...ining the high surrogate with the same high surrogate, instead of the high surrogate with the subsequent character that hopefully is the low surrogate[7]. Here is some code that could be added to regression tests: ??? surr_good <- '\ud840\udc00'??????????? # auto-converts to normal ??? surr_bad <- paste0('\ud840', '\udc00') # surrogates remain ??? good <- c('hello', 'world', surr_good, '\ud7ff', '\ue000', '\U0010ffff') ??? bad <- c(surr_bad, '\ud800', '\udfff', '\U00110000') On R3.6.3: ??? nchar(goo...