search for: x_utf8

Displaying 3 results from an estimated 3 matches for "x_utf8".

Did you mean: x_iutf8
2017 Aug 01
3
special latin1 do not print as glyphs in current devel on windows
...t;20AC" in Unicode, endash "96" in cp1252, "2013" in Unicode. The same error seems to happen with enc2utf8(x) Now with iconv() the result is as expected. iconv(x, to = "UTF-8") The second problem IMO is that encoding markers get lost with the enc2* functions x_utf8 <- enc2utf8(x) Encoding(x_utf8) x_nat <- enc2native(x_utf8) Encoding(x_nat) Again, this is not the case with iconv() x_iutf8 <- iconv(x, to = "UTF-8") Encoding(x_iutf8) x_inat <- iconv(x_iutf8, from = "UTF-8") Encoding(x_inat) [[alternative HTML version deleted]]
2017 Aug 01
0
special latin1 do not print as glyphs in current devel on windows
...t; > enc2utf8(x) > > Now with iconv() the result is as expected. > > iconv(x, to = "UTF-8") > > > The second problem IMO is that encoding markers get lost with the enc2* > functions As you are changing encodings, you do not want to preserve encoding! > x_utf8 <- enc2utf8(x) > Encoding(x_utf8) > x_nat <- enc2native(x_utf8) > Encoding(x_nat) In an actual Latin-1 locale on Linux > x_utf8 <- c("??", "\u20ac", "\u2013") > Encoding(x_utf8) [1] "latin1" "UTF-8" "UTF-8" &g...
2017 Aug 01
2
special latin1 do not print as glyphs in current devel on windows
...t; I am not interested in preserving encodings. What I am worried about is that the encoding is not marked anymore, i.e. that Encoding() returns "unknown". In cp1252 encoding on Windows (note that I am using the cp1252 escape "\x80" and not the Unicode "\u20AC") > x_utf8 <- enc2utf8(c("?", "\x80")) > Encoding(x_utf8) [1] "UTF-8" "UTF-8" > x_nat <- enc2native(x_utf8) > Encoding(x_nat) [1] "unknown" "unknown" See also Kirill's message to this list: "ASCII strings are marked as ASCII...