Gustaf Rydevik
2009-Jan-16 13:13 UTC
[R] data frames with å, ä, and ö (=non-ASCII-characters) from windows to mac os x
Hi, I ran into this issue previously and managed to solve it, but I've forgotten how and am getting frustrated... I have a data frame (see below) with scandinavian characters in R (2.7.1) running on a Win Xp-computer. I save the data frame in an RData-file on a usb stick, and load() it in R (2.8.0) running on OS X 10.5. Now the name of the data frame and all factor labels with scandinavian characters are scrambled. How do I make R in OS X read my data frame?>From what I've managed to find in the list archives and the FAQ I either1) run Sys.setlocale("LC_ALL","en_US.UTF-8") ### Doesn't change anything or 2) run defaults write org.R-project.R force.LANG en_US.UTF-8 in the terminal, which doesn't help either. I must admit that I couldn't quite follow what documentation i found on locales, so I might have messed up somewhere along the line. Many thanks in advance for your help! Regards, Gustaf -------- L?nkarta <- structure(list(LANKOD = structure(c(11L, 19L, 10L, 13L, 21L, 7L, 9L, 18L, 8L, 3L, 16L, 6L, 5L, 4L, 15L, 2L, 20L, 17L, 1L, 14L, 12L), .Label = c("AB", "AC", "BD", "C", "D", "E", "F", "G", "H", "I", "K", "M", "N", "O", "S", "T", "U", "W", "X", "Y", "Z" ), class = "factor"), L?n = structure(c(1L, 4L, 3L, 5L, 6L, 7L, 8L, 2L, 9L, 10L, 20L, 21L, 13L, 14L, 15L, 16L, 17L, 18L, 12L, 19L, 11L), .Label = c("Blekinge l?n", "Dalarnas l?n", "Gotlands l?n", "G?vleborgs l?n", "Hallands l?n", "J?mtlands l?n", "J?nk?pings l?n", "Kalmar l?n", "Kronobergs l?n", "Norrbottens l?n", "Sk?ne l?n", "Stockholms l?n", "S?dermanlands l?n", "Uppsala l?n", "V?rmlands l?n", "V?sterbottens l?n", "V?sternorrlands l?n", "V?stmanlands l?n", "V?stra G?talands l?n", "?rebro l?n", "?sterg?tlands l?n"), class "factor")), .Names = c("LANKOD", "L?n"), class = "data.frame", row.names = c("0", "1", "2", "3", "4", "5", "6", "7", "8", "9", "10", "11", "12", "13", "14", "15", "16", "17", "18", "19", "20")) -- Gustaf Rydevik, M.Sci. tel: +46(0)703 051 451 address:Essingetorget 40,112 66 Stockholm, SE skype:gustaf_rydevik
Ivan Alves
2009-Jan-16 13:39 UTC
[R] data frames with å, ä, and ö (=non-ASCII-characters) from windows to mac os x
Hi, On my system (see below), it works fine (inputing the code below at the R prompt). Make sure that the encoding of the input file is encoded UTF-8. Rgds, Ivan > sessionInfo() R version 2.8.1 Patched (2009-01-14 r47602) i386-apple-darwin9.6.0 locale: en_GB.UTF-8/en_GB.UTF-8/C/C/en_GB.UTF-8/en_GB.UTF-8 attached base packages: [1] stats graphics grDevices utils datasets methods base > structure(list(LANKOD = structure(c(11L, 19L, 10L, 13L, 21L,7L, 9L, 18L, 8L, 3L, 16L, 6L, 5L, 4L, 15L, 2L, 20L, 17L, 1L,14L, 12L), .Label = c("AB", "AC", "BD", "C", "D", "E", "F", "G","H", "I", "K", "M", "N", "O", "S", "T", "U", "W", "X", "Y", "Z"), class = "factor"), L?n = structure(c(1L, 4L, 3L, 5L, 6L, 7L,8L, 2L, 9L, 10L, 20L, 21L, 13L, 14L, 15L, 16L, 17L, 18L, 12L,19L, 11L), .Label = c("Blekinge l?n", "Dalarnas l?n", "Gotlands l?n","G?vleborgs l?n","Hallands l?n", "J?mtlands l?n", "J?nk?pings l?n","Kalmar l?n", "Kronobergs l?n", "Norrbottens l?n", "Sk?ne l?n","Stockholms l?n", "S?dermanlands l?n", "Uppsala l?n", "V?rmlands l?n","V?sterbottens l?n", "V?sternorrlands l?n", "V?stmanlands l?n","V?stra G?talands l?n", "?rebro l?n", "?sterg?tlands l?n"), class ="factor")), .Names = c("LANKOD","L?n"), class = "data.frame", row.names = c("0", "1", "2", "3","4", "5", "6", "7", "8", "9", "10", "11", "12", "13", "14", "15","16", "17", "18", "19", "20")) LANKOD L?n 0 K Blekinge l?n 1 X G?vleborgs l?n 2 I Gotlands l?n 3 N Hallands l?n 4 Z J?mtlands l?n 5 F J?nk?pings l?n 6 H Kalmar l?n 7 W Dalarnas l?n 8 G Kronobergs l?n 9 BD Norrbottens l?n 10 T ?rebro l?n 11 E ?sterg?tlands l?n 12 D S?dermanlands l?n 13 C Uppsala l?n 14 S V?rmlands l?n 15 AC V?sterbottens l?n 16 Y V?sternorrlands l?n 17 U V?stmanlands l?n 18 AB Stockholms l?n 19 O V?stra G?talands l?n 20 M Sk?ne l?n > L?nkarta <- structure(list(LANKOD = structure(c(11L, 19L, 10L, 13L, 21L,7L, 9L, 18L, 8L, 3L, 16L, 6L, 5L, 4L, 15L, 2L, 20L, 17L, 1L,14L, 12L), .Label = c("AB", "AC", "BD", "C", "D", "E", "F", "G","H", "I", "K", "M", "N", "O", "S", "T", "U", "W", "X", "Y", "Z"), class = "factor"), L?n = structure(c(1L, 4L, 3L, 5L, 6L, 7L,8L, 2L, 9L, 10L, 20L, 21L, 13L, 14L, 15L, 16L, 17L, 18L, 12L,19L, 11L), .Label = c("Blekinge l?n", "Dalarnas l?n", "Gotlands l?n","G?vleborgs l?n","Hallands l?n", "J?mtlands l?n", "J?nk?pings l?n","Kalmar l?n", "Kronobergs l?n", "Norrbottens l?n", "Sk?ne l?n","Stockholms l?n", "S?dermanlands l?n", "Uppsala l?n", "V?rmlands l?n","V?sterbottens l?n", "V?sternorrlands l?n", "V?stmanlands l?n","V?stra G?talands l?n", "?rebro l?n", "?sterg?tlands l?n"), class ="factor")), .Names = c("LANKOD","L?n"), class = "data.frame", row.names = c("0", "1", "2", "3","4", "5", "6", "7", "8", "9", "10", "11", "12", "13", "14", "15","16", "17", "18", "19", "20")) > ls() [1] "L?nkarta" > On 16 Jan 2009, at 14:13, Gustaf Rydevik wrote:> Hi, > I ran into this issue previously and managed to solve it, but I've > forgotten how and am getting frustrated... > > I have a data frame (see below) with scandinavian characters in R > (2.7.1) running on a Win Xp-computer. I save the data frame in an > RData-file on a usb stick, and load() it in R (2.8.0) running on OS X > 10.5. Now the name of the data frame and all factor labels with > scandinavian characters are scrambled. How do I make R in OS X read my > data frame? >> From what I've managed to find in the list archives and the FAQ I >> either > 1) run > Sys.setlocale("LC_ALL","en_US.UTF-8") ### Doesn't change anything > or > 2) run > defaults write org.R-project.R force.LANG en_US.UTF-8 > in the terminal, which doesn't help either. > I must admit that I couldn't quite follow what documentation i found > on locales, so I might have messed up somewhere along the line. > > Many thanks in advance for your help! > > Regards, > > Gustaf > > > -------- > > L?nkarta <- > structure(list(LANKOD = structure(c(11L, 19L, 10L, 13L, 21L, > 7L, 9L, 18L, 8L, 3L, 16L, 6L, 5L, 4L, 15L, 2L, 20L, 17L, 1L, > 14L, 12L), .Label = c("AB", "AC", "BD", "C", "D", "E", "F", "G", > "H", "I", "K", "M", "N", "O", "S", "T", "U", "W", "X", "Y", "Z" > ), class = "factor"), L?n = structure(c(1L, 4L, 3L, 5L, 6L, 7L, > 8L, 2L, 9L, 10L, 20L, 21L, 13L, 14L, 15L, 16L, 17L, 18L, 12L, > 19L, 11L), .Label = c("Blekinge l?n", "Dalarnas l?n", "Gotlands l?n", > "G?vleborgs l?n", "Hallands l?n", "J?mtlands l?n", "J?nk?pings l?n", > "Kalmar l?n", "Kronobergs l?n", "Norrbottens l?n", "Sk?ne l?n", > "Stockholms l?n", "S?dermanlands l?n", "Uppsala l?n", "V?rmlands l?n", > "V?sterbottens l?n", "V?sternorrlands l?n", "V?stmanlands l?n", > "V?stra G?talands l?n", "?rebro l?n", "?sterg?tlands l?n"), class > "factor")), .Names = c("LANKOD", > "L?n"), class = "data.frame", row.names = c("0", "1", "2", "3", > "4", "5", "6", "7", "8", "9", "10", "11", "12", "13", "14", "15", > "16", "17", "18", "19", "20")) > > > > -- > Gustaf Rydevik, M.Sci. > tel: +46(0)703 051 451 > address:Essingetorget 40,112 66 Stockholm, SE > skype:gustaf_rydevik > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code.
Prof Brian Ripley
2009-Jan-16 13:48 UTC
[R] [R-SIG-Mac] data frames with å, ä, and ö (=non-ASCII-characters) from windows to mac os x
You need to use CP1252 not UTF-8 to read the data. It tells you how to do so on the help page ... under 'encoding'. So something like A <- read.table(con <- file("myfile", encoding="CP1252"));close(con) Please don't cross-post ... I am being brief because you did. On Fri, 16 Jan 2009, Gustaf Rydevik wrote:> Hi, > I ran into this issue previously and managed to solve it, but I've > forgotten how and am getting frustrated... > > I have a data frame (see below) with scandinavian characters in R > (2.7.1) running on a Win Xp-computer. I save the data frame in an > RData-file on a usb stick, and load() it in R (2.8.0) running on OS X > 10.5. Now the name of the data frame and all factor labels with > scandinavian characters are scrambled. How do I make R in OS X read my > data frame? >> From what I've managed to find in the list archives and the FAQ I either > 1) run > Sys.setlocale("LC_ALL","en_US.UTF-8") ### Doesn't change anything > or > 2) run > defaults write org.R-project.R force.LANG en_US.UTF-8 > in the terminal, which doesn't help either. > I must admit that I couldn't quite follow what documentation i found > on locales, so I might have messed up somewhere along the line. > > Many thanks in advance for your help! > > Regards, > > Gustaf > > > -------- > > L?nkarta <- > structure(list(LANKOD = structure(c(11L, 19L, 10L, 13L, 21L, > 7L, 9L, 18L, 8L, 3L, 16L, 6L, 5L, 4L, 15L, 2L, 20L, 17L, 1L, > 14L, 12L), .Label = c("AB", "AC", "BD", "C", "D", "E", "F", "G", > "H", "I", "K", "M", "N", "O", "S", "T", "U", "W", "X", "Y", "Z" > ), class = "factor"), L?n = structure(c(1L, 4L, 3L, 5L, 6L, 7L, > 8L, 2L, 9L, 10L, 20L, 21L, 13L, 14L, 15L, 16L, 17L, 18L, 12L, > 19L, 11L), .Label = c("Blekinge l?n", "Dalarnas l?n", "Gotlands l?n", > "G?vleborgs l?n", "Hallands l?n", "J?mtlands l?n", "J?nk?pings l?n", > "Kalmar l?n", "Kronobergs l?n", "Norrbottens l?n", "Sk?ne l?n", > "Stockholms l?n", "S?dermanlands l?n", "Uppsala l?n", "V?rmlands l?n", > "V?sterbottens l?n", "V?sternorrlands l?n", "V?stmanlands l?n", > "V?stra G?talands l?n", "?rebro l?n", "?sterg?tlands l?n"), class > "factor")), .Names = c("LANKOD", > "L?n"), class = "data.frame", row.names = c("0", "1", "2", "3", > "4", "5", "6", "7", "8", "9", "10", "11", "12", "13", "14", "15", > "16", "17", "18", "19", "20")) > > -- > Gustaf Rydevik, M.Sci. > tel: +46(0)703 051 451 > address:Essingetorget 40,112 66 Stockholm, SE > skype:gustaf_rydevik-- Brian D. Ripley, ripley at stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272866 (PA) Oxford OX1 3TG, UK Fax: +44 1865 272595