thr3ads.net - R help - [R] Reading a file with mixed cyrillic/latin characters [Jan 2010]

Stephan Kolassa

2010-Jan-12 11:25 UTC

[R] Reading a file with mixed cyrillic/latin characters

Dear useRs,

I am trying to read a tab-delimited Unicode text file containing both
Latin and Cyrillic characters and failing miserably. The file looks like
this (I hope it comes across right):

A	B	C
3	foo	???
5	bar	???

read.table("foo.txt",sep="\t",header=TRUE)

I am guessing that I can use the fileEncoding argument to read.table() 
to read this, but I can find no list of supported values of 
fileEncoding, and fileEncoding="Unicode" gives an error.
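
One possible approach, as a minimal sketch under assumptions the post does not confirm: text saved as "Unicode" on Windows is often UTF-16LE, and the encoding names that iconvlist() reports are generally the ones fileEncoding will accept on a given platform. The temporary file and its Cyrillic words below are hypothetical stand-ins for foo.txt, since the original third column did not survive. The session's locale must also be able to represent Cyrillic (for example a UTF-8 locale).

```r
# Encoding names known to this platform's iconv; fileEncoding values
# are generally drawn from this list (platform-dependent).
utf16_names <- grep("UTF-16|UCS-2", iconvlist(), value = TRUE)

# Hypothetical sample data standing in for foo.txt.
txt <- "A\tB\tC\n3\tfoo\t\u0434\u0430\n5\tbar\t\u043d\u0435\u0442\n"

# Convert the string to UTF-16LE bytes (no byte order mark) and write
# them out unchanged, to simulate a Windows "Unicode" text file.
bytes <- iconv(txt, from = "UTF-8", to = "UTF-16LE", toRaw = TRUE)[[1]]
tmp <- tempfile(fileext = ".txt")
writeBin(bytes, tmp)

# Declare the file's encoding so read.table() re-encodes it on input.
dat <- read.table(tmp, sep = "\t", header = TRUE,
                  fileEncoding = "UTF-16LE", stringsAsFactors = FALSE)
```

If the real file turns out to be UTF-8 rather than UTF-16LE, the same call with fileEncoding = "UTF-8" would be the thing to try instead.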

The FAQ and the FAQ for Windows don't help. I have searched both the 
list archives and RSeek and am still seeking enlightenment. I am running 
R 2.10.1 on Windows XP, sessionInfo() below.

Cheers
Stephan


R version 2.10.1 (2009-12-14)
i386-pc-mingw32

locale:
[1] LC_COLLATE=German_Germany.1252  LC_CTYPE=German_Germany.1252 
LC_MONETARY=German_Germany.1252 LC_NUMERIC=C
[5] LC_TIME=German_Germany.1252

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base
