Displaying 10 results from an estimated 10 matches for "utf8towcs".
2012 Jul 05
1
Invalid input in 'utf8towcs' when saving script file
Hello,
When I try to save my script file before closing the R console session I get this error.
Error: invalid input 'C:\Documents and Settings\xxxx\xxxx\datafile' in 'utf8towcs'
Does anyone know what can cause this error?
I use the RGui (R verison 2.14.0) in Windows and the problem appears when I try to re-save the script file. Using Save as and rename it works.
Kind regards,
Marine Andersson
2011 May 21
1
DocumentTermMatrix error
Hi all,
I have tried to create a DocumentTermMatrix with a tm package, but i get this error :
Error in tolower(txt) :
invalid input 'PROD Z LAHKO GNETNO MELJNO GLINO, ... in 'utf8towcs'
I tried doing this as it is showed in :
http://www.r-project.org/doc/Rnews/Rnews_2008-2.pdf (An Introduction to Text Mining),
with this R code :
setwd("C:/Users/mpavlic/Desktop/temp")
tekst <- Corpus(DirSource("."))
>Warning message:
>In readLines(y...
2012 May 31
1
possible bug in "R Editor"
...ear all,
I clicked "File-New Script" to open a R Editor, typed some commands in
it and then saved it to a file. If the location where I tried to save
the script contained Chinese Character, R Editor complained,
Error: invalid input 'E:\Some.Chinese.Characters\new_file.R' in 'utf8towcs'
> sessionInfo()
R version 2.15.0 (2012-03-30)
Platform: i386-pc-mingw32/i386 (32-bit)
locale:
[1] LC_COLLATE=Chinese (Simplified)_People's Republic of China.936
[2] LC_CTYPE=Chinese (Simplified)_People's Republic of China.936
[3] LC_MONETARY=Chinese (Simplified)_People's Repu...
2008 May 01
1
Locale problem with umlauts in factor levels in 2.7.0 (patched) from grid or lattice
...locate who's at fault
Dieter
library(lattice)
dt = data.frame(x=rnorm(100),y=1:100,levs= as.factor(c("Gru","Gr?")))
stripplot(x ~ y|levs, data = dt)
#Error in grid.Call.graphics("L_text", as.graphicsAnnot(x$label), x$x, :
# invalid input 'Gr?' in 'utf8towcs'
># Works
as.graphicsAnnot(as.factor(c("Gru","Gr?")))
R version 2.7.0 Patched (2008-04-30 r45572)
i386-pc-mingw32
locale:
LC_COLLATE=German_Germany.1252;LC_CTYPE=German_Germany.1252;
LC_MONETARY=German_Germany.1252;LC_NUMERIC=C;LC_TIME=German_Germany.1252
attached...
2009 Nov 01
4
convert list to Dataframe
...???????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????*????????????????????????????????????????????????????????????????????????????????????;???????????????;China;Zhejiang;;;28.695035;119.751054'
in 'utf8towcs'
>
Can anyone suggest what I can do?
P.S. Actually, I would love to remove all the non-English tweets but I have
no clue about how to do that.
--
View this message in context: http://old.nabble.com/convert-list-to-Dataframe-tp26148889p26148889.html
Sent from the R help mailing list arc...
2010 Jan 22
1
Invalid input error in tm package
...edsurgj00978-0007.pdf'*
*> inspect(surgj)*
*A corpus with 2 text documents
The metadata consists of 2 tag-value pairs and a data frame
Available tags are:
create_date creator
Available variables in the data frame are:
MetaID
[[1]]
%PDF-1.3
Error: invalid input '%Åþë×' in 'utf8towcs'*
Could anybody help me to identify where I went wrong and what I need to do
to proceed further?
Thanks,
Shreyasee
[[alternative HTML version deleted]]
2011 Feb 10
2
Help using "tm" text mining package - preprocessing
...d error, and
then the Term Document Matrix command gives me a peculiar error:
> other.TDM <- TermDocumentMatrix(textd, control = list(stopwords = TRUE))
Error in tolower(txt) :
invalid input 'Valentino bag, breakfasting at West Palm Beach caf? Testa .
. . VALENTINO, in' in 'utf8towcs'
>
Is it something to do with the structure of the documents I've read in.
The "tm" documentation is *extremely* abstract, at my Neanderthal level.
Thanks to anyone who can help
--
View this message in context: http://r.789695.n4.nabble.com/Help-using-tm-text-mining-pa...
2012 Jan 08
2
cannot find package in Packages>>Install Packages
...¼Œå
>> æ £æ”¿æ²»æ–—äº‰ä¸ ä¼šä¸¢æŽ‰æ€§å‘½ï¼Œè€ å å‡ºæ ¥å Žæ›´æ˜¯ä¸€æ
>> ¡å¥½æ±‰ã€‚åŒ—é£Žè¿˜æ˜¯èˆ ä¸ å¾—*éœ¸åœ°ä½ ã€ è‚‰ã€ ä¹¦ã€
>> å¥³äººå’Œç½‘ç»œçš„ï¼Œä¸ è¿‡ç‰¢é‡Œä¸ ä¼šæ 供这些。å
>> ¦â€¦;山西,浙江;China;**Zhejiang;;;28.695035;119.**751054'
>> in 'utf8towcs'
>>
>>>
>>>
>> Can anyone suggest what I can do?
>>
>> P.S. Actually, I would love to remove all the non-English tweets but I
>> have
>> no clue about how to do that.
>>
>> --
>>
>
> David Winsemius, MD
> Heritage...
2008 Jun 06
3
Lattice: key does not accept German umlaute
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
library(lattice)
## works as expected
xyplot(1~1, key = list(text = list(c("Maenner"))))
## works as expected
xyplot(1~1, key = list(text = list(c("Maenner"))), xlab = "M\344nner")
## gives an error
xyplot(1~1, key = list(text = list(c("M\344nner"))))
Is this a bug?
TIA,
Bernd
-----BEGIN PGP
2008 Jun 06
6
Subsetting to unique values
I want to take the first row of each unique ID value from a data frame.
For instance
> ddTable <-
data.frame(Id=c(1,1,2,2),name=c("Paul","Joe","Bob","Larry"))
I want a dataset that is
Id Name
1 Paul
2 Bob
> unique(ddTable)
Will give me all 4 rows, and
> unique(ddTable$Id)
Will give me c(1,2), but not accompanied by the name column.