thr3ads.net - search: "utf8towcs"

Displaying 10 results from an estimated 10 matches for "utf8towcs".

Invalid input in 'utf8towcs' when saving script file

2012 Jul 05

Invalid input in 'utf8towcs' when saving script file

Hello, When I try to save my script file before closing the R console session I get this error. Error: invalid input 'C:\Documents and Settings\xxxx\xxxx\datafile' in 'utf8towcs' Does anyone know what can cause this error? I use the RGui (R verison 2.14.0) in Windows and the problem appears when I try to re-save the script file. Using Save as and rename it works. Kind regards, Marine Andersson

DocumentTermMatrix error

2011 May 21

DocumentTermMatrix error

Hi all, I have tried to create a DocumentTermMatrix with a tm package, but i get this error : Error in tolower(txt) : invalid input 'PROD Z LAHKO GNETNO MELJNO GLINO, ... in 'utf8towcs' I tried doing this as it is showed in : http://www.r-project.org/doc/Rnews/Rnews_2008-2.pdf (An Introduction to Text Mining), with this R code : setwd("C:/Users/mpavlic/Desktop/temp") tekst <- Corpus(DirSource(".")) >Warning message: >In readLines(y...

possible bug in "R Editor"

2012 May 31

possible bug in "R Editor"

...ear all, I clicked "File-New Script" to open a R Editor, typed some commands in it and then saved it to a file. If the location where I tried to save the script contained Chinese Character, R Editor complained, Error: invalid input 'E:\Some.Chinese.Characters\new_file.R' in 'utf8towcs' > sessionInfo() R version 2.15.0 (2012-03-30) Platform: i386-pc-mingw32/i386 (32-bit) locale: [1] LC_COLLATE=Chinese (Simplified)_People's Republic of China.936 [2] LC_CTYPE=Chinese (Simplified)_People's Republic of China.936 [3] LC_MONETARY=Chinese (Simplified)_People's Repu...

Locale problem with umlauts in factor levels in 2.7.0 (patched) from grid or lattice

2008 May 01

Locale problem with umlauts in factor levels in 2.7.0 (patched) from grid or lattice

...locate who's at fault Dieter library(lattice) dt = data.frame(x=rnorm(100),y=1:100,levs= as.factor(c("Gru","Gr?"))) stripplot(x ~ y|levs, data = dt) #Error in grid.Call.graphics("L_text", as.graphicsAnnot(x$label), x$x, : # invalid input 'Gr?' in 'utf8towcs' ># Works as.graphicsAnnot(as.factor(c("Gru","Gr?"))) R version 2.7.0 Patched (2008-04-30 r45572) i386-pc-mingw32 locale: LC_COLLATE=German_Germany.1252;LC_CTYPE=German_Germany.1252; LC_MONETARY=German_Germany.1252;LC_NUMERIC=C;LC_TIME=German_Germany.1252 attached...

convert list to Dataframe

2009 Nov 01

convert list to Dataframe

...???????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????*????????????????????????????????????????????????????????????????????????????????????;???????????????;China;Zhejiang;;;28.695035;119.751054' in 'utf8towcs' > Can anyone suggest what I can do? P.S. Actually, I would love to remove all the non-English tweets but I have no clue about how to do that. -- View this message in context: http://old.nabble.com/convert-list-to-Dataframe-tp26148889p26148889.html Sent from the R help mailing list arc...

Invalid input error in tm package

2010 Jan 22

Invalid input error in tm package

...edsurgj00978-0007.pdf'* *> inspect(surgj)* *A corpus with 2 text documents The metadata consists of 2 tag-value pairs and a data frame Available tags are: create_date creator Available variables in the data frame are: MetaID [[1]] %PDF-1.3 Error: invalid input '%Åþë×' in 'utf8towcs'* Could anybody help me to identify where I went wrong and what I need to do to proceed further? Thanks, Shreyasee [[alternative HTML version deleted]]

Help using "tm" text mining package - preprocessing

2011 Feb 10

Help using "tm" text mining package - preprocessing

...d error, and then the Term Document Matrix command gives me a peculiar error: > other.TDM <- TermDocumentMatrix(textd, control = list(stopwords = TRUE)) Error in tolower(txt) : invalid input 'Valentino bag, breakfasting at West Palm Beach caf? Testa . . . VALENTINO, in' in 'utf8towcs' > Is it something to do with the structure of the documents I've read in. The "tm" documentation is *extremely* abstract, at my Neanderthal level. Thanks to anyone who can help -- View this message in context: http://r.789695.n4.nabble.com/Help-using-tm-text-mining-pa...

cannot find package in Packages>>Install Packages

2012 Jan 08

cannot find package in Packages>>Install Packages

...¼Œå >> æ £æ”¿æ²»æ–—äº‰ä¸ ä¼šä¸¢æŽ‰æ€§å‘½ï¼Œè€ å å‡ºæ ¥å Žæ›´æ˜¯ä¸€æ >> ¡å¥½æ±‰ã€‚åŒ—é£Žè¿˜æ˜¯èˆ ä¸ å¾—*éœ¸åœ°ä½ ã€ è‚‰ã€ ä¹¦ã€ >> å¥³äººå’Œç½‘ç»œçš„ï¼Œä¸ è¿‡ç‰¢é‡Œä¸ ä¼šæ ä¾›è¿™äº›ã€‚å >> ¦â€¦;å±±è¥¿ï¼Œæµ™æ±Ÿ;China;**Zhejiang;;;28.695035;119.**751054' >> in 'utf8towcs' >> >>> >>> >> Can anyone suggest what I can do? >> >> P.S. Actually, I would love to remove all the non-English tweets but I >> have >> no clue about how to do that. >> >> -- >> > > David Winsemius, MD > Heritage...

Lattice: key does not accept German umlaute

2008 Jun 06

Lattice: key does not accept German umlaute

-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 library(lattice) ## works as expected xyplot(1~1, key = list(text = list(c("Maenner")))) ## works as expected xyplot(1~1, key = list(text = list(c("Maenner"))), xlab = "M\344nner") ## gives an error xyplot(1~1, key = list(text = list(c("M\344nner")))) Is this a bug? TIA, Bernd -----BEGIN PGP

Subsetting to unique values

2008 Jun 06

Subsetting to unique values

I want to take the first row of each unique ID value from a data frame. For instance > ddTable <- data.frame(Id=c(1,1,2,2),name=c("Paul","Joe","Bob","Larry")) I want a dataset that is Id Name 1 Paul 2 Bob > unique(ddTable) Will give me all 4 rows, and > unique(ddTable$Id) Will give me c(1,2), but not accompanied by the name column.

search for: utf8towcs