search for: utf8towcs

Displaying 10 results from an estimated 10 matches for "utf8towcs".

2012 Jul 05
1
Invalid input in 'utf8towcs' when saving script file
Hello, When I try to save my script file before closing the R console session I get this error. Error: invalid input 'C:\Documents and Settings\xxxx\xxxx\datafile' in 'utf8towcs' Does anyone know what can cause this error? I use the RGui (R version 2.14.0) in Windows and the problem appears when I try to re-save the script file. Using Save As and renaming the file works. Kind regards, Marine Andersson
2011 May 21
1
DocumentTermMatrix error
Hi all, I have tried to create a DocumentTermMatrix with the tm package, but I get this error: Error in tolower(txt) : invalid input 'PROD Z LAHKO GNETNO MELJNO GLINO, ... in 'utf8towcs' I tried doing this as shown in http://www.r-project.org/doc/Rnews/Rnews_2008-2.pdf (An Introduction to Text Mining), with this R code: setwd("C:/Users/mpavlic/Desktop/temp") tekst <- Corpus(DirSource(".")) >Warning message: >In readLines(y...
2012 May 31
1
possible bug in "R Editor"
...ear all, I clicked "File-New Script" to open an R Editor, typed some commands in it and then saved it to a file. If the location where I tried to save the script contained Chinese characters, R Editor complained, Error: invalid input 'E:\Some.Chinese.Characters\new_file.R' in 'utf8towcs' > sessionInfo() R version 2.15.0 (2012-03-30) Platform: i386-pc-mingw32/i386 (32-bit) locale: [1] LC_COLLATE=Chinese (Simplified)_People's Republic of China.936 [2] LC_CTYPE=Chinese (Simplified)_People's Republic of China.936 [3] LC_MONETARY=Chinese (Simplified)_People's Repu...
2008 May 01
1
Locale problem with umlauts in factor levels in 2.7.0 (patched) from grid or lattice
...locate who's at fault Dieter library(lattice) dt = data.frame(x=rnorm(100),y=1:100,levs= as.factor(c("Gru","Gr?"))) stripplot(x ~ y|levs, data = dt) #Error in grid.Call.graphics("L_text", as.graphicsAnnot(x$label), x$x, : # invalid input 'Gr?' in 'utf8towcs' ># Works as.graphicsAnnot(as.factor(c("Gru","Gr?"))) R version 2.7.0 Patched (2008-04-30 r45572) i386-pc-mingw32 locale: LC_COLLATE=German_Germany.1252;LC_CTYPE=German_Germany.1252; LC_MONETARY=German_Germany.1252;LC_NUMERIC=C;LC_TIME=German_Germany.1252 attached...
2009 Nov 01
4
convert list to Dataframe
...;China;Zhejiang;;;28.695035;119.751054' in 'utf8towcs' > Can anyone suggest what I can do? P.S. Actually, I would love to remove all the non-English tweets but I have no clue about how to do that. -- View this message in context: http://old.nabble.com/convert-list-to-Dataframe-tp26148889p26148889.html Sent from the R help mailing list arc...
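The "remove all the non-English tweets" request in the snippet above can be approximated in base R. The following is only a minimal sketch, assuming the input is UTF-8 and that "non-English" can be treated as "contains any non-ASCII character" (a rough proxy that also drops English text with accented letters). The vector `tweets` is hypothetical sample data, not taken from the thread.

```r
# Hypothetical sample data: one plain-ASCII string and two with non-ASCII characters.
tweets <- c("plain English text", "caf\u00e9 Testa", "\u5c71\u897f")

# iconv() returns NA for any element it cannot convert to the target encoding,
# so elements containing non-ASCII characters become NA here.
asAscii <- iconv(tweets, from = "UTF-8", to = "ASCII")

# Keep only the elements that converted cleanly.
asciiOnly <- tweets[!is.na(asAscii)]
print(asciiOnly)
```

If the goal is to keep the text but drop only the offending characters, `iconv()` also accepts a `sub` argument (for example `sub = ""`) instead of returning `NA`.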
2010 Jan 22
1
Invalid input error in tm package
...edsurgj00978-0007.pdf' > inspect(surgj) A corpus with 2 text documents The metadata consists of 2 tag-value pairs and a data frame Available tags are: create_date creator Available variables in the data frame are: MetaID [[1]] %PDF-1.3 Error: invalid input '%Åþë×' in 'utf8towcs' Could anybody help me to identify where I went wrong and what I need to do to proceed further? Thanks, Shreyasee
2011 Feb 10
2
Help using "tm" text mining package - preprocessing
...d error, and then the Term Document Matrix command gives me a peculiar error: > other.TDM <- TermDocumentMatrix(textd, control = list(stopwords = TRUE)) Error in tolower(txt) : invalid input 'Valentino bag, breakfasting at West Palm Beach caf? Testa . . . VALENTINO, in' in 'utf8towcs' > Is it something to do with the structure of the documents I've read in? The "tm" documentation is *extremely* abstract, at my Neanderthal level. Thanks to anyone who can help -- View this message in context: http://r.789695.n4.nabble.com/Help-using-tm-text-mining-pa...
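Errors like the `tolower(txt)` failure above typically arise when a string contains bytes that are not valid in the encoding R assumes for it. A commonly suggested workaround is to re-encode the text to UTF-8 before lower-casing. The sketch below assumes the raw bytes are Latin-1, which is a guess and not something the thread confirms. The string `x` is a hypothetical stand-in for the document text.

```r
# Hypothetical input: "\xe9" is a raw Latin-1 byte (e-acute), not valid UTF-8.
x <- "Caf\xe9 Testa"

# Re-encode, declaring the source encoding explicitly.
# ASSUMPTION: the source really is Latin-1; a wrong guess yields wrong characters.
fixed <- iconv(x, from = "latin1", to = "UTF-8")

# The re-encoded string is valid UTF-8, so case conversion no longer
# trips over invalid input.
lowered <- tolower(fixed)
print(lowered)
```

When the source encoding is unknown, `iconv(x, to = "UTF-8", sub = "byte")` is a blunter alternative that replaces unconvertible bytes with their hex codes instead of failing.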
2012 Jan 08
2
cannot find package in Packages>>Install Packages
...;山西,浙江;China;Zhejiang;;;28.695035;119.751054' >> in 'utf8towcs' >> >> Can anyone suggest what I can do? >> >> P.S. Actually, I would love to remove all the non-English tweets but I >> have >> no clue about how to do that. >> >> -- >> > > David Winsemius, MD > Heritage...
2008 Jun 06
3
Lattice: key does not accept German umlaute
library(lattice) ## works as expected xyplot(1~1, key = list(text = list(c("Maenner")))) ## works as expected xyplot(1~1, key = list(text = list(c("Maenner"))), xlab = "M\344nner") ## gives an error xyplot(1~1, key = list(text = list(c("M\344nner")))) Is this a bug? TIA, Bernd
2008 Jun 06
6
Subsetting to unique values
I want to take the first row of each unique ID value from a data frame. For instance > ddTable <- data.frame(Id=c(1,1,2,2),name=c("Paul","Joe","Bob","Larry")) I want a dataset that is Id Name 1 Paul 2 Bob > unique(ddTable) Will give me all 4 rows, and > unique(ddTable$Id) Will give me c(1,2), but not accompanied by the name column.
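One base R idiom that fits the question above is `duplicated()`, which flags every repeat of a value after its first occurrence. This is a minimal sketch, assuming "first row" means first in the data frame's current row order. The name `firstRows` is introduced here for illustration.

```r
ddTable <- data.frame(Id = c(1, 1, 2, 2),
                      name = c("Paul", "Joe", "Bob", "Larry"))

# duplicated() is FALSE for the first occurrence of each Id and TRUE for repeats,
# so negating it keeps exactly one row (the first) per Id.
firstRows <- ddTable[!duplicated(ddTable$Id), ]
print(firstRows)
```

The result keeps the original row names, so they may not be consecutive. Setting `rownames(firstRows) <- NULL` resets them if that matters.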