Displaying 20 results from an estimated 8000 matches similar to: "remove Punctuation characters"
2007 Oct 21
2
Help installing Rstem package
An embedded and charset-unspecified text was scrubbed...
Name: not available
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20071021/3a2e8c5b/attachment.pl
2009 Aug 06
2
Remove all punctuation except commas
Dear all,
I want to remove all punctuation except commas from a string.
I used:
gsub("[[:punct:]]", "", string)
but I don't know how to exclude the commas ",". Would anyone kindly answer
my basic question?
--
View this message in context: http://www.nabble.com/Remove-all-punctuations-except-commas-tp24845721p24845721.html
Sent from the R help mailing list
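
One possible approach to the question above, offered as a sketch and not as the list's answer: instead of matching `[[:punct:]]`, match every character that is not alphanumeric, whitespace, or a comma. The sample `string` and the result names are made up for illustration.

```r
# Hypothetical sample input; the original post does not show its data.
string <- "Hello, world! It's (almost) done; a, b."

# Keep letters, digits, whitespace and commas; drop everything else.
kept_commas <- gsub("[^[:alnum:][:space:],]", "", string)

# Alternative using a PCRE negative lookahead:
# match any punctuation character that is not a comma.
kept_commas_pcre <- gsub("(?!,)[[:punct:]]", "", string, perl = TRUE)

print(kept_commas)
```

The first form also removes any other character outside the kept classes (for example control characters), while the lookahead form touches punctuation only.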
2008 Nov 28
3
regexp help needed
Hello,
I have a vector of dates and I would like to extract the year component
from each element (that is, all digits after the last punctuation
character):
dates <- c("28.7.08","28.7.2008","28/7/08", "28/7/2008", "28/07/2008",
"28-07-2008", "28-07-08")
the resulting vector should look like
"08" "2008"
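
A minimal sketch of one way to do this, assuming the year is always whatever follows the last punctuation character: a greedy `.*` consumes everything up to and including that last separator, and `sub()` deletes it. The name `years` is illustrative.

```r
dates <- c("28.7.08", "28.7.2008", "28/7/08", "28/7/2008", "28/07/2008",
           "28-07-2008", "28-07-08")

# ".*" is greedy, so the match ends at the LAST punctuation character;
# replacing the match with "" leaves only the trailing year digits.
years <- sub(".*[[:punct:]]", "", dates)

print(years)
```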
2016 Sep 09
2
Removing strange characters /xax
Good morning,
I am doing text analysis with Twitter and I have a problem with some
characters that I cannot manage to remove. They are strings of letters that
look like *xaexdfxdeaxoa*. I think they come from the encoding of the emojis.
I usually use more or less the following code with gsub to clean the
text, but it does not work for me
# remove rt
x = gsub("rt", "", x)
# remove at
x =
2012 Mar 18
3
Extracting numbers from a character variable of different types
Hello,
I have a file which contains a column with age, which is represented in the
two following patterns
1. "007/A" or "007/a" or "7 /a" ..... In this case A or a means year and I
would like to extract only the numeric value, e.g. 7 in the above case, if this
pattern exists in a line of the file.
2. "004/M" or "004/m" where M or m means month
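
A hedged sketch of how both patterns might be handled at once, assuming every value has the shape "digits, optional spaces, slash, unit letter". The vector `age_raw` and the result names are hypothetical, since the post does not show the actual file.

```r
# Hypothetical sample values covering the patterns described above.
age_raw <- c("007/A", "007/a", "7 /a", "004/M", "004/m")

# Capture the leading run of digits; as.integer() drops the zero padding.
age_value <- as.integer(sub("^[[:space:]]*([0-9]+).*$", "\\1", age_raw))

# Take whatever follows the slash and normalise it to upper case:
# "A" stands for years and "M" for months, as in the post.
age_unit <- toupper(sub("^.*/[[:space:]]*", "", age_raw))

print(data.frame(age_raw, age_value, age_unit))
```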
2011 Aug 23
1
Implementing a "plugin" paradigm with R methods
Dear list,
I was wondering how to best implement some sort of a "plugin" paradigm
using R methods and the dispatcher:
Say we have a function/method ('foo') that does something useful, but
that should be open for extension in ONE specific area by OTHERS using
my package. Of course they could go ahead and write a whole new 'foo'
method including the features they'd
2009 Nov 12
1
How can this code be improved?
I am running the following code on a MacBook Pro 17" Unibody early
2009 with 8GB RAM, OS X 10.5.8, R 2.10.0 Patch from Nov. 2, 2009, in
64-bit mode.
freq.stopwords <- numeric(0)
freq.nonstopwords <- numeric(0)
token.tables <- list(0)
i.ss <- c(0)
cat("Beginning at ", date(), ".\n")
for (i.d in 1:length(tokens)) {
tt <- list(0)
for (i.s in
2009 Apr 09
1
Does R support [:punct:] in regexps?
Hello, does R support [:punct:] in regular expressions? I am trying to
strip all punctuation from a vector of strings.
> x <- c("yoda-yoda","billy!")
> gsub("/[:punct:]/","",x)
[1] "yoda-yoda" "billy!"
Thanks
Dan
--
**************************************************************
Daniel Brewer, Ph.D.
Institute of Cancer
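
For reference, a short sketch of why the call above returns its input unchanged. R patterns are plain strings, so the surrounding slashes are treated as literal characters, and a POSIX class such as `[:punct:]` only has its special meaning inside a bracket expression, hence the doubled brackets. The name `stripped` is illustrative.

```r
x <- c("yoda-yoda", "billy!")

# No "/" delimiters, and the class is wrapped in an outer [ ].
stripped <- gsub("[[:punct:]]", "", x)

print(stripped)
```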
2009 Aug 13
3
Finding minimum of time subset
Dear List,
I have a data frame of data taken every few seconds. I would like to subset the data to retain only the data taken on the quarter hour, and as close to the quarter hour as possible. So far I have figured out how to subset the data to the quarter hour, but not how to keep only the minimum time for each quarter hour.
For example:
2012 Jan 27
2
tm package: handling contractions
I tried making a wordcloud of Obama's State of the Union address using
the tm package to process the text
sotu <- scan(file="c:/R/data/sotu2012.txt", what="character")
sotu <- tolower(sotu)
corp <-Corpus(VectorSource(paste(sotu, collapse=" ")))
corp <- tm_map(corp, removePunctuation)
corp <- tm_map(corp, stemDocument)
corp <- tm_map(corp,
2003 Apr 29
5
[Bug 550] Problems with .Ql in mdoc
http://bugzilla.mindrot.org/show_bug.cgi?id=550
Summary: Problems with .Ql in mdoc
Product: Portable OpenSSH
Version: -current
Platform: All
OS/Version: All
Status: NEW
Severity: minor
Priority: P2
Component: Documentation
AssignedTo: openssh-unix-dev at mindrot.org
ReportedBy: cjwatson at
2015 Nov 24
2
Converting a number in R
Good afternoon,
I am extracting a value from a web page and the thousands separator is ".",
which is the decimal separator in R. I have tried to replace it with the
empty string, in order to convert it to an R numeric, but it does not let me.
What I have tried is the following:
> a<-"17.691.700" # value that I get from the web page
> sub(".","",a)
[1] "7.691.700"
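
A sketch of what is probably going wrong here: in a regular expression "." matches any character, and `sub()` replaces only the first match, so the leading "1" is the one that gets removed. Treating the dot as a literal string and using `gsub()` removes every separator. The names `digits` and `value` are illustrative.

```r
a <- "17.691.700"

# fixed = TRUE makes "." a literal dot; gsub() replaces all occurrences.
digits <- gsub(".", "", a, fixed = TRUE)

# With the separators gone, the string converts cleanly.
value <- as.numeric(digits)

print(value)
```

Escaping the dot as `"\\."` in the pattern would work as well.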
2010 Aug 04
6
applying strsplit to a whole column
I am sorry, I'd like to split my column ("names") so that the
beginning of each string ("X..") is gone and only the rest of the text is
left.
x<-data.frame(names=c("X..aba","X..abb","X..abc","X..abd"))
x$names<-as.character(x$names)
(x)
str(x)
Can't figure out how to apply strsplit in this situation - without
using a
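
Two ways this might be done on the whole column, sketched under the assumption that every value starts with the literal prefix "X..". The first avoids `strsplit` entirely; the second uses it with a fixed separator. The result names are illustrative.

```r
x <- data.frame(names = c("X..aba", "X..abb", "X..abc", "X..abd"),
                stringsAsFactors = FALSE)

# Option 1: delete the literal prefix (the dots must be escaped in a regex).
trimmed <- sub("^X\\.\\.", "", x$names)

# Option 2: strsplit() is vectorised and returns one character vector per
# element; keep the second piece of each.
parts <- strsplit(x$names, "..", fixed = TRUE)
trimmed_split <- vapply(parts, function(p) p[2], character(1))

print(trimmed)
```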
2010 Jul 15
1
Proper use of grep
I just need to confirm something with pattern matching folks. I have a factor with the following levels in a very large data set:
> levels(all$Classical.Statistic)
[1] "" "AB;ABD" "CollapsedSteps" "CR_P" "CR_Prop;CR_P;AB"
[6] "NMK"
2012 May 31
3
How can I get this function to work?
Hello All,
Can anyone help me understand why the function below doesn't work and how I can fix it? Below are some sample data, some code that works on individual rows of the data, and my attempt to translate that code into a function. My hope is to get the function working and then to apply it to the larger data frame using ddply() from the plyr package or possibly some other approach.
2003 Jul 17
3
how to divide a string into characters? - for comparing strings that is
Hi
I am searching for a way to do something like "ABC" -> c("A","B","C"). How can this be accomplished?
I tried cut() and split(), but they do something else, it seems.
The purpose for doing this is to find the number of common (and uncommon) characters, i.e. ultimately I want something like this:
> foo("ABD","ADE")
c(2,1) # 2
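
A minimal sketch of the splitting step and of one possible reading of the comparison. `strsplit()` with an empty separator splits a string into single characters. The helpers `split_chars` and `count_common` are hypothetical names, and "uncommon" is assumed here to mean distinct characters of the first string that do not occur in the second.

```r
# Split one string into a vector of single characters.
split_chars <- function(s) strsplit(s, "", fixed = TRUE)[[1]]

# Hypothetical helper: counts distinct characters shared by both strings,
# then distinct characters of `a` that are absent from `b`.
count_common <- function(a, b) {
  ca <- unique(split_chars(a))
  cb <- unique(split_chars(b))
  c(length(intersect(ca, cb)), length(setdiff(ca, cb)))
}

print(split_chars("ABC"))
print(count_common("ABD", "ADE"))
```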
2013 Mar 13
2
patch - Some CJK codepoints are also punctuation
--
Greg.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: xapian-some-cjk-codepoints-are-also-punctuation.patch
Type: text/x-patch
Size: 1499 bytes
Desc: not available
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130313/4da8b0f9/attachment.bin>
2002 Oct 02
5
SaMBa permissions problem
Hi,
I'm having a slight problem with samba permissions.
Here goes my scenario:
Red Hat Linux 8.0 with samba
I got a samba share "public"
Users can access the share and write there what they need, but if a user
creates a directory, other users can't access it.
How can I configure things so that everyone in the group accesses
everything in the share?
2009 Feb 02
2
parsing problem
Hi all,
I am trying to parse a vector in order to calculate its minimum. The
vector has values like
1 Kontrolle
2 Placebo
3 125mg/kg
4 250mg/kg
5 500mg/kg
6 1000mg/kg
Here I tried converting it to numeric using the "as.numeric()"
function,
but I got values like
5
6
2
3
4
1
which makes 1000mg/kg the smallest one,
but I have
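
A hedged sketch of what may be happening, assuming the column is stored as a factor: `as.numeric()` on a factor returns the internal level codes, which follow the sorted order of the labels and not the doses. Converting to character and stripping the unit first gives real numbers. The names `dose` and `mg` are illustrative, and treating the two non-numeric groups as `NA` is an assumption.

```r
dose <- factor(c("Kontrolle", "Placebo", "125mg/kg", "250mg/kg",
                 "500mg/kg", "1000mg/kg"))

# Level codes, not doses: this is the misleading result.
print(as.numeric(dose))

# Strip the unit from the labels, then convert; "Kontrolle" and "Placebo"
# have no number and become NA (the coercion warning is suppressed).
mg <- suppressWarnings(
  as.numeric(sub("mg/kg", "", as.character(dose), fixed = TRUE))
)

print(min(mg, na.rm = TRUE))
```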
2008 Jan 07
1
numerical data frame
Dear All,
I have successfully imported my synteny data into R using the scan command. My results are shown below. My main problem is how to combine the column names with the data (splt): I tried cbind, but a warning message occurred. I have realized that the splt data has only 5 columns instead of 6. Please help me with this!
I want my data to be a numerical