Displaying 20 results from an estimated 7000 matches similar to: "Correct subsetting in R"
2017 Nov 01
1
Correct subsetting in R
It's not what I want, the first data frame has 499 observations and the second data frame is a subset of the first one but with 375 observations. I want something that returns the ID for training data frame
On Wednesday, November 1, 2017 10:18 AM, Eric Berger <ericjberger at gmail.com> wrote:
matches <- merge(training,data,by=intersect(names(training),names(data)))
HTH,
Eric
2017 Nov 01
1
Correct subsetting in R
But they row.names() cannot give me the IDs
On Wednesday, November 1, 2017 9:45 AM, David Wolfskill <r at catwhisker.org> wrote:
On Wed, Nov 01, 2017 at 04:13:42PM +0000, Elahe chalabi via R-help wrote:
> Hi all,
> I have two data frames that one of them does not have the column ID:
>
> > str(data)
> 'data.frame': 499 obs. of 608 variables:
2017 Jun 12
3
count number of stop words in R
define your string as whatever object you want:
data <- "Mhm . Alright . There's um a young boy that's getting a cookie jar . And it he's uh in bad shape because uh the thing is falling over . And in the picture the mother is washing dishes and doesn't see it . And so is the the water is overflowing in the sink . And the dishes might get falled over if you don't fell
2017 Jun 12
0
count number of stop words in R
Thanks for your reply. I know the command
data <- tm_map(data, removeWords, stopwords("english"))
removes English stop words, I don't know how should I count stop words of my string:
str="Mhm . Alright . There's um a young boy that's getting a cookie jar . And it he's uh in bad shape because uh the thing is falling over . And in the picture the mother is
2017 Jun 12
0
count number of stop words in R
Defining data as you mentioned in your respond causes the following error:
Error in UseMethod("tm_map", x) :
no applicable method for 'tm_map' applied to an object of class "character"
I can solve this error by using Corpus(VectorSource(my string)) and the using your command but I cannot see the number of stop words in my string!
On Monday, June 12, 2017 8:36
2017 Jun 12
3
count number of stop words in R
You can define stop words as below.
data <- tm_map(data, removeWords, stopwords("english"))
Patrick Casimir, PhD
Health Analytics, Data Science, Big Data Expert & Independent Consultant
C: 954.614.1178
________________________________
From: R-help <r-help-bounces at r-project.org> on behalf of Bert Gunter <bgunter.4567 at gmail.com>
Sent: Monday, June 12, 2017
2017 Jun 12
0
count number of stop words in R
You can use regular expressions.
?regex and/or the stringr package are good places to start. Of
course, you have to define "stop words."
Cheers,
Bert
Bert Gunter
"The trouble with having an open mind is that people keep coming along
and sticking things into it."
-- Opus (aka Berkeley Breathed in his "Bloom County" comic strip )
On Mon, Jun 12, 2017 at 5:40
2017 Jun 12
3
count number of stop words in R
Hi all,
Is there a way in R to count the number of stop words (English) of a string using tm package?
str="Mhm . Alright . There's um a young boy that's getting a cookie jar . And it he's uh in bad shape because uh the thing is falling over . And in the picture the mother is washing dishes and doesn't see it . And so is the the water is overflowing in the sink . And the
2018 Jan 04
0
Random Forest tree labels
Hi all,
I have built a Random Forest using Caret package, however, I don't understand how the splits are labeled in trees. My dataset contains the frequency of the words in the speeches of the people:
'data.frame': 499 obs. of 608 variables:
$ alright : num 1 0 0 0 0 0 0 1 2 1 ...
$ bad : num 1 0 0 0 0 0 0 0 0 0 ...
$ boy : num 1 2 1 1 0 2 2 4 2 1 ...
$ cooki : num 1 2 2 1 0 1 1 4
2016 Apr 24
1
assign color to subsets
'grepl' returns a logical vector; you have to use this to get your subset.
You can use:
df_tq <- subset(df, grepl("t1", Command))
df_t2 <- subset(df, grepl("t2", Command))
# if you want to also get a subset that has both, use
df_both <- subset(df, grepl("t1", Command) & grepl("t2", Command))
Jim Holtman
Data Munger Guru
What is
2016 Apr 24
0
assign color to subsets
now after this:
df_both <- subset(df, grepl("t1", Command) & grepl("t2", Command))
I use factor to apply the subset to df but then the Command level becomes 0
df_both$Command=factor(df_both$Command)
str(df_both)
$ Protocol : Factor w/ 0 levels:
Do you know what is the reason?
Thanks for replying
On Sunday, April 24, 2016 12:18 PM, jim
2017 Jul 26
2
fill out a PDF form in R
Hi all,
I would like to get ideas about how to fill out a PDF form in R and to know if it's possible or not. I could not find something helpful in Internet.
Does anyone know a good link for that or have experience in this?
Thanks for any help!
Elahe
2017 Jul 26
0
fill out a PDF form in R
Hi Elahe,
I have no clue, but maybe you can dump the data fields using pdftk, and
work with those in R.
HTH
Ulrik
On Wed, 26 Jul 2017 at 13:50 Elahe chalabi via R-help <r-help at r-project.org>
wrote:
> Hi all,
>
> I would like to get ideas about how to fill out a PDF form in R and to
> know if it's possible or not. I could not find something helpful in
> Internet.
2017 Aug 23
0
cross validation in random forest using rfcv functin
Any responds?!
On Wednesday, August 23, 2017 5:50 AM, Elahe chalabi via R-help <r-help at r-project.org> wrote:
Hi all,
I would like to do cross validation in random forest using rfcv function. As the documentation for this package says:
rfcv(trainx, trainy, cv.fold=5, scale="log", step=0.5, mtry=function(p) max(1, floor(sqrt(p))), recursive=FALSE, ...)
however I
2017 Aug 23
1
cross validation in random forest using rfcv functin
Hi all,
I would like to do cross validation in random forest using rfcv function. As the documentation for this package says:
rfcv(trainx, trainy, cv.fold=5, scale="log", step=0.5, mtry=function(p) max(1, floor(sqrt(p))), recursive=FALSE, ...)
however I don't know how to build trianx and trainy for my data set, and I could not understand the way trainx is built in the package
2016 Apr 23
1
subset by multiple letters condition
Thanks Jean, Does anyone know how to set these [hast1] and [hast2] as the colors of a plot?
On Friday, April 22, 2016 7:39 AM, "Adams, Jean" <jvadams at usgs.gov> wrote:
You can use the grepl() function to give you logicals for each criterion, then combine them as needed. For example:
# example version of Command
Command <- paste0("_localize_",
2016 Apr 16
1
a replace for subset
-Thank you James, well the problem of my type of data is that there can be many possible subsets and therefore plots, and I want to automatically generate them, and facet_wrap does not give me all the possible cases
On Saturday, April 16, 2016 6:01 AM, James C. Whanger <james.whanger at gmail.com> wrote:
Would facet_wrap or facet_grid give you what you want?
On Sat, Apr 16, 2016 at
2016 Apr 16
0
a replace for subset
Would facet_wrap or facet_grid give you what you want?
On Sat, Apr 16, 2016 at 8:45 AM, ch.elahe via R-help <r-help at r-project.org>
wrote:
> Hi,
> I have a data set (mydata), which a part of this is like the following:
>
>
> 'data.frame': 36190 obs. of 16 variables:
> $ RE : int 38 41 11 67 30 18 38 41 41 30 ...
> $ LU
2017 Dec 13
1
overlay two histograms ggplot
Hi all,
How can I overlay these two histograms?
ggplot(gg, aes(gg$Alz, fill = gg$veg)) + geom_histogram(alpha = 0.2)
ggplot(tt, aes(tt$Cont, fill = tt$veg)) + geom_histogram(alpha = 0.2)
thanks for any help!
Elahe
2016 Apr 16
3
a replace for subset
Hi,
I have a data set (mydata), which a part of this is like the following:
'data.frame': 36190 obs. of 16 variables:
$ RE : int 38 41 11 67 30 18 38 41 41 30 ...
$ LU : int 4200 3330 530 4500 3000 1790 4700 3400 3640 4000 ...
$ COUNTRY : Factor w/ 4 levels "DE","FR","JP", "FR"?
$Light