similar to: Correct subsetting in R

Displaying 20 results from an estimated 7000 matches similar to: "Correct subsetting in R"

Correct subsetting in R

2017 Nov 01

1

It's not what I want. The first data frame has 499 observations, and the second data frame is a subset of the first one but with 375 observations. I want something that returns the ID for the training data frame.
On Wednesday, November 1, 2017 10:18 AM, Eric Berger <ericjberger at gmail.com> wrote:
matches <- merge(training, data, by = intersect(names(training), names(data)))
HTH, Eric
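The merge() suggestion in this reply can be tried on a toy example. This is a minimal sketch, not the poster's data: the data frame `full`, its columns `x` and `y`, and the ID values are hypothetical, and it assumes the larger data frame carries the ID column while the training subset does not.

```r
# Hypothetical stand-in for the larger data frame, assumed to carry the ID column.
full <- data.frame(
  ID = c("a", "b", "c", "d"),
  x  = c(1, 2, 3, 4),
  y  = c(10, 20, 30, 40),
  stringsAsFactors = FALSE
)

# Hypothetical training subset: rows taken from `full`, but without the ID column.
training <- full[c(2, 4), c("x", "y")]

# Merge on every column the two data frames share, as in the reply.
shared  <- intersect(names(training), names(full))
matches <- merge(training, full, by = shared)

# IDs of the training rows, recovered from `full`.
matches$ID
```

merge() sorts the result by the key columns, and if several rows of `full` agree on all shared columns, each training row matches all of them. The IDs are therefore recovered unambiguously only when the shared columns identify rows uniquely.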

Correct subsetting in R

2017 Nov 01

1

But then row.names() cannot give me the IDs.
On Wednesday, November 1, 2017 9:45 AM, David Wolfskill <r at catwhisker.org> wrote:
On Wed, Nov 01, 2017 at 04:13:42PM +0000, Elahe chalabi via R-help wrote:
> Hi all,
> I have two data frames, one of which does not have the column ID:
>
> > str(data)
> 'data.frame': 499 obs. of 608 variables:

count number of stop words in R

2017 Jun 12

3

define your string as whatever object you want: data <- "Mhm . Alright . There's um a young boy that's getting a cookie jar . And it he's uh in bad shape because uh the thing is falling over . And in the picture the mother is washing dishes and doesn't see it . And so is the the water is overflowing in the sink . And the dishes might get falled over if you don't fell

count number of stop words in R

2017 Jun 12

0

Thanks for your reply. I know the command data <- tm_map(data, removeWords, stopwords("english")) removes English stop words, but I don't know how to count the stop words in my string: str="Mhm . Alright . There's um a young boy that's getting a cookie jar . And it he's uh in bad shape because uh the thing is falling over . And in the picture the mother is

count number of stop words in R

2017 Jun 12

0

Defining data as you mentioned in your response causes the following error:
Error in UseMethod("tm_map", x) : no applicable method for 'tm_map' applied to an object of class "character"
I can solve this error by using Corpus(VectorSource(my string)) and then using your command, but I cannot see the number of stop words in my string!
On Monday, June 12, 2017 8:36

count number of stop words in R

2017 Jun 12

3

You can define stop words as below.
data <- tm_map(data, removeWords, stopwords("english"))
Patrick Casimir, PhD
Health Analytics, Data Science, Big Data Expert & Independent Consultant
C: 954.614.1178
________________________________
From: R-help <r-help-bounces at r-project.org> on behalf of Bert Gunter <bgunter.4567 at gmail.com>
Sent: Monday, June 12, 2017

count number of stop words in R

2017 Jun 12

0

You can use regular expressions. ?regex and/or the stringr package are good places to start. Of course, you have to define "stop words." Cheers, Bert Bert Gunter "The trouble with having an open mind is that people keep coming along and sticking things into it." -- Opus (aka Berkeley Breathed in his "Bloom County" comic strip ) On Mon, Jun 12, 2017 at 5:40

count number of stop words in R

2017 Jun 12

3

Hi all, Is there a way in R to count the number of English stop words in a string using the tm package? str="Mhm . Alright . There's um a young boy that's getting a cookie jar . And it he's uh in bad shape because uh the thing is falling over . And in the picture the mother is washing dishes and doesn't see it . And so is the the water is overflowing in the sink . And the
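One way to approach this question is to tokenize the string and count the tokens that appear in a stop-word list. The sketch below uses base R only; the helper `count_stop_words`, the four-word `stop_list`, and the sample sentence are illustrative assumptions, not part of the thread. With the tm package installed, the character vector returned by tm::stopwords("english") could be passed as the stop list instead.

```r
# Count how many tokens of `text` appear in `stop_list` (case-insensitive).
count_stop_words <- function(text, stop_list) {
  lowered <- tolower(text)
  # Extract lowercase word tokens; apostrophes are kept so "doesn't" stays one token.
  tokens <- regmatches(lowered, gregexpr("[a-z']+", lowered))[[1]]
  sum(tokens %in% tolower(stop_list))
}

# Illustrative stop list only; tm::stopwords("english") could be used here instead.
stop_list <- c("and", "the", "is", "it")

txt <- "And the mother is washing dishes and doesn't see it ."
n_stop <- count_stop_words(txt, stop_list)
n_stop
```

The count depends on the tokenization rule and on the stop list chosen, so a different list or a different treatment of contractions would give a different number.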

Random Forest tree labels

2018 Jan 04

0

Hi all, I have built a Random Forest using the caret package; however, I don't understand how the splits are labeled in the trees. My dataset contains the frequency of the words in people's speeches:
'data.frame': 499 obs. of 608 variables:
 $ alright : num 1 0 0 0 0 0 0 1 2 1 ...
 $ bad : num 1 0 0 0 0 0 0 0 0 0 ...
 $ boy : num 1 2 1 1 0 2 2 4 2 1 ...
 $ cooki : num 1 2 2 1 0 1 1 4

assign color to subsets

2016 Apr 24

1

'grepl' returns a logical vector; you have to use this to get your subset. You can use:
df_t1 <- subset(df, grepl("t1", Command))
df_t2 <- subset(df, grepl("t2", Command))
# if you want to also get a subset that has both, use
df_both <- subset(df, grepl("t1", Command) & grepl("t2", Command))
Jim Holtman
Data Munger Guru
What is

assign color to subsets

2016 Apr 24

0

Now, after this:
df_both <- subset(df, grepl("t1", Command) & grepl("t2", Command))
I use factor to apply the subset to df, but then the Command level becomes 0:
df_both$Command=factor(df_both$Command)
str(df_both)
$ Protocol : Factor w/ 0 levels:
Do you know what the reason is? Thanks for replying.
On Sunday, April 24, 2016 12:18 PM, jim

fill out a PDF form in R

2017 Jul 26

2

Hi all, I would like to get ideas about how to fill out a PDF form in R and to know whether it's possible. I could not find anything helpful on the Internet. Does anyone know a good link for that or have experience with this? Thanks for any help! Elahe

fill out a PDF form in R

2017 Jul 26

0

Hi Elahe, I have no clue, but maybe you can dump the data fields using pdftk and work with those in R. HTH Ulrik
On Wed, 26 Jul 2017 at 13:50 Elahe chalabi via R-help <r-help at r-project.org> wrote:
> Hi all,
>
> I would like to get ideas about how to fill out a PDF form in R and to
> know if it's possible or not. I could not find something helpful in
> Internet.

cross validation in random forest using rfcv function

2017 Aug 23

0

Any responses?
On Wednesday, August 23, 2017 5:50 AM, Elahe chalabi via R-help <r-help at r-project.org> wrote:
Hi all, I would like to do cross validation in random forest using rfcv function. As the documentation for this package says:
rfcv(trainx, trainy, cv.fold=5, scale="log", step=0.5, mtry=function(p) max(1, floor(sqrt(p))), recursive=FALSE, ...)
however I

cross validation in random forest using rfcv function

2017 Aug 23

1

Hi all, I would like to do cross-validation in a random forest using the rfcv function. As the documentation for this package says:
rfcv(trainx, trainy, cv.fold=5, scale="log", step=0.5, mtry=function(p) max(1, floor(sqrt(p))), recursive=FALSE, ...)
However, I don't know how to build trainx and trainy for my data set, and I could not understand the way trainx is built in the package
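In the rfcv signature quoted here, trainx holds the predictor columns and trainy holds the response, one value per row of trainx. The sketch below shows one way to build both from a single data frame. It uses the built-in iris data as a stand-in for the poster's data set, which is an assumption for illustration only, and it calls rfcv only if the randomForest package is installed.

```r
# Built-in iris data as a stand-in: four numeric predictors and one factor response.
trainx <- iris[, names(iris) != "Species"]  # predictors only
trainy <- iris$Species                      # response, one value per row of trainx

# Run the cross-validation only if the randomForest package is installed.
if (requireNamespace("randomForest", quietly = TRUE)) {
  set.seed(1)
  cv <- randomForest::rfcv(trainx, trainy, cv.fold = 5)
  # Cross-validated error rate for each number of predictors tried.
  print(cv$error.cv)
}
```

For a data frame like the one described in the thread, the same pattern would apply: drop the outcome column to get trainx and pass the outcome column itself as trainy.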

subset by multiple letters condition

2016 Apr 23

1

Thanks Jean. Does anyone know how to set these [hast1] and [hast2] as the colors of a plot?
On Friday, April 22, 2016 7:39 AM, "Adams, Jean" <jvadams at usgs.gov> wrote:
You can use the grepl() function to give you logicals for each criterion, then combine them as needed. For example:
# example version of Command
Command <- paste0("_localize_",

a replace for subset

2016 Apr 16

1

Thank you James. The problem with my type of data is that there can be many possible subsets, and therefore plots, and I want to generate them automatically; facet_wrap does not give me all the possible cases.
On Saturday, April 16, 2016 6:01 AM, James C. Whanger <james.whanger at gmail.com> wrote:
Would facet_wrap or facet_grid give you what you want?
On Sat, Apr 16, 2016 at

a replace for subset

2016 Apr 16

0

Would facet_wrap or facet_grid give you what you want?
On Sat, Apr 16, 2016 at 8:45 AM, ch.elahe via R-help <r-help at r-project.org> wrote:
> Hi,
> I have a data set (mydata), which a part of this is like the following:
>
>
> 'data.frame': 36190 obs. of 16 variables:
> $ RE : int 38 41 11 67 30 18 38 41 41 30 ...
> $ LU

overlay two histograms ggplot

2017 Dec 13

1

Hi all, How can I overlay these two histograms?
ggplot(gg, aes(gg$Alz, fill = gg$veg)) + geom_histogram(alpha = 0.2)
ggplot(tt, aes(tt$Cont, fill = tt$veg)) + geom_histogram(alpha = 0.2)
Thanks for any help! Elahe
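One common way to overlay two histograms is to stack both variables into a single long data frame and draw one histogram layer with position = "identity". The sketch below assumes that approach; the simulated `gg` and `tt` data frames are hypothetical stand-ins for the poster's data, and the fill is mapped to the source of each value rather than to the `veg` column from the question. The plot is drawn only if ggplot2 is installed.

```r
# Hypothetical stand-ins for the poster's data frames `gg` and `tt`.
set.seed(1)
gg <- data.frame(Alz  = rnorm(100, mean = 10))
tt <- data.frame(Cont = rnorm(100, mean = 12))

# Stack both measurements into one long data frame with a label for the source.
both <- data.frame(
  value = c(gg$Alz, tt$Cont),
  group = rep(c("Alz", "Cont"), times = c(nrow(gg), nrow(tt))),
  stringsAsFactors = FALSE
)

# Draw the plot only if ggplot2 is installed.
if (requireNamespace("ggplot2", quietly = TRUE)) {
  p <- ggplot2::ggplot(both, ggplot2::aes(x = value, fill = group)) +
    # position = "identity" overlays the bars instead of stacking them.
    ggplot2::geom_histogram(alpha = 0.2, position = "identity", bins = 30)
  print(p)
}
```

Referring to columns by bare name inside aes(), rather than as `gg$Alz`, lets ggplot2 look them up in the data frame passed to ggplot().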

a replace for subset

2016 Apr 16

3

Hi, I have a data set (mydata), part of which looks like the following:
'data.frame': 36190 obs. of 16 variables:
 $ RE : int 38 41 11 67 30 18 38 41 41 30 ...
 $ LU : int 4200 3330 530 4500 3000 1790 4700 3400 3640 4000 ...
 $ COUNTRY : Factor w/ 4 levels "DE","FR","JP", "FR"?
 $Light