similar to: aggregate() naming -- bug or feature

Displaying 20 results from an estimated 1000 matches similar to: "aggregate() naming -- bug or feature"

aggregate() naming -- bug or feature

2018 Mar 23

1

aggregate() naming -- bug or feature

On Fri, Mar 23, 2018 at 6:43 PM, Rui Barradas <ruipbarradas at sapo.pt> wrote: > Hello, > > Not exactly an answer but here it goes. > If you use the formula interface the names will be retained. Also if you pass named arguments: aggregate(iris["Sepal.Length"], by = iris["Species"], FUN = foo) # Species Sepal.Length # 1 setosa 5.006 # 2

question about "mean"

2010 Jun 09

4

question about "mean"

Hi there: I have a question about generating mean value of a data.frame. Take iris data for example, if I have a data.frame looking like the following: --------------------- Sepal.Length Sepal.Width Petal.Length Petal.Width Species 1 5.1 3.5 1.4 0.2 setosa 2 4.9 3.0 1.4 0.2

couple of how-to-do it in R questions regarding corelations and mean and SD of likert items

2018 Mar 06

0

couple of how-to-do it in R questions regarding corelations and mean and SD of likert items

Hi For first question, maybe I am completely wrong but cor(swiss[,-1], swiss[,1]) should give you what you want in one step. Second question Without an example it is hard to say but maybe aggregate is the way forward. > aggregate(iris[,1:4], list(iris$Species), function (x) c(mean=mean(x), sd=sd(x))) Group.1 Sepal.Length.mean Sepal.Length.sd Sepal.Width.mean Sepal.Width.sd 1

more on apply on data frame

2004 Aug 21

2

more on apply on data frame

Hi R People: Several of you pointed out that using "tapply" on a data frame will work on the iris data frame. I'm still having a problem. The iris data frame has 150 rows, 5 variables. The first 4 are numeric, while the last is a factor, which has the Species names. I can use tapply for 1 variable at a time: >tapply(iris[,1],iris[,5],mean) setosa versicolor virginica

couple of how-to-do it in R questions regarding corelations and mean and SD of likert items

2018 Mar 06

5

couple of how-to-do it in R questions regarding corelations and mean and SD of likert items

Dear list, I have the following how-to-do it in R, questions. Suppose I have ten independent variables, and one dependent variable. I want to find the Pearson correlation of all the IVs with the DV, but not the correlation between the IVs. What I know so far, about R, that I have to type the cor () function ten times, each time requesting for a correlation between one IV and the DV. I was

multiple tapply

2008 Aug 07

6

multiple tapply

Hi folk, I tried this and it works just perfectly tapply(iris[,1],iris[5],mean) but, how to obtain a single table from multiple variables? In tapply x is an atomic object so this code doesn't work tapply(iris[,1:4],iris[5],mean) Thanx and great summer holidays Gianandrea -- View this message in context: http://www.nabble.com/multiple-tapply-tp18868063p18868063.html Sent from the R help

saving sublist lda object with save.image()

2012 Jun 11

1

saving sublist lda object with save.image()

Greetings R experts, I'm having some difficulty recovering lda objects that I've saved within sublists using the save.image() function. I am running a script that exports a variety of different information as a list, included within that list is an lda object. I then take that list and create a list of that with all the different replications I've run. Unfortunately I've been

don't print object attributes

2012 Aug 28

1

don't print object attributes

Dear all Suppose the object below: > require(Hmisc) > require(plyr) > x <- dlply(iris, .(Species), describe) How can I print the object without displaying the attributes? I inspected ?print and ?print.default with no luck. > x $setosa x[, "Sepal.Length"] n missing unique Mean .05 .10 .25 .50 .75 50 0 15 5.006 4.40 4.59

Get a percent variable based on group

2013 Jan 16

4

Get a percent variable based on group

Dear all, I'd like to get a percentage variable based on a group, but without creating a new data frame. For example: data(iris) iris$percent <-unlist(tapply(iris$Sepal.Length,iris$Species,function(x) x/sum(x, na.rm=TRUE))) This does not work, I should have only three standard values, respectively for setosa, versicolor, and virginica. How can I do this? MANY THANKS, Karine

means by column after split

2001 Feb 19

1

means by column after split

Colleagues ---------------------------------- System info: R version rw1020 on NT ESS using emacs ver. 20.4 ---------------------------------- I need to get the means for each column of a dataframe in the list created by splitting a data frame. At present, I am getting the mean of all columns in aggregate. The structure of the unsplit data is: > shuttle.tr1[1:10,] juliandate

Applying function to a TABLE and also "apply, tapply, sapply etc"

2010 Dec 15

3

Applying function to a TABLE and also "apply, tapply, sapply etc"

Dear R-help forum members, Suppose I have a data-frame having two variables and single data for each of them, as described below. variable_1 variable_2 10 20 I have written a function, say, 'fun' which uses input 10 and 20 and gives me desired result. fun = function(X, Y) { X + Y #( I am just giving an example of

New R package sqldf

2007 Aug 01

1

New R package sqldf

sqldf is an R package for running SQL select statements on one or more R data frames. It is optimized for convenience making it useful for ad hoc queries against R data frames. Given an SQL select statement whose tables are the names of R data frames it: - sets up the database (by default it transparently sets up an in memory SQLite database using RSQLite; however, MySQL via RMySQL, can be

New R package sqldf

2007 Aug 01

1

New R package sqldf

sqldf is an R package for running SQL select statements on one or more R data frames. It is optimized for convenience making it useful for ad hoc queries against R data frames. Given an SQL select statement whose tables are the names of R data frames it: - sets up the database (by default it transparently sets up an in memory SQLite database using RSQLite; however, MySQL via RMySQL, can be

Cannot Compute Box's M (Three Days Trying...)

2017 Oct 27

4

Cannot Compute Box's M (Three Days Trying...)

It can't be this hard, right? I really need a shove in the right direction here. Been spinning wheels for three days. Cannot get past the errors. I'm doing something wrong, obviously, since I can easily compute the Box's M right there in RStudio But I don't see what is wrong below with the coding equivalent. The entire code snippet is below. The code fails below on the call to

2017 Oct 29

3

Renjin?

Hi All, OK, in the "back to the drawing board" department, I found what looks like a much better solution to using R in Java. Renjin. Looking at the docs and then trying a quick example, didn't quite work. Of course I'm missing something. Although I'm telling the engine to require ("biotools") just like I would in R itself, when I get to the line of code that

Cannot Compute Box's M (Three Days Trying...)

2017 Oct 28

2

Cannot Compute Box's M (Three Days Trying...)

I'm not sure what you mean. Could you please be more specific? If I print the string, I get: boxM(boxMVariable[, -5], boxMVariable[, 5]) From this code: . . . // assign the data to a variable.rConnection.assign("boxMVariable", myDf); // create a string command with that variable name.String boxVariable = "boxM(boxMVariable[, -5], boxMVariable[, 5])";

Cannot Compute Box's M (Three Days Trying...)

2017 Oct 28

2

Cannot Compute Box's M (Three Days Trying...)

Thanks Duncan. Awesome ideas! I think we're getting closer! I tried what you suggested and got a possibly better error... . . . rConnection.assign("boxMVariable", myDf); String resultBV = "str(boxMVariable)"; // your suggestion. RESULTING ERROR: Error in format.default(nam.ob, width = max(ncn), justify = "left") : invalid 'width' argument (No idea

Cannot Compute Box's M (Three Days Trying...)

2017 Oct 28

2

Cannot Compute Box's M (Three Days Trying...)

Hey Duncan, Hard to debug? That's an understatement. Eyes bleeding.... In any case, I tried all your suggestions. To get "integer" for the final column, I had to change the code to get integers instead of strings. double[] d1 = ((REXPVector) ((RList) tableRead).get(0)).asDoubles(); double[] d2 = ((REXPVector) ((RList) tableRead).get(1)).asDoubles(); double[] d3 = ((REXPVector)

Cannot Compute Box's M (Three Days Trying...)

2017 Oct 29

2

Cannot Compute Box's M (Three Days Trying...)

Thanks Duncan. I can't tell you how helpful all your terrific replies have been. I think the biggest surprise is that nobody appears to be using Java and R together like I"m trying to do. I suppose it should be a surprise since there are no books on the subject and almost no technical documentation other than a few sites here and there. ----- I originally had the "int" as the

an unsophisticated question about recoding in a data frame with control structure if {}

2008 Oct 01

2

an unsophisticated question about recoding in a data frame with control structure if {}

Hello all, I apologize for a terribly simple question. I'm used to using Stata and am trying to `switch' over to R. I would like to recode a vector in a data frame when the value of it meets the following condition: if (dataframe$factor=='3'){dataframe$thevector<-(an arithmetic mean). What I would like to result is the creation of a new variable within the data frame for