Displaying 20 results from an estimated 7000 matches similar to: "Get a percent variable based on group"
2010 Jun 09
question about "mean"
Hi there:
I have a question about computing the mean values of a data.frame. Take the
iris data for example: if I have a data.frame looking like the following:
Sepal.Length Sepal.Width Petal.Length Petal.Width Species
1 5.1 3.5 1.4 0.2 setosa
2 4.9 3.0 1.4
2018 Mar 23
aggregate() naming -- bug or feature
In the examples below, the first loses the name attached by foo(), the second retains names attached by bar(). Is this an intentional difference? I'd prefer that the names be retained in both cases.
foo <- function(x) { c(mean = base::mean(x)) }
bar <- function(x) { c(mean = base::mean(x), sd = stats::sd(x))}
aggregate(iris$Sepal.Length, by = list(iris$Species), FUN = foo)
2005 Sep 26
How to get the rowindices without using which?
I was wondering if it is possible to get the
row indices without using the function "which" because
I don't have a restriction criterion. Here's an example
of what I mean:
# take 10 randomly selected instances
iris[sample(1:nrow(iris), 10),]
# output
Sepal.Length Sepal.Width Petal.Length Petal.Width
76 6.6 3.0 4.4 1.4
2017 Oct 28
Cannot Compute Box's M (Three Days Trying...)
Hey Duncan,
Hard to debug? That's an understatement. Eyes bleeding....
In any case, I tried all your suggestions. To get "integer" for the final column, I had to change the code to get integers instead of strings.
double[] d1 = ((REXPVector) ((RList) tableRead).get(0)).asDoubles();
double[] d2 = ((REXPVector) ((RList) tableRead).get(1)).asDoubles();
double[] d3 = ((REXPVector)
2017 Oct 28
Cannot Compute Box's M (Three Days Trying...)
Thanks Duncan. Awesome ideas!
I think we're getting closer!
I tried what you suggested and got a possibly better error...
rConnection.assign("boxMVariable", myDf);
String resultBV = "str(boxMVariable)"; // your suggestion.
Error in format.default(nam.ob, width = max(ncn), justify = "left") : invalid 'width' argument
(No idea
2008 Aug 07
multiple tapply
Hi folk,
I tried this and it works perfectly,
but how do I obtain a single table from multiple variables?
In tapply, x is an atomic object, so this code doesn't work
Thanx and great summer holidays
2004 Aug 21
more on apply on data frame
Hi R People:
Several of you pointed out that using "tapply" on a data frame will work on
the iris data frame.
I'm still having a problem.
The iris data frame has 150 rows, 5 variables. The first 4 are numeric,
while the last is a factor, which has the Species names.
I can use tapply for 1 variable at a time:
setosa versicolor virginica
2018 Mar 23
aggregate() naming -- bug or feature
On Fri, Mar 23, 2018 at 6:43 PM, Rui Barradas <ruipbarradas at sapo.pt> wrote:
> Hello,
> Not exactly an answer but here it goes.
> If you use the formula interface the names will be retained.
Also if you pass named arguments:
aggregate(iris["Sepal.Length"], by = iris["Species"], FUN = foo)
# Species Sepal.Length
# 1 setosa 5.006
# 2
2011 Aug 16
Newbie question - struggling with boxplots
Hopefully I will not be flamed for this on the list, but I am starting out
with R and having some trouble with combining plots.
I am playing with the famous iris dataset (checking out example dataset in R
while reading through Introduction to datamining)
What I would like to do is create three graphs (combined boxplots) beside
each other for each of the three species (Setosa, Versicolour and
2017 Oct 29
Cannot Compute Box's M (Three Days Trying...)
Thanks Duncan. I can't tell you how helpful all your terrific replies have been.
I think the biggest surprise is that nobody appears to be using Java and R together like I'm trying to do. I suppose it shouldn't be a surprise, since there are no books on the subject and almost no technical documentation other than a few sites here and there.
I originally had the "int" as the
2012 Jun 11
saving sublist lda object with save.image()
Greetings R experts,
I'm having some difficulty recovering lda objects that I've saved within sublists using the save.image() function. I am running a script that exports a variety of different information as a list; included within that list is an lda object. I then take that list and create a list of such lists covering all the different replications I've run. Unfortunately I've been
2012 Aug 01
Neuralnet Error
I require some help in debugging this code
ir1 <- data.frame(ir[1:100,2:6])
ir2 <- data.frame(ifelse(ir1$Species=="setosa",1,ifelse(ir1$Species=="versicolor",0,"")))
ir3 <- data.frame(rbind(ir1[1:4],ir2))
2003 Jun 13
problem with latex of object summary reverse
I have the following problem (library Hmisc loaded,
iris data loaded, R Version 1.7.0 (2003-04-16), packages
updated, running on a linux Debian i386):
> summary(Species~Sepal.Length,method="reverse")->a
> a
Descriptive Statistics by Species
| |setosa |versicolor |virginica
2012 Oct 10
Summary using by() returns character arrays in a list
I use by() to generate summary statistics like so:
Lbys <- by(dat[Nidx], dat$LipTest, summary)
where Nidx is an index vector with names picking out the columns in the
data frame dat.
This returns a list of character arrays (see below for str() output) where
the columns are named correctly but the rownames are empty strings and the
values are strings prepended with the summary
2012 Dec 10
splitting dataset based on variable and re-combining
I have a dataset and I wish to use two different models to predict. Both models are SVM. The reason for two different models is based
on the sex of the observation. I wish to be able to make predictions and have the results be in the same order as my original dataset. To
illustrate I will use iris:
# Take Iris and create a dataframe of just two Species, setosa and versicolor, shuffle them
2006 Oct 26
distance between legend title and legend box
I've looked at the parameters available for the legend function and
cannot find a way to change the distance between the top of the box
surrounding a legend and the legend's title. I have a math expression
that raises the height of my title.
If you don't mind the nonsensical title I give to the legend for
this plot (Figure 3.20 in R Graphics):
2006 Jan 02
How to create "special" (source) file
Dear All,
I'm a Marine Biologist using R (in a most informal and applied way) for a couple of years. Actually, I've been using R to analyse the results supporting my thesis. I would like to put together (in the same file) the data (.txt files) and the scripts files (.R files) for each chapter. How can I do this?
Thanks in advance, Eduardo Esteves
2009 Jul 03
Two questions about the cloud function in the lattice package
I have two questions regarding the cloud function in the lattice package:
1) Is there a way to not print the surrounding frame (i.e. the square
surrounding the entire plot)?
2) Is there a way to italicize the text displayed with the key argument?
Some sample code:
2007 Sep 19
By() with method = spearman
I have a data set where I want the correlations between 2 variables
conditional on a student's grade level.
This code works just fine.
by(tmp[,c('mtsc07', 'DCBASmathscoreSPRING')], tmp$Grade, cor,
use='complete', method='pearson')
However, this generates an error
by(tmp[,c('mtsc07', 'DCBASmathscoreSPRING')], tmp$Grade, cor,
2012 Aug 28
don't print object attributes
Dear all
Suppose the object below:
> require(Hmisc)
> require(plyr)
> x <- dlply(iris, .(Species), describe)
How can I print the object without displaying the attributes? I
inspected ?print and ?print.default with no luck.
> x
x[, "Sepal.Length"]
n missing unique Mean .05 .10 .25 .50 .75
50 0 15 5.006 4.40 4.59