thr3ads.net - similar to: "Summarize by two or more attributes"

Displaying 20 results from an estimated 9000 matches similar to: "Summarize by two or more attributes"

2011 Dec 01

Summarizing elements of a list

Hi everyone, I looked around the list for a while but couldn't find a solution to my problem. I am storing some results to a simulation in a list and for each element i have two separate vectors(is that what they are called, correct my vocab if necessary). See below Version1_<-list() for(i in 1:5){ Version1_[[i]]<-list(First=rnorm(1),Second=rnorm(1)) } What I want is to put all

Oh apply functions, how you confuse me

2011 Sep 01

Oh apply functions, how you confuse me

Hi guys, I have a crap load of data to parse and have enjoyed creating a script that takes this data and creates a number of useful graphics for our area. I am unable to figure out one summary though and its all cause I dont fully understand the apply family of functions. Consider the following: #Create data

Drop last numeral

2010 Jan 12

Drop last numeral

Hello all, Frustrated and i know you can help I need to drop the last numeral of each of my values in my data set. So for the following i have tried the ?substring but since i have to specify the length, but because my data are of varying lengths it doenst work so well Data<-c("1131", "1132", "1731" ,"1732" ,"1821" ,"1822",

I bet apply has a solution

2012 Feb 06

I bet apply has a solution

Hi all For the data below, I would like to return a logical value indicating differences in the data. #Create data Data..<-data.frame(a=rep(1,10),b=c(rep(1,9),2),c=c(rep(1,8),2,2)) a b c 1 1 1 1 2 1 1 1 3 1 1 1 4 1 1 1 5 1 1 1 6 1 1 1 7 1 1 1 8 1 1 1 9 1 1 2 10 1 2 2 So what I want is to return logical value telling me if all the values are the same. So the result would be a b

List elements of NULL to value

2010 Nov 29

List elements of NULL to value

Hi everyone, I am posting this because i know its easy and i cant for the life of me figure out how to do it though i have tried and through a ridiculously complex loop made it happen. I need to convert some list elements of NULL value to 0s so they mesh with my data frame properly. So for A<-list(1,NULL) returns [[1]] [1] 1 [[2]] NULL Would instead return [[1]] [1] 1 [[2]] [1] 0

proportion

2010 Sep 13

proportion

Hi , SO i have been on a role of asking simple questions lately. So much for feeling like im getting this R business. I wrote a script 2 weeks ago that utilized "proportion" to turn values in a table (from "table") into proportions to then graph. I now get an error that proportion is not a function so im confused. I ran the script a few times and im thinking maybe i had

Pie Chart in map

2010 Aug 09

Pie Chart in map

Hey R'rs, So im sick of dealing with ESRI products and am looking to stream line a process i now use GIS to do using R. I have made a lot of maps using R but have not yet seen a map that puts pie charts within the map to help represent data like the attachment. http://r.789695.n4.nabble.com/file/n2318816/template1.bmp I found Tanimura et al. work "Proportional Symbol Mapping in

Sorting a data frame by specifying a vector

2012 Oct 11

Sorting a data frame by specifying a vector

Hello all, I cannot seem to figure out this seemingly simple procedure. I want to sort a data frame by a specified character vector. So for : df.. <- data.frame(Season=rep(c("Summer","Fall","Winter","Spring"),4),Obs= runif(length(rep(c("Summer","Fall","Winter","Spring"),4)))) I want to sort the data frame

Hmisc + summarize + quantile: Why only quantiles for first variable in data frame?

2006 Apr 19

Hmisc + summarize + quantile: Why only quantiles for first variable in data frame?

Hi, I'm working on a data set that contains a couple of factors and a number of dependent variables. From all of these dependent variables I would like to calculate mean, standard deviation and quantiles. With the function FUN I get all the means and stdev that I want but quantiles are only calculated for the first of the dependent variables (column 8 in the summarize command). What do I

Hmisc summarize() with level "" in by variable

2009 Jun 13

Hmisc summarize() with level "" in by variable

I was using summarize() in a data set in which one of the levels of the by variable was "". The summary statistic was consistently off by one level and the "" level was not in the output data frame. I tried to report it as a bug, but I could not log into the Hmisc bug reporting website to do so. I searched for this in the email archives. If it's there, I failed to find

summarize: A log analysis script

2006 May 05

summarize: A log analysis script

Hi folks, I wrote a quick script to extract performance one-liners from Rails logs. (Didn''t want to futz with syslog for http://rails-analyzer.rubyforge.org/ .) Reads output from :info or :debug log levels (the defaults). Usage: # summarize < development.log # summarize < production.log Output looks like: (w/FULL_URL set false) 123.23.23.123 2006-05-05 10:59:42 | r

Help with ddply/summarize

2014 Nov 13

Help with ddply/summarize

I have a straightforward application of ddply() and summarize(): ddply(MyFrame, .(Treatment, Week), summarize, MeanValue=mean(MyVar)) This works just fine: Treatment Week MeanValue 1 MyDrug BASELINE 5.91 2 MyDrug WEEK 1 4.68 3 MyDrug WEEK 2 4.08 4 MyDrug WEEK 3 3.67 5 MyDrug WEEK 4 2.96 6 MyDrug WEEK 5 2.57 7 MyDrug

dplyr/summarize does not create a true data frame

2014 Nov 21

dplyr/summarize does not create a true data frame

I got an error when trying to extract a 1-column subset of a data frame (called "my.output") created by dplyr/summarize. The ncol() function says that my.output has 4 columns, but "my.output[4]" fails. Note that converting my.output using as.data.frame() makes for a happy ending. Is this the intended behavior of dplyr? Tx, John > library(dplyr) > # set up data frame

Hmisc::summarize with a dataframe as input?

2010 Apr 13

Hmisc::summarize with a dataframe as input?

Hi all, I'm looking for a function with the same functionalities as Hmisc::summarize but accepting a dataframe as input (not just a vector or a matrix). I'd like to compute the correlation between two variables in my dataframe, grouped according to other variables in the same dataframe. For exemple, consider the following dataframe D: V1 V2 V3 A 1 -1 A 1 1 A -1 -1

Aggregation using list with Hmisc summarize function

2006 Dec 28

Aggregation using list with Hmisc summarize function

Hi All, I'm using the Hmisc summarize function and used list instead of llist to provide the by variables. It generated an error message. Is this a bug, or do I misunderstand how Hmisc works with lists? The program below demonstrates the error message. Thanks, Bob x<-1:8 group <- c(1,1,1,1,2,2,2,2) gender<- c(1,2,1,2,1,2,1,2) mydata<-data.frame(x,group,gender)

Subsetting for unwanted values

2010 Jun 03

Subsetting for unwanted values

Hi all, I have toyed with this for too long today and in the past i used multiple lines of code to get at what i want. Consider the following: All i need to do is subset Pc to the values that do not equal Pc.X. The first attempt doesnt work because i have unequal lengths. The second attempt doesnt give me an the right answer.

Curious about nomenclature: random deviates

2004 Apr 06

Curious about nomenclature: random deviates

< Does anyone know why they're called random deviates, as opposed to random numbers?> Others will probably give you some technical reason about random numbers can be considered as random deviates from a mean (I think at least the 1875 Galton paper at http://www.mugu.com/galton/ uses similar terminology (I'm not claiming this is the earliest use - just the easiest to access at the

problem with FUN in Hmisc::summarize

2010 Apr 16

problem with FUN in Hmisc::summarize

Hi all, I'd like to use the Hmisc::summarize function, but it uses a function (FUN) of a single vector argument to create the statistical summaries. Consider an easy case: I'd like to compute the correlation between two variables in my dataframe, grouped according to other variables in the same dataframe. For exemple, consider the following dataframe D: V1 V2 V3 A 1 -1 A 1

Is there a statistics that can summarize the correlation formore than two random variables?

2004 Jul 13

Is there a statistics that can summarize the correlation formore than two random variables?

This seems more like a STATS question than an R question - asking on a list like STAT-L or ALLSTAT may result in more replies Nevertheless, it seems to me that you need to describe (and maybe decide) what you mean by 'summarize' the correlations. Certainly the mean DOES summarize them, but is it the summary you want? Maybe, maybe not. Perhaps the median? Or a trimmed mean? Do you want

By= levels with the Hmisc summarize function.

2009 Apr 24

By= levels with the Hmisc summarize function.

Hi, All I have a data frame as follows: > attach(mf) > names(mf) [1] "centre" "complex" "appl" "pool" "month" "alloc_gb" I want to summarize this as follows: agg<-summarize(alloc_gb,by=llist(centre,complex,appl,month),FUN=sum, na.rm=TRUE) That seems to run fine but there something odd about the output. The

similar to: Summarize by two or more attributes