thr3ads.net - similar to: "help with "by" command"

Displaying 20 results from an estimated 40000 matches similar to: "help with "by" command"

2011 Aug 10

Clustering Large Applications..sort of

Hello all, I am using the clustering functions in R in order to work with large masses of binary time series data, however the clustering functions do not seem able to fit this size of practical problem. Library 'hclust' is good (though it may be sub par for this size of problem, thus doubly poor for this application) in that I do not want to make assumptions about the number of

data frames; matching/merging

2010 Feb 08

data frames; matching/merging

Hi all, I'm feeling a little guilty to ask this question, since I've written a solution using a rather clunky for loop that gets the job done. But I'm convinced there must be a faster (and probably more elegant) way to accomplish what I'm looking to do (perhaps using the "merge" function?). I figured somebody out there might've already figured this out: I have

bwplot superpose panel.points from another dataframe

2010 Sep 27

bwplot superpose panel.points from another dataframe

Hi everybody, using bwplot for producing panel boxplot with 3 dimensions i want to add a mark on each boxplot representing one individual (on all its dimensions) till now, i didn't succeed getting the desired solution I want as well to keep the median symbols as a line Many thanks for your help christophe here is the tested code: ######################## library(lattice) ex <-

Applying function to a TABLE and also "apply, tapply, sapply etc"

2010 Dec 15

Applying function to a TABLE and also "apply, tapply, sapply etc"

Dear R-help forum members, Suppose I have a data-frame having two variables and single data for each of them, as described below. variable_1 variable_2 10 20 I have written a function, say, 'fun' which uses input 10 and 20 and gives me desired result. fun = function(X, Y) { X + Y #( I am just giving an example of

Data frame "pivoting"

2010 May 06

Data frame "pivoting"

Dear R experts, I am trying to solve this problem, related to the possibility of changing the shape of a data frame using a "pivoting-like" function. I have a dataframe df of observations as follows: ID VALIDITY YEAR PROPERTY PROPERTY VALUE A1 2007 P1 V1 A1 2007 P2 V2 A1 2007 P3 V3 A1 2008 P1 V10 A1 2008 P2 V20 A2 2007 P5 V50 A2 2008 P6 V20 A3 2007

tapply output as a dataframe

2009 Apr 13

tapply output as a dataframe

i use tapply and by often, but i always end up banging my head against the wall with the output. is there a simpler way to convert the output of the following tapply to a dataframe or matrix than what i have here: # setup data for tapply dt = data.frame(bucket=rep(1:4,25),val=rnorm(100)) fn = function(x) { ret = c(unname(quantile(x,probs=seq(.25,.75,.25),na.rm=T)),mean(x,na.rm=T)) } a =

"tapply versus by" in function with more than 1 arguments

2008 Oct 01

"tapply versus by" in function with more than 1 arguments

Hi. I searched the list and didn't found nothing similar to this. I simplified my example like below: #I need calculate correlation (for example) between 2 columns classified by a third one at a data.frame, like below: #number of rows nr = 10 #the third column is to enforce that I need correlation on two variables only dataf =

Aggregate to find majority level of a factor

2007 May 31

Aggregate to find majority level of a factor

I want to use the aggregate function to summarize data by a factor (my field plots), but I want the summary to be the majority level of another factor. For example, given the dataframe: Plot1 big Plot1 big Plot1 small Plot2 big Plot2 small Plot2 small Plot3 small Plot3 small Plot3 small My desired result would be: Plot1 big Plot2 small Plot3 small I

Summarize by two-column factor, retaining original factors

2006 Feb 24

Summarize by two-column factor, retaining original factors

I am having trouble doing the following. I have a data.frame like this, where x and y are a variable that I want to do calculations on: Name Year x y ab 2001 15 3 ab 2001 10 2 ab 2002 12 8 ab 2003 7 10 dv 2002 10 15 dv 2002 3 2 dv 2003 1 15 Before I do all the other things I need to do with this data, I need to summarize or collapse the data by name and year. I've

problem in tapply command

2007 Apr 24

problem in tapply command

hello when I entered following command, I got NA values for some catagories. > tapply(slp_jeo2$slp,slp_jeo2$jeo,mean ) 999 Ca Cka DCy Jh JKi Kk 14.06665 NA 14.60445 NA NA NA NA NA KTa KTac Ku Kua Kus Ky Kyk ODe NA NA NA NA NA NA

Finding the maximum in a particular group in a dataframe

2011 Feb 02

Finding the maximum in a particular group in a dataframe

Hello I am trying to find a way to find the max value, for only a subset of a dataframe, depending on how the data is grouped for example, How would I find the maxmium responce, for all the GPR119a condition below: I've tried tapply > tapply(GPR119data$responce, GPR119data$GPR119a, max) Error in tapply(GPR119data$responce, GPR119data$GPR119a, max) : arguments must have same length

Runnin R prorams from a command line

2002 Sep 29

Runnin R prorams from a command line

Hello, I am a new R user, using version 1.5.1. I am attempting to run R programs from a dos command line (in win2000) and am having problems. My goal is to be able to use R from batch scripts in both windows and also in Linux eventually later. When I first ran "rcmd BATCH --help" it said that perl was not found. after installing perl and running again, I received the following error:

Sample Docker images for Asterisk available

2015 Apr 23

Sample Docker images for Asterisk available

Hello all, I created a set of Docker images running Asterisk and exposing AMI / ARI ports that i found to be quite useful for ARI / AMI development and regression. As they are based on Docker with whaleware, adding new configuration files to roll your own dialplan / queues / voicemail etc is pretty easy. And you can run quite a lot on the same box to simulate clusters. There is no SIP / RTP

by() processing on a dataframe

2005 Sep 30

by() processing on a dataframe

I want to calculate a statistic on a number of subgroups of a dataframe, then put the results into a dataframe. (What SAS PROC MEANS does, I think, though it's been years since I used it.) This is possible using by(), but it seems cumbersome and fragile. Is there a more straightforward way than this? Here's a simple example showing my current strategy: > dataset <-

self-written function

2005 Jan 27

self-written function

Dear all, I?ve got a simple self-written function to calculate the mean + s.e. from arcsine-transformed data: backsin<-function(x,y,...){ backtransf<-list() backtransf$back<-((sin(x[x!="NA"]))^2)*100 backtransf$mback<-tapply(backtransf$back,y[x!="NA"],mean)

Change dataframe-structure

2012 Feb 13

Change dataframe-structure

Ein eingebundener Text mit undefiniertem Zeichensatz wurde abgetrennt. Name: nicht verf?gbar URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20120213/d2a5afa6/attachment.pl>

attach data from tapply to dataframe

2004 Aug 03

attach data from tapply to dataframe

I am working with a longitudinal data set in the long format. This data set has three observations per grade level per year. Here are the first 10 rows of the data frame: >tenn.dat[1:10,] year schid type grade gain se new cohort 6 2001 100005 5 4 33.1 3.5 4 3 7 2002 100005 5 4 33.9 3.9 4 2 8 2003 100005 5 4 32.3 4.2 4 1 10 2001 100005

tapply

2005 Jun 20

tapply

hi, i have another question on tapply: i have a dataset z like this: 5540 389100307391 2600 5541 389100307391 2600 5542 389100307391 2600 5543 389100307391 2600 5544 389100307391 2600 5546 381300302513 NA 5547 387000307470 NA 5548 387000307470 NA 5549 387000307470 NA 5550 387000307470 NA 5551 387000307470 NA 5552 387000307470

tapply output as a dataframe

2011 Feb 03

tapply output as a dataframe

On Mon, Apr 13, 2009 at 12:41 PM, Dan Dube <ddube-at-advisen.com> wrote: > i use tapply and by often, but i always end up banging my head against > the wall with the output. The proposed solution of Dan's problem posted on R-help was: > do.call(rbind,a) When I use this 'solution' I get 'ERROR: second argument must be a list'. So head on wall continues. My

problem with FUN in Hmisc::summarize

2010 Apr 16

problem with FUN in Hmisc::summarize

Hi all, I'd like to use the Hmisc::summarize function, but it uses a function (FUN) of a single vector argument to create the statistical summaries. Consider an easy case: I'd like to compute the correlation between two variables in my dataframe, grouped according to other variables in the same dataframe. For exemple, consider the following dataframe D: V1 V2 V3 A 1 -1 A 1

similar to: help with "by" command