thr3ads.net - search: "jtor14"

Displaying 19 results from an estimated 19 matches for "jtor14".

rle on large data . . . without a for loop!

2011 Jun 17

rle on large data . . . without a for loop!

I think need to do something like this: dat<-data.frame(state=sample(id=rep(1:5,each=200),1:3, 1000, replace=T,prob=c(0.7,0.05,0.25)),V1=runif(1,10,1000),V2=rnorm(1000)) rle.dat<-rle(dat$state) temp<-1 out<-data.frame(id=1:length(rle.dat$length)) for(i in 1:length(rle.dat$length)){ temp2<-temp+rle.dat$length[[i]] out$V1[i]<-mean(dat$V1[temp:temp2])

count value changes in a column

2011 May 31

count value changes in a column

is there a way to look for value changes in a column? set.seed(144) df<-data.frame(state=sample(rep(1:5,200),1000)) any of the five states are acceptable. however if, for example, states 4 or 5 follow state 3, i want to overwrite them with 3. changes from 1 to any value and 2 to any value are acceptable as are changes from any value to 1 or 2. By way of an example: the sequence 1 3 3 5 5 3

string interpolation

2011 Mar 21

string interpolation

Is there a way to do this in R? I have data in the form: 57_input 57_output 58_input 58_output etc. can i use a for loop (i in 57:n) that plots only the outputs? I want this to be robust so im not specifying a column id but rather something like c++ code, %s_input, i is that doable in R? Thanks, justin

MASS fitdistr with plyr or data.table?

2011 Apr 27

MASS fitdistr with plyr or data.table?

I am trying to extract the shape and scale parameters of a wind speed distribution for different sites. I can do this in a clunky way, but I was hoping to find a way using data.table or plyr. However, when I try I am met with the following: set.seed(144) weib.dist<-rweibull(10000,shape=3,scale=8) weib.test<-data.table(cbind(1:10,weib.dist))

binary conversion list to data.frame with plyr... AND NO LOOPS!

2011 Jul 08

binary conversion list to data.frame with plyr... AND NO LOOPS!

Happy weekend helpeRs! As usual, I'm stumped by R... My plan was to take an integer number, convert it to binary and wind up with a data.frame where each column is either 1 or 0 so I can see which bits are changing: bb<-function(i) ifelse(i, paste(bb(i %/% 2), i %% 2, sep=""), "") my.dat<-c(36,40,10,4) my.binary.dat<-bb(my.dat)

system command to a specific shell (bash)

2012 Apr 16

system command to a specific shell (bash)

I need to run a bash command, but when you call system() the default shell is sh (see my sessionInfo below). I found the shell command ( http://www.stat.ucl.ac.be/ISdidactique/Rhelp/library/base/html/shell.html) but it seems to be disappeared in current versions of R? I am running all this from R CMD BATCH with "system" calls to other R scripts. For a little more info, I'm

lubridate and intervals

2011 Aug 30

lubridate and intervals

Hiya, maybe there is a native R function for this and if so please let me know! I have 2 data.frames with start and end dates, they read in as strings and I am converting to POSIXct. How can I check for overlap? The end result ideally will be a single data.frame containing all the columns of the other two with rows where there were date overlaps.

preceding X. and X

2011 Oct 27

preceding X. and X

Hello, Why do I get preceding "X." (that is a and X followed by a period) for negative numbers and an "X" for positive numbers when I read a csv file? Am I stuck with this? If so, how do I convert it to normal numbers? dat=read.csv(file_path) > dat [1] X0.0 X.0.240432350374 X0.355468069625 X.0.211469972378 X1.1812797415 X.0.227975150826

ddply with mean and max...

2011 May 11

ddply with mean and max...

I'm trying to use ddply to compute summary statistics for many variables splitting on the variable site. however, it seems to work fine for mean() but if i use max() or min() things fall apart. whats going on? test.set<-data.frame(site=1:10,x=.Random.seed[1:100],y=rnorm(100)) means<-ddply(test.set,.(site),mean) means site x y 1 1 -97459496 -0.14826303 2

plot 3 lines with ggplot2

2011 Sep 12

plot 3 lines with ggplot2

Hi, I am trying to learn to use ggplot2 for what I had hoped would be a fairly simple task. I have a relatively small data.frame (100 by 4). The first column contains symbols. The 2nd, 3rd and 4th columns represent percentage weightings for each symbol using 3 different methodologies. For example: sym <- make.unique(replicate(100, paste(sample(LETTERS, 3, replace = TRUE), collapse =

ggplot in a function confusion!

2011 Aug 15

ggplot in a function confusion!

Whats going on here? df<-data.frame(x=1:10,y=1:10) ggplot()+geom_point(data=df,aes(x=x,y=y)) ## this is the normal usage right? ggplot()+geom_point(data=df,aes(x=df[,1],y=df[,2])) ## but I can also feed it column indices ggplot()+geom_point(aes(x=df[,'x'],y=df[,'y'])) ## or column names. ## but if i wrap it in a function... plot.func.one<-function(dff,x.var,y.var){

Sequential Naming of ggplot .pngs using plyr

2011 Aug 10

Sequential Naming of ggplot .pngs using plyr

If I have data: dat<-data.frame(a=rnorm(20),b=rnorm(20),c=rnorm(20),d=rnorm(20),site=rep(letters[5:8],each=5)) And want to plot like this: ctr<-1 for(i in c('a','b','c','d')){ png(file=paste('/tmp/plot_number_',ctr,'.png',sep=''),height=8.5, width=11,units='in',pointsize=9,res=300) print(ggplot(dat[,names(dat) %in%

linear regression in a data.frame using recast

2011 Mar 16

linear regression in a data.frame using recast

I have a very large dataset with columns of id number, actual value, predicted value. This used to be a time series but I have dropped the time component. So I now have a data.frame where the id number is repeated but each value in the actual and predicted columns are unique. I assume I need to use recast somehow but I'm at a loss... how can I perform a simple linear regression (using

MASS fitdistr call in plyr help!

2011 Apr 23

MASS fitdistr call in plyr help!

I have a set of wind speeds read at different locations. The data is a data frame with two columns: site and wind speed. I want to split the data on site and call a function to find the shape and scale parameters of a weibull distribution fit. The end result is a plot with x-axis = shape and y-axis = scale. Currently my code looks like: fit_wind_speed<-function(x){

xtable without a loop alongside a ggplot

2011 May 04

xtable without a loop alongside a ggplot

I would like to create a table of my points and identify which 'quadrant' of a plot they are in with the 'origin' at the means. the kicker is i would like to display it right next to or below a ggplot of the data. Maybe xtable isnt the right thing to use, but its the only thing i can think of. Any help is appreciated! set.seed(144) x=rnorm(100,mean=5,sd=1)

How do I break my addiction to for loops!?!?

2011 May 13

How do I break my addiction to for loops!?!?

I know I'm not supposed to use them... but they're just so easy! I have trouble defining an appropriate function for plyr or apply! data<-rnorm(144) groups1<-c('a','b','c','d') groups2<-c('aa','bb','cc','dd') machines<-1:12

ragged data.frame? using plyr

2011 Jun 03

ragged data.frame? using plyr

I have a dataset that looks like: set.seed(144) sam<-sample(1000,100) dat<-data.frame(id=letters[1:10],value=rnorm(1000),day=c(rep(1,100),rep(2,100),rep(3,100),rep(4,100),rep(5,100))) I want to "normalise" it using the following function (unless you have a better idea...): adj.values<-function(dframe){ value_mean<-mean(dframe$value) value_sd<-sd(dframe$value)

png output on a server?

2012 Jan 19

png output on a server?

I've got R running on a gentoo server that doesn't have X11 installed. Its a custom build to keep those dependencies at bay! However, some of my scripts use the base png() function and ggplot2. But, png uses X11. A google search suggests using the Cairo package, which works... but changes the fonts (specifically the size of the font). Adjusting the pointsize doesn't seem to have

mysterious warning message regarding bytecode...

2011 Nov 02

mysterious warning message regarding bytecode...

While running a long script which source()s other scripts I get the following warning: Warning message: In t(object$S[[1]]) : bytecode version mismatch; using eval I cannot replicate it if I run the sourced files line by line though... What is that error? And do I care about it? It doesn't seem to affect my output as far as I can tell. Thanks! Justin > sessionInfo() R version

search for: jtor14