thr3ads.net - search: "bravegag"

Displaying 20 results from an estimated 33 matches for "bravegag".

2011 Nov 12

dev.new() within a loop

...dev.new(..) all windows come out blank whereas if I execute each file outside of a loop then I can see the plots properly. What's wrong here? Thanks in advance, Best regards, Giovanni

# given a directory name, it will iterate all files that match the given pattern
#basedir <- "/Users/bravegag/code/asl11/data/2k-r1-test-20111111_data/"
basedir <- "/Users/bravegag/code/asl11/data/nclients_2_128-20111110_data/"
pattern <- paste("logs.*cl\\-.*mw\\-.*db\\-.*\\-client\\.dat",sep="")
all_files <- dir(path=basedir, pattern=pattern)
throughput <- N...

2011 Oct 22

issue loading doBy library

Hello, How can I fix this? I have the latest version of R 2.13.2 and I use Mac OS X 10.7.2

> library(doBy)
Loading required package: lme4
Error in dyn.load(file, DLLpath = DLLpath, ...) : function 'cholmod_l_start' not provided by package 'Matrix'
Error: package 'lme4' could not be loaded
> library(lme4)
Error in dyn.load(file, DLLpath = DLLpath, ...) : function

2011 Nov 10

2^k*r experimental design and anova

Hello, Can anyone point me to an online tutorial or book containing the easiest way to do ANOVA over the result data from a 2^k*r experiment. It is not clear to me if I can pass the raw data corresponding to each experiment or just the summarized data i.e. mean, sse, std, etc. I would like to get the: - box plot showing the effect for the different factors and levels - plot showing whether there

2010 Apr 17

interpreting acf plot

...================================
# load dataset from web
#bmwlr <- scan("http://stat.ethz.ch/Teaching/Datasets/bmw.dat")
# load dataset from file
bmwlr <- scan("/Users/bravegag/code/compstats/bmw.dat")
par(mfrow=c(1,2)) # visualize two plots
acf(bmwlr, lag.max = 10)
acf(bmwlr^2, lag.max = 10)
[[alternative HTML version deleted]]

2011 Oct 23

summarizing a data frame i.e. count -> group by

Hello, This is one problem at a time :) I have a data frame df that looks like this:

  time partitioning_mode workload runtime
1    1          sharding    query     607
2    1          sharding    query      85
3    1          sharding    query      52
4    1          sharding    query      79
5    1          sharding    query      77
6    1          sharding    query      67
7    1
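The message is cut off before it states the desired output, so the following is only a sketch under the assumption that the goal is a row count per combination of `time`, `partitioning_mode` and `workload`. It uses base R's `aggregate` on the six complete rows shown above; the `count` column name is hypothetical.

```r
# The six complete rows from the excerpt above.
df <- data.frame(
  time              = rep(1L, 6),
  partitioning_mode = rep("sharding", 6),
  workload          = rep("query", 6),
  runtime           = c(607L, 85L, 52L, 79L, 77L, 67L)
)

# Count rows per group: FUN = length counts the runtime values in each group.
counts <- aggregate(runtime ~ time + partitioning_mode + workload,
                    data = df, FUN = length)

# The aggregated column keeps the name "runtime"; rename it for clarity.
names(counts)[names(counts) == "runtime"] <- "count"
counts
```

With real data, each distinct combination of the three grouping columns yields one row in `counts`.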

2011 Dec 26

glm predict issue

Hello, I have tried reading the documentation and googling for the answer but reviewing the online matches I end up more confused than before. My problem is apparently simple. I fit a glm model (2^k experiment), and then I would like to predict the response variable (Throughput) for unseen factor levels. When I try to predict I get the following error: > throughput.pred <-

2010 May 01

ggplot2's geom_errorbar legend

Hello, I create a simple ggplot that only shows a straight line. I then add three datasets of CIs using the geom_errorbar function. The problem is that I can't find any way to get the legend to show up ... I need to show what each color of the CIs corresponds to, i.e. which method. Can anyone advise please? TIA, Best regards, Giovanni

2011 Dec 07

data frame and cumulative sum

Hello, I have a data frame that looks like this (containing interarrival times):

> str(df)
'data.frame': 18233 obs. of 1 variable:
 $ Interarrival: int 135 806 117 4 14 1 9 104 169 0 ...
> head(df)
  Interarrival
1          135
2          806
3          117
4            4
5           14
6            1
>

This corresponds to the time differences (in ms) of a Poisson arrival
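The message is truncated here, so the following is a minimal sketch that assumes the goal is to convert the interarrival times into absolute arrival times. Base R's `cumsum` does this directly. The sample values are the first six from the excerpt above, and the `Arrival` column name is hypothetical.

```r
# First six interarrival times (ms) from the excerpt above.
df <- data.frame(Interarrival = c(135L, 806L, 117L, 4L, 14L, 1L))

# Running total: each arrival time is the sum of all gaps up to that row.
df$Arrival <- cumsum(df$Interarrival)
df
```

The last value of `Arrival` is the total elapsed time covered by the sample.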

2011 Nov 23

R-latex syntax highlighting?

Hello, Can anyone provide or point me to a good setup for the listings latex package that would produce nice R-syntax highlighting? I am using an example I found on the internet for setting up listings like this:

\lstset{
  language=R,
  basicstyle=\scriptsize\ttfamily,
  commentstyle=\ttfamily\color{gray},
  numbers=left,
  numberstyle=\ttfamily\color{red}\footnotesize,
  stepnumber=1,
  numbersep=5pt,

2011 Oct 24

binning runtimes

Hello, Suppose I have the dataset shown below. The number of observations is too large to get a nice geom_point and smoother on top. What I would like to do is to bin the data first. The data is indexed by Time (minutes from 1 to 120, i.e. two hours of system benchmarking). Option 1) group the data by Time, i.e. minute 1, minute 2, etc., and within each group create bins of N consecutive

2012 Aug 27

simplest way (set of functions) to parse a file

Hello, What would be the best set of R functions to parse and transform a file? My file looks as shown below. I would like to plot this data and I need to parse it into a single data frame that sort of "transposes the data" with the following structure:

> df <- data.frame(n=c(1,1,2,2),iter=c(1,2,1,2),step=as.factor(c('Step 1', 'Step2', 'Step 1',

2011 Nov 14

2^k*r (with replications) experimental design question

Hello, I have one replication (r=1 of the 2^k*r) of a 2^k experimental design in the context of performance analysis i.e. my response variables are Throughput and Response Time. I use the "aov" function and the results look ok:

> str(throughput)
'data.frame': 286 obs. of 7 variables:
 $ Time : int 6 7 8 9 10 11 12 13 14 15 ...
 $ Throughput : int 42 44 33 41

2011 Oct 23

unfold list (variable number of columns) into a data frame

Hello, I used R a lot one year ago and now I am a bit rusty :) I have my raw data which correspond to the list of runtimes per minute (minute "1" "2" "3" in two database modes "sharding" and "query" and two workload types "query" and "refresh") and as a list of char arrays that looks like this: > str(data) List of 122 $ :

2011 Oct 23

code review: is it too much to ask?

...# use doBy library
# =========================================================================================
# ETL Step
# =========================================================================================
data_file <- file("/Users/bravegag/code/asl11/trunk/report/experiment.dat")
df <- read.table(data_file) # reads the data as data frame
class(df) # show the class to be 'list'
names(df)...

2012 Sep 09

use subset to trim data but include last per category

Hello, I bumped into the following funny use-case. I have too much data for a given plot. I have the following data frame df:

> str(df)
'data.frame': 5015 obs. of 5 variables:
 $ n : Factor w/ 5 levels "1000","2000",..: 1 1 1 1 1 1 1 1 1 1 ...
 $ iter : int 10 20 30 40 50 60 70 80 90 100 ...
 $ Error : num 1.05e-02 1.24e-03 3.67e-04 1.08e-04

2010 Apr 30

plotting multiple CIs

Hello, I need to plot multiple confidence intervals for the same model parameter e.g. so for the same value of the parameter in point x_1 I would like to see four different confidence intervals so that I can compare the accuracy e.g. boot basic vs normal vs my own vs classic lm CI etc. I like very very much the plotCI implemented here: http://cran.r-project.org/web/packages/plotrix/index.html

2010 Aug 24

mlm for within subject design

...; ls() # presto Only because a=1 and a is the first item in the list! Not because you are doing '-a'! If a is 0 then nothing gets deleted, and if a isn't numeric vector then it just fails. If you want to do it by name, use match.... Barry --Forwarded Message Attachment-- From: bravegag at gmail.com To: r-help at r-project.org Date: Tue, 24 Aug 2010 11:08:51 +0200 Subject: [R] update and rebuild all? Hello, I upgraded my Mac R version to the newest 2.11.1, then I ran the option to update all packages but there was an error related to fetching one of those and the process stoppe...

2010 May 01

closest match in R to c-like struct?

Hello, What would be in R the closest match to a c-struct? e.g. data.frame requires all elements to be of the same length ... or is there a way to circumvent this? TIA, Best regards, Giovanni
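One common answer, sketched here as an assumption rather than as the reply given in the thread: a plain R `list` holds named elements of different types and lengths, which makes it the closest base-R analogue of a C struct. The field names and values below are hypothetical.

```r
# A list groups named fields of different types and lengths,
# unlike a data.frame, whose columns must all have the same length.
fit <- list(
  method = "Classic",                      # character scalar
  coef   = c(1.5, -0.3, 2.0),              # numeric vector of length 3
  ci     = matrix(0, nrow = 3, ncol = 2)   # 3 x 2 numeric matrix
)

fit$method            # read a field by name, much like s.method in C
fit$coef[2] <- -0.4   # modify a single element of one field
length(fit$coef)      # each field keeps its own length
```

Unlike a C struct, a list has no fixed layout: fields can be added or removed at any time, for example with `fit$extra <- TRUE`.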

2010 May 01

bootstrap generalization error

Hello, I use the following function "bootstrapge" to calculate (and compare) the generalization error of several bootstrap implementations:

##
## Calculates and returns a coefficient corresponding to the generalization
## error. The formula for the bootstrap generalization error is:
## $N^{-1}\sum_{i=1}^n B^{-1}\sum_{j=1}^B |y_i - (\beta_n^{*j})^T x|$
##
## x - mxn matrix where m is

2010 May 01

cbind and automatic type conversion

Hello, I have three method types and 100 generalization errors for each, all in the range [0.65,0.81]. I would like to make a stacked histogram plot using ggplot2 with this data ... Therefore I need a data frame of the form e.g.

Method      GE
----------  ------
"Classic"   0.76
"Classic"   0.79
"Own
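The message is cut off, so the following is a sketch under the assumption suggested by the subject line: `cbind` on a character vector and a numeric vector returns a character matrix, which turns the errors into strings. Building the table with `data.frame` keeps each column's own type. The first two GE values come from the excerpt above; the other two are made-up illustrative values.

```r
method <- rep(c("Classic", "Own"), each = 2)
ge     <- c(0.76, 0.79, 0.70, 0.72)  # last two values are illustrative only

# cbind builds a matrix, and a matrix has a single type,
# so the numeric errors are coerced to character.
m <- cbind(Method = method, GE = ge)
is.character(m)

# data.frame stores each column separately, so GE stays numeric.
d <- data.frame(Method = method, GE = ge)
is.numeric(d$GE)
```

A data frame in this long form (one row per method and error value) is also the layout that ggplot2 expects for a histogram filled by method.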
