thr3ads.net - similar to: "how to select random rows ?"

Displaying 20 results from an estimated 1000 matches similar to: "how to select random rows ?"

2003 Oct 31

print(), cat() and simple I/O in R

I am trying to produce rather mundane output of the form, e.g.,
pi, e = 3.14 2.718
The closest result I have achieved so far with print() is:
> print(c(pi, exp(1)), digits = 3)
[1] 3.14 2.72
> print(c("pi, e =", pi, exp(1)), digits = 3)
[1] "pi, e =" "3.14159265358979" "2.71828182845905"
I understand that c() promotes floats to strings and
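One possible way to get the label and the numbers on one line is to format with sprintf() and print with cat(). This is only a minimal sketch, assuming the goal is exactly the line shown in the question; the variable name `line` is made up for illustration.

```r
# sprintf() formats each number on its own, so the numbers are not
# coerced to full-precision strings the way they are inside c().
line <- sprintf("pi, e = %.2f %.3f", pi, exp(1))

# cat() writes the string without the [1] prefix or the quotes.
cat(line, "\n")
```

The format string fixes the number of decimals per value, which print(..., digits = 3) cannot do for individual elements.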

2003 Nov 12

column extraction by name ?

I have a data frame (df) with columns x, y and z, e.g.
df <- data.frame(x = sample(4), y = sample(4), z = sample(4))
I can extract column z by df$z or df[3]. I can also extract columns x, y by df[1:2] or by df[-3]. Is it possible to extract the x, y columns in a "symbolic" fashion, i.e. by an equivalent of df[-z] (which is illegal)? Or alternatively, is there an equivalent of
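Excluding a column by name can be sketched with setdiff() or a logical mask over names(df). This is an illustration under the question's own example data, not necessarily what the thread's replies suggested; `xy1` and `xy2` are hypothetical names.

```r
df <- data.frame(x = sample(4), y = sample(4), z = sample(4))

# Keep every column whose name is not "z" (list-style indexing,
# so the result stays a data frame even if one column remains).
xy1 <- df[setdiff(names(df), "z")]

# Same selection expressed as a logical mask over the column names.
xy2 <- df[!(names(df) %in% "z")]
```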

2003 Nov 14

index of max value ?

Is there a function in R that would return the index of the maximum value in a vector? E.g.
> v <- round(10*rnorm(8))
> v
[1] 6 -3 -6 15 7 9 0 -19
> max(v)
[1] 15
??? index.max(v) ??? 4
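Base R's which.max() does what the hypothetical index.max() in the question asks for. A short sketch using the vector printed above (typed in by hand here, since the original came from rnorm()):

```r
v <- c(6, -3, -6, 15, 7, 9, 0, -19)  # values copied from the question

# Index of the first maximum.
i <- which.max(v)

# Indices of all maxima, in case ties matter.
all_i <- which(v == max(v))
```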

2003 Dec 10

e1071:svm - default epsilon = 0.1 (NOT 0.5) (PR#5671)

In the e1071 package, svm's default epsilon value is set to 0.1 and not 0.5 as the documentation says. R

2003 Dec 09

problem with pls(x, y, ..., ncomp = 16): Error in inherits(x, "data.frame") : subscript out of bounds

I don't know the details of pls (in the pls.pcr package, I assume), but if you use validation="CV", that says you want to use CV to select the best number of components. Then why would you specify ncomp as well?
Andy
> From: ryszard.czerminski at pharma.novartis.com
>
> When I try to use ncomp parameter in pls procedure I get
> following error:
>
> >

2003 Oct 24

how to remove NaN columns ?

How can I remove columns with NaN entries? Here is my simple example:
> data <- read.csv("test.csv")
> xdata <- data[3:length(data)]
> xs <- lapply(xdata, function(x){(x - mean(x))/sqrt(var(x))})
> x <- data.frame(xs)
> x
           C   D          E          F
1 -0.7071068 NaN -0.7071068 -0.7071068
2  0.7071068 NaN  0.7071068  0.7071068
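One way to drop such columns is to test each column with is.nan() and index with the negated result. The sketch below assumes a small made-up data frame shaped like the output in the question; `has_nan` and `clean` are illustrative names.

```r
# Toy data frame shaped like the one printed in the question.
x <- data.frame(C = c(-0.7071068, 0.7071068),
                D = c(NaN, NaN),
                E = c(-0.7071068, 0.7071068))

# TRUE for each column that contains at least one NaN.
has_nan <- sapply(x, function(col) any(is.nan(col)))

# Keep only the NaN-free columns; drop = FALSE keeps a data frame
# even if a single column survives.
clean <- x[, !has_nan, drop = FALSE]
```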

2004 Jan 15

prcomp scale error (PR#6433)

Full_Name: Ryszard Czerminski
Version: 1.8.1
OS: GNU/Linux
Submission from: (NULL) (205.181.102.120)
prcomp(..., scale = TRUE) does not work correctly:
$ uname -a
Linux 2.4.20-28.9bigmem #1 SMP Thu Dec 18 13:27:33 EST 2003 i686 i686 i386 GNU/Linux
$ gcc --version
gcc (GCC) 3.2.2 20030222 (Red Hat Linux 3.2.2-5)
> a <- matrix(rnorm(6), nrow = 3)
> sum((scale(a %*% svd(cov(a))$u, scale

2004 Mar 02

row.names are dropped when extracting one column ?

Apparently row names are dropped when I extract a single column from a data frame. Why this behaviour?
> y <- as.matrix(df[,1:2]); length(row.names(y))
[1] 324
> y <- as.matrix(df[,1:1]); length(row.names(y))
[1] 0
Best regards, Ryszard
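The behaviour comes from `[` simplifying a single-column selection to a plain vector; passing drop = FALSE keeps the data frame and its row names. A small sketch with a hypothetical data frame (the question's df is not shown):

```r
# Hypothetical data frame with explicit row names.
df <- data.frame(a = 1:3, b = 4:6, row.names = c("r1", "r2", "r3"))

# df[, 1] is a plain vector, so there are no row names left to carry over.
y_vec <- as.matrix(df[, 1])

# drop = FALSE keeps a one-column data frame, and with it the row names.
y_df <- as.matrix(df[, 1, drop = FALSE])
```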

2003 Oct 31

strange sprintf() behaviour ?

This is quite strange behaviour, at least for an R novice like myself. Consider this:
> testf <- function() { x <-2; sprintf("%s %f", "x =", x); return(x) }
> result <- testf()
> testf <- function() { x <-2; sprintf("%s %f", "x =", x) }
> result <- testf()
> testf()
[1] "x = 2.000000"
Apparently adding return()

2004 Jun 09

how to initialize random seed properly ?

I want to start R processes on multiple processors from a single shell script, and I want all of them to have different random seeds. One way of doing this is
sleep 2 # (with 'sleep 1' I am often getting the same number)
...
set.seed(unclass(Sys.time()))
Is there a simpler way, without the need to sleep between invoking different R processes? Ryszard

2004 Feb 03

problem with read.table

Any ideas why read.table complains about an incorrect number of elements in a line while readLine/strsplit indicate that all lines have the same number of elements? R
> tbl <- read.table('tmp', header = T, sep = '\t')
Error in scan(file = file, what = what, sep = sep, quote = quote, dec = dec, : line 32 did not have 27 elements
> lines <-

2004 Jan 06

dist(x,y)

Hi Ryszard! There is a dist function in R. It's in the mva package. You can set the kind of distance that you want. Thanks, Erin mailto:hodgess at gator.uhd.edu

2004 May 27

R-1.9.0: Error in paste(ncomp, "LV's") : Argument "ncomp" is missing, with no default

Is it just my installation or a bug in 1.9.0? The same thing works fine in 1.8.1. Best regards, Ryszard
# R-1.9.0
library(pls.pcr)
nr <- 8; ndim <- 2
x <- matrix(rnorm(nr*ndim), nrow=nr)
y <- as.matrix(x[,1])
for (i in 2:ndim) y <- y + x[,i]
y <- y + rnorm(length(y))
m <- pls(x,y,validation='CV')
# Error in paste(ncomp, "LV's") : Argument

2004 Mar 11

how to pass extra parameters using call() or similar mechanism ?

I am trying to write a function that would call various methods and pass arbitrary extra parameters to them. My first attempt was to use call() as illustrated below, but apparently '...' cannot be used in such a context. How can this be achieved? Best regards, Ryszard
> myfun <- function(method, x, ...) {
+ v <- eval(call(method, x, ...))
+ }
> method =
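A common alternative to building the call by hand is do.call(), which takes the function (or its name) and a list of arguments. The sketch below reuses the question's myfun signature; the choice of "mean" and na.rm as the forwarded method and argument is only an illustration.

```r
# do.call() accepts a function name and an argument list, so the
# extra arguments captured by '...' can be forwarded via list(...).
myfun <- function(method, x, ...) {
  do.call(method, list(x, ...))
}

# Forward na.rm = TRUE to mean() through '...'.
res <- myfun("mean", c(1, 2, NA), na.rm = TRUE)
```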

2006 Apr 07

strange matrix behaviour: is there a matrix with one row?

Consider this:
> y <- matrix(1:8, ncol=2)
> is.matrix(y[-c(1,2),])
[1] TRUE
> is.matrix(y[-c(1,2,3),])
[1] FALSE
> is.matrix(y[-c(1,2,3,4),])
[1] TRUE
It seems like inconsistent behaviour:
- with 2 or more rows we have a matrix
- with 1 row we do not have a matrix and
- with 0 rows we have a matrix again
I just stumbled on this behaviour, because I had a problem with my
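The one-row case differs because `[` drops dimensions of extent one by default; drop = FALSE turns that off. A minimal sketch with the matrix from the question (the result names are made up):

```r
y <- matrix(1:8, ncol = 2)

# Default: a single remaining row is simplified to a plain vector.
one_row_default <- y[-c(1, 2, 3), ]

# drop = FALSE keeps the result as a 1 x 2 matrix.
one_row_kept <- y[-c(1, 2, 3), , drop = FALSE]
```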

2006 Nov 03

R CMD BATCH: unable to start device PNG

And on that note, here is a function that I use to get around it:
-----Original Message-----
From: r-help-bounces at stat.math.ethz.ch [mailto:r-help-bounces at stat.math.ethz.ch] On Behalf Of Jeffrey Horner
Sent: Friday, November 03, 2006 10:01 AM
To: ryszard.czerminski at novartis.com
Cc: r-help at stat.math.ethz.ch
Subject: Re: [R] R CMD BATCH: unable to start device PNG
ryszard.czerminski

2003 Oct 24

How to avoid converting "_" to "." ?

It is a minor thing, but how can I avoid converting "_" to "."? E.g. I have a data set "test.csv"
A,A_B,A_C,C,D
X,11,0,13,14
Y,21,0,23,24
and when I read it all underscores are converted to dots (:<)
> d <- read.csv("test.csv")
> d
  A A.B A.C  C  D
1 X  11   0 13 14
2 Y  21   0 23 24
Ryszard
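read.csv() passes its header through make.names() unless check.names = FALSE is given, so that argument is one way to keep the header text untouched. The sketch reads the question's data from an inline string (the `csv_text` variable is an assumption, used instead of the test.csv file).

```r
# Inline copy of the test.csv contents shown in the question.
csv_text <- "A,A_B,A_C,C,D
X,11,0,13,14
Y,21,0,23,24"

# check.names = FALSE turns off the make.names() cleanup of headers.
d <- read.csv(text = csv_text, check.names = FALSE)
names(d)
```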

2006 Jul 20

how to print table with more columns per row?

When printing a table it is broken at some point (depending on how long the associated names are); see example below. Is there a way to control the number of columns printed for a given chunk of the table? Best regards, Ryszard
> z5
        AAAAAAA BBBBBBB CCCCCCC DDDDDDD EEEEEEE FFFFFFF GGGGGGG HHHHHHH IIIIIII
AAAAAAA    1.00   -0.69   -0.54   -0.88      NA      NA      NA

2003 Nov 03

svm in e1071 package: polynomial vs linear kernel

I am trying to understand the difference between the linear and polynomial kernels:
linear: u'*v
polynomial: (gamma*u'*v + coef0)^degree
It would seem that a polynomial kernel with gamma = 1, coef0 = 0 and degree = 1 should be identical to the linear kernel; however, it gives me significantly different results for a very simple data set, with linear kernel

2004 Jul 08

Getting elements of a matrix by a vector of column indices

See if the following helps:
> m <- outer(letters[1:5], 1:4, paste, sep="")
> m
     [,1] [,2] [,3] [,4]
[1,] "a1" "a2" "a3" "a4"
[2,] "b1" "b2" "b3" "b4"
[3,] "c1" "c2" "c3" "c4"
[4,] "d1" "d2" "d3" "d4"
[5,]
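The reply is cut off here, so the following is only a guess at the kind of technique the thread title asks about: indexing a matrix with a two-column matrix of (row, column) pairs picks one element per row. The `cols` vector is a hypothetical set of column indices.

```r
m <- outer(letters[1:5], 1:4, paste, sep = "")
cols <- c(1, 2, 3, 4, 1)   # hypothetical column index for each row

# Each row of the index matrix is one (row, column) pair.
picked <- m[cbind(seq_len(nrow(m)), cols)]
```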
