thr3ads.net - similar to: "cor.test in matrices"

Displaying 20 results from an estimated 10000 matches similar to: "cor.test in matrices"

2003 Apr 24

Missing Value And cor() function

Hi r lovers! I 'd like to apply the cor() function to a matrix which have some missing values As a matter of fact and quite logically indeed it doesn't work Is there a trick to replace the missing value by the mean of each variable or by any other relevant figures ? Or should I apply a special derivate of the cor() function, (I don't have any idea if it exists and have some trouble to

chi-squared test

2001 Dec 18

chi-squared test

I don't quite understand the difference between the two methods for performing a chi-squared test on contingency tables: summary(table()) and chisq.test() They may different results. E.g.: aa <- gl(2, 10) bb <- as.factor(c(1,2,2,2,1,2,1,2,2,2,1,2,2,2,1,1,1,2,1,1)) aa <- c(aa, aa) bb <- c(bb, bb) table(aa, bb) summary(table(aa, bb)) chisq.test(aa, bb) Could somebody give me

glm.nb, anova.negbin

2001 Sep 25

glm.nb, anova.negbin

Dear R-collegues, I'm getting an error message (Error in round) when summarising a glm.nb model, and when using anova.negbin (in R 1.3.1 for windows): > m.nb <- glm.nb(tax ~ areal) > m.bn Call: glm.nb(formula = tax ~ areal, init.theta = 5.08829537115498, link = log) Coefficients: (Intercept) areal 3.03146 0.03182 Degrees of Freedom: 283 Total (i.e. Null); 282

Goodman / Kruskal gamma

2003 Mar 11

Goodman / Kruskal gamma

The Goodman/Kruskal gamma is a nice descriptive rank-order correlation statistic, often used in psychology. It is nice because it is easy to understand. It takes all pairs of values of each variable and asks whether they are congruent (S+ is the number in the same order for both variables) or discordant (S-, opposite ranking). The statistic is (S+ - S-)/(S+ + S-). It is like tau except for the

reading very large files

2007 Feb 02

reading very large files

Hi all, I have a large file (1.8 GB) with 900,000 lines that I would like to read. Each line is a string characters. Specifically I would like to randomly select 3000 lines. For smaller files, what I'm doing is: trs <- scan("myfile", what= character(), sep = "\n") trs<- trs[sample(length(trs), 3000)] And this works OK; however my computer seems not able to handle

Spearman rank-order correlation matrix

2002 May 06

Spearman rank-order correlation matrix

I"ve got a data frame with a selection of columns I want to compute a rank-order correlation matrix from without disturbing the original data frame. foo[,c("a","b","d","f","g")] What I wanted to do, intuitively, was: > cor(rank(foo[,c("a","b","d","f","g")])) but rank in that context

matrix to coordinates

2003 Apr 24

matrix to coordinates

Dear R-users, I'm sure it must be a specific function or a better way to convert matrix to x,y,z coordinates (and viceversa), than my function below (it works). Any help? m2coord <- function(m) { k <- nrow(m)*ncol(m) aa <- data.frame(r=1:k, c=1:k, v=1:k) k <- 0 for (i in 1:nrow(m)) for (j in 1:ncol(m)) { k <- k+1 aa$f[k]=i; aa$c[k]=j; aa$v[k]=m[i,j] } aa } Juli

repeated measures with random effects

2005 Oct 04

repeated measures with random effects

Dear all, I'm interested in analysing a reapeated measure desing where plant height (H) was measured 3 times (Time). The experimental design include 2 fixed factor (say A and B) in which A is nested in B, and a random factor (C, the plot), using the aov(). So my first idea would be something like: aov(H ~ B * A %in% B * Time + Error(id) ) where id is the factor coded for the repeated

barplot & plot together

2003 Dec 18

barplot & plot together

Dear colleges, I'm trying to combine a barplot and a plot in a single figure as follows: data <- 1:6 t <- barplot(data, axes=F) par(new= T) plot(t, data, type="b") However, as you can see in the example, the dots of the second plot do not fall in the midpoint of the bars in the first. Any trick for setting the 2 plots at the same scale? I have unsuccessfully tried: plot(t,

reshape

2008 Feb 10

reshape

Dear colleagues, I'd like to reshape a datafame in a long format to a wide format, but I do not quite get what I want. Here is an example of the data I've have (dat): sp <- c("a", "a", "a", "a", "b", "b", "b", "c", "d", "d", "d", "d") tr <- c("A",

var[i]

2003 May 01

var[i]

Dear all, How could I use variables in a loop that their names are in a vector? For example: aaa <- 1:10 bbb <- aaa*2 ccc <- aaa+bbb varn <- c("aaa", "bbb", "ccc") m <- rep(NA, 3) for (i in 1:length(varn)) m[i] <- mean(varn[i]) # wrong thanks in advance Juli -- "Wars do not solve problems, wars generate even more problems"

glm: offset in poisson

2000 Mar 16

glm: offset in poisson

R-users, Can an offset term be included in a Poisson model? I get an error message when trying that: >r3o <- glm(tax ~ areal + offset(o), family=poisson) Error in (if (is.empty.model(mt)) glm.fit.null else glm.fit)(x = X, y = Y, : inner loop 1; can't correct step size In addition: Warning message: Step size truncated due to divergence in: (if (is.empty.model(mt))

line thickness in plots

2002 Sep 05

line thickness in plots

Hi all,, Is it possible to set a thicker line for the box around the plots? (i.e. for the four axes) Something like lwd (lines) but for the box. Thanks juli -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the

write.table a df with specific column order

2008 Jun 26

write.table a df with specific column order

Hi I'd like to write.table a dataframe, but with an specific order of columns. Is there a direct way to do it? or I have to generate a new dataframe as follows: t <- data.frame(c=1:10, b=11:20, a=letters[1:10]) t2 <- data.frame(a=t$a, b=t$b, c=t$c) write.table(t2, row.names=F) Thanks for any comment Juli -- http://www.ceam.es/pausas

removing blanks from a string

2008 Jun 27

removing blanks from a string

Hi Is there a way to remove blank characters from the end of strings in a vector? Something like the =TRIM functions of the OpenOffice spreadsheet. E.g., a <- c("hola ", "Yes ", "hello ") # I'd like to get: c("hola", "Yes", "hello") Thanks Juli -- http://www.ceam.es/pausas

RODBC sqlFetch

2002 May 11

RODBC sqlFetch

Hi, Thanks for showing me how to use RODBC and odbcConnect. Now works nicely. The question I've got now is: noms <- list.files(pattern=".DBF") # removing extension names: noms <- sapply(noms, function(x) as.character(strsplit(x,".DBF")) , USE.NAMES=F) for (i in 1:length(noms)) { s <- sqlFetch(bdades, noms[i]) # etc. } But it seems that sqlFetch()

cor.test observations limit

2003 Apr 01

cor.test observations limit

Hi, Is there a limit on the number of observations for using cor.test. For example, > library(ctest) > cor.test(rnorm(3000), rnorm(3000), method="spearman") Error in if (q > (n^3 - n)/6) pspearman(q - 1, n, lower.tail = FALSE) else pspearman(q, : missing value where logical needed In addition: Warning message: NAs introduced by coercion I mainly want to calculate

Very slow: using double apply and cor.test to compute correlation p.values for 2 matrices

2008 Nov 26

Very slow: using double apply and cor.test to compute correlation p.values for 2 matrices

My two matrices are roughly the sizes of m1 and m2. I tried using two apply and cor.test to compute the correlation p.values. More than an hour, and the codes are still running. Please help to make it more efficient. m1 <- matrix(rnorm(100000), ncol=100) m2 <- matrix(rnorm(10000000), ncol=100) cor.pvalues <- apply(m1, 1, function(x) { apply(m2, 1, function(y) { cor.test(x,y)$p.value

glm binomial with no successes

2008 Feb 27

glm binomial with no successes

Dear all, I have a question on glm, family binomial. I do not see significant differences between the levels of a factor (treatment) if all data for a level is 0; and replacing a 0 for a 1 (in fact reducing the difference), then I detect the significant difference that I expected. Is there a way to overcome this problem? or this is an expected behaviour ? Here is an example: s <-

Pseudo R^2 for logit - really naive question

2002 Aug 04

Pseudo R^2 for logit - really naive question

I am using GLM to calculate logit models based on cross-sectional data. I am now down to the hard work of making the results intelligible to very average readers. Is there any way to calculate a psuedo analoque to the R^2 in standard linear regression for use as a purely descriptive statistic of goodness of fit? Most of the readers of my report will be vaguely familiar and more comfortable with

similar to: cor.test in matrices