similar to: cor.test in matrices

Displaying 20 results from an estimated 10000 matches similar to: "cor.test in matrices"

2003 Apr 24
3
Missing Value And cor() function
Hi R lovers! I'd like to apply the cor() function to a matrix that has some missing values. As a matter of fact, and quite logically, it doesn't work. Is there a trick to replace each missing value with the mean of its variable or with some other relevant figure? Or should I apply a special variant of the cor() function (I don't have any idea if one exists and have some trouble to
2001 Dec 18
4
chi-squared test
I don't quite understand the difference between the two methods for performing a chi-squared test on contingency tables: summary(table()) and chisq.test(). They may give different results. E.g.: aa <- gl(2, 10) bb <- as.factor(c(1,2,2,2,1,2,1,2,2,2,1,2,2,2,1,1,1,2,1,1)) aa <- c(aa, aa) bb <- c(bb, bb) table(aa, bb) summary(table(aa, bb)) chisq.test(aa, bb) Could somebody give me
2003 Mar 11
1
Goodman / Kruskal gamma
The Goodman/Kruskal gamma is a nice descriptive rank-order correlation statistic, often used in psychology. It is nice because it is easy to understand. It takes all pairs of values of each variable and asks whether they are congruent (S+ is the number in the same order for both variables) or discordant (S-, opposite ranking). The statistic is (S+ - S-)/(S+ + S-). It is like tau except for the
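The definition in the snippet above can be sketched directly in base R. This is only an illustration under stated assumptions: `gk_gamma` is a hypothetical helper name (not a function from any package), both inputs are assumed to be numeric vectors of equal length, and tied pairs are simply left out of both counts.

```r
# Goodman/Kruskal gamma for two numeric vectors.
# gk_gamma is a hypothetical name used for illustration only.
gk_gamma <- function(x, y) {
  stopifnot(length(x) == length(y))
  # Sign of the difference for every pair (i, j) on each variable
  dx <- sign(outer(x, x, "-"))
  dy <- sign(outer(y, y, "-"))
  agreement <- dx * dy              # +1 same order, -1 opposite, 0 tie
  once <- upper.tri(agreement)      # count each pair only once
  s_plus  <- sum(agreement[once] > 0)   # S+: pairs in the same order
  s_minus <- sum(agreement[once] < 0)   # S-: pairs in opposite order
  (s_plus - s_minus) / (s_plus + s_minus)
}

g_same     <- gk_gamma(c(1, 2, 3, 4), c(1, 2, 3, 4))
g_reversed <- gk_gamma(c(1, 2, 3, 4), c(4, 3, 2, 1))
g_mixed    <- gk_gamma(c(1, 2, 3), c(1, 3, 2))
```

The pairwise matrices grow with the square of the number of observations, so this form is only practical for modest sample sizes.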
2001 Sep 25
2
glm.nb, anova.negbin
Dear R-colleagues, I'm getting an error message (Error in round) when summarising a glm.nb model, and when using anova.negbin (in R 1.3.1 for Windows): > m.nb <- glm.nb(tax ~ areal) > m.nb Call: glm.nb(formula = tax ~ areal, init.theta = 5.08829537115498, link = log) Coefficients: (Intercept) areal 3.03146 0.03182 Degrees of Freedom: 283 Total (i.e. Null); 282
2002 May 06
3
Spearman rank-order correlation matrix
I've got a data frame with a selection of columns from which I want to compute a rank-order correlation matrix without disturbing the original data frame: foo[,c("a","b","d","f","g")] What I wanted to do, intuitively, was: > cor(rank(foo[,c("a","b","d","f","g")])) but rank in that context
2007 Feb 02
5
reading very large files
Hi all, I have a large file (1.8 GB) with 900,000 lines that I would like to read. Each line is a character string. Specifically, I would like to randomly select 3000 lines. For smaller files, what I'm doing is: trs <- scan("myfile", what= character(), sep = "\n") trs<- trs[sample(length(trs), 3000)] And this works OK; however, my computer seems not able to handle
2003 Apr 24
1
matrix to coordinates
Dear R-users, I'm sure there must be a specific function or a better way to convert a matrix to x,y,z coordinates (and vice versa) than my function below (it works). Any help? m2coord <- function(m) { k <- nrow(m)*ncol(m) aa <- data.frame(r=1:k, c=1:k, v=1:k) k <- 0 for (i in 1:nrow(m)) for (j in 1:ncol(m)) { k <- k+1 aa$r[k]=i; aa$c[k]=j; aa$v[k]=m[i,j] } aa } Juli
2005 Oct 04
1
repeated measures with random effects
Dear all, I'm interested in analysing a repeated-measures design where plant height (H) was measured 3 times (Time). The experimental design includes 2 fixed factors (say A and B), in which A is nested in B, and a random factor (C, the plot), using aov(). So my first idea would be something like: aov(H ~ B * A %in% B * Time + Error(id) ) where id is the factor coded for the repeated
2003 Dec 18
2
barplot & plot together
Dear colleagues, I'm trying to combine a barplot and a plot in a single figure as follows: data <- 1:6 t <- barplot(data, axes=F) par(new= T) plot(t, data, type="b") However, as you can see in the example, the dots of the second plot do not fall on the midpoints of the bars in the first. Any trick for setting the 2 plots to the same scale? I have unsuccessfully tried: plot(t,
2008 Feb 10
2
reshape
Dear colleagues, I'd like to reshape a data frame from long format to wide format, but I do not quite get what I want. Here is an example of the data I have (dat): sp <- c("a", "a", "a", "a", "b", "b", "b", "c", "d", "d", "d", "d") tr <- c("A",
2003 May 01
4
var[i]
Dear all, How could I use variables in a loop when their names are stored in a vector? For example: aaa <- 1:10 bbb <- aaa*2 ccc <- aaa+bbb varn <- c("aaa", "bbb", "ccc") m <- rep(NA, 3) for (i in 1:length(varn)) m[i] <- mean(varn[i]) # wrong Thanks in advance, Juli -- "Wars do not solve problems, wars generate even more problems"
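One way to approach the question above, as a minimal sketch rather than the answer given on the list: base R's `get()` looks an object up by its name, so the string in `varn[i]` can be turned back into the variable it names. The variable names are taken from the snippet; `m2` is a name added here for illustration.

```r
aaa <- 1:10
bbb <- aaa * 2
ccc <- aaa + bbb
varn <- c("aaa", "bbb", "ccc")

# get() returns the object whose name is given as a character string
m <- rep(NA_real_, length(varn))
for (i in seq_along(varn)) m[i] <- mean(get(varn[i]))

# Loop-free variant; the result is named after the variables
m2 <- vapply(varn, function(v) mean(get(v)), numeric(1))
```

Keeping the vectors in a named list and indexing it by name is often easier to maintain than looking up free-standing variables with `get()`.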
2000 Mar 16
2
glm: offset in poisson
R-users, Can an offset term be included in a Poisson model? I get an error message when trying that: >r3o <- glm(tax ~ areal + offset(o), family=poisson) Error in (if (is.empty.model(mt)) glm.fit.null else glm.fit)(x = X, y = Y, : inner loop 1; can't correct step size In addition: Warning message: Step size truncated due to divergence in: (if (is.empty.model(mt))
2002 Sep 05
4
line thickness in plots
Hi all, Is it possible to set a thicker line for the box around the plots (i.e. for the four axes)? Something like lwd (lines) but for the box. Thanks, Juli
2008 Jun 26
1
write.table a df with specific column order
Hi, I'd like to write.table a data frame, but with a specific order of columns. Is there a direct way to do it, or do I have to generate a new data frame as follows: t <- data.frame(c=1:10, b=11:20, a=letters[1:10]) t2 <- data.frame(a=t$a, b=t$b, c=t$c) write.table(t2, row.names=F) Thanks for any comment, Juli -- http://www.ceam.es/pausas
2008 Jun 27
1
removing blanks from a string
Hi, Is there a way to remove blank characters from the end of strings in a vector? Something like the =TRIM function of the OpenOffice spreadsheet. E.g., a <- c("hola ", "Yes ", "hello ") # I'd like to get: c("hola", "Yes", "hello") Thanks, Juli -- http://www.ceam.es/pausas
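A minimal sketch of two base-R ways to do what the question above asks; it is not taken from the thread. The sample vector is an assumption modelled on the snippet, with varying numbers of trailing blanks, and `trimmed` and `trimmed2` are names added here. Note that `trimws()` only exists in R 3.2.0 and later, so it was not available when the question was posted.

```r
a <- c("hola ", "Yes  ", "hello   ")

# Regular expression: one or more whitespace characters anchored at the end
trimmed <- sub("[[:space:]]+$", "", a)

# trimws() (R >= 3.2.0) strips trailing whitespace when which = "right"
trimmed2 <- trimws(a, which = "right")
```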
2003 Apr 01
2
cor.test observations limit
Hi, Is there a limit on the number of observations for using cor.test? For example, > library(ctest) > cor.test(rnorm(3000), rnorm(3000), method="spearman") Error in if (q > (n^3 - n)/6) pspearman(q - 1, n, lower.tail = FALSE) else pspearman(q, : missing value where logical needed In addition: Warning message: NAs introduced by coercion I mainly want to calculate
2002 May 11
2
RODBC sqlFetch
Hi, Thanks for showing me how to use RODBC and odbcConnect. It now works nicely. The question I've got now is: noms <- list.files(pattern=".DBF") # removing extension names: noms <- sapply(noms, function(x) as.character(strsplit(x,".DBF")) , USE.NAMES=F) for (i in 1:length(noms)) { s <- sqlFetch(bdades, noms[i]) # etc. } But it seems that sqlFetch()
2008 Nov 26
2
Very slow: using double apply and cor.test to compute correlation p.values for 2 matrices
My two matrices are roughly the sizes of m1 and m2. I tried using two nested apply calls and cor.test to compute the correlation p-values. After more than an hour, the code is still running. Please help me make it more efficient. m1 <- matrix(rnorm(100000), ncol=100) m2 <- matrix(rnorm(10000000), ncol=100) cor.pvalues <- apply(m1, 1, function(x) { apply(m2, 1, function(y) { cor.test(x,y)$p.value
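One possible way to avoid the nested loop, sketched under assumptions rather than taken from the thread: for the default Pearson test, the two-sided p-value reported by `cor.test()` comes from a t statistic with n - 2 degrees of freedom, so all p-values can be derived from a single `cor()` call on the transposed matrices. `row_cor_pvalues` is a hypothetical helper name, the data are assumed to contain no missing values, and the small matrices below stand in for `m1` and `m2`.

```r
# Pearson p-values between every row of a and every row of b.
# row_cor_pvalues is a hypothetical name used for illustration only.
row_cor_pvalues <- function(a, b) {
  stopifnot(ncol(a) == ncol(b))
  n <- ncol(a)                        # observations per row
  r <- cor(t(a), t(b))                # nrow(a) x nrow(b) correlations
  tstat <- r * sqrt((n - 2) / (1 - r^2))
  2 * pt(-abs(tstat), df = n - 2)     # two-sided p-values
}

set.seed(1)
a <- matrix(rnorm(5 * 100), ncol = 100)
b <- matrix(rnorm(8 * 100), ncol = 100)
p <- row_cor_pvalues(a, b)

# Spot check against cor.test() for one pair of rows
p_check <- cor.test(a[2, ], b[3, ])$p.value
```

The result has one row per row of `a`, which is the transpose of what the nested `apply` in the snippet would return. At the sizes quoted in the question the full result would hold 1000 x 100000 values, so it may need to be computed in blocks of rows to fit in memory.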
2008 Feb 27
1
glm binomial with no successes
Dear all, I have a question on glm, family binomial. I do not see significant differences between the levels of a factor (treatment) if all the data for a level are 0; when I replace a 0 with a 1 (in fact reducing the difference), I then detect the significant difference that I expected. Is there a way to overcome this problem, or is this expected behaviour? Here is an example: s <-
2002 Aug 04
5
Pseudo R^2 for logit - really naive question
I am using GLM to calculate logit models based on cross-sectional data. I am now down to the hard work of making the results intelligible to very average readers. Is there any way to calculate a pseudo analogue to the R^2 in standard linear regression for use as a purely descriptive statistic of goodness of fit? Most of the readers of my report will be vaguely familiar and more comfortable with