similar to: statistical significance test for cluster agreement

Displaying 20 results from an estimated 10000 matches similar to: "statistical significance test for cluster agreement"

2004 Feb 27
2
importing S-Plus data files
I have some data in the Linux version of S-Plus, which I can not use anymore. The program is just broken and won't run. I'm trying to find a way to import that data to either the Windows version of S-Plus (which I have running on my other machine) or R (Linux or Windows, it doesn't matter). Unfortunately, nothing seems to work. Windows S-Plus seems to ignore files from Linux .Data
2007 Jun 26
3
inter-rater agreement index kappa
Is there a function that calculates the inter-rater agreement index (kappa) in R? Thanks ../Murli
2003 Oct 16
4
R memory and CPU requirements
Thanks for all the help on my previous questions. One more (hopefully the last one): I was very surprised when I tried to fit a model (using aov()) for a sample of size 200 and 10 variables and their interactions. It turns out that even 2GB of RAM is not enough for aov() with this sample size, which does not seem so big to me. Am I doing something wrong or is this considered a normal
2003 Oct 15
2
aov and non-categorical variables
It is unclear to me how aov() handles non-categorical variables. I mean it works and produces results that I would expect, but I was under the impression that ANOVA is only defined for categorical variables. In addition, help(aov) says that it "call to 'lm' for each stratum", which I presume means that it calls lm() for every group of the categorical variable, however I
2006 Jan 08
1
Clustering and Rand Index - VS-KM
Dear WizaRds, I have been trying to compute the adjusted Rand index as by Hubert/ Arabie, and could not correctly approach how to define a partition object as in my last request yesterday. With package fpc I try to work around the problem, using my original data: mat <- matrix( c(6,7,8,2,3,4,12,14,14, 14,15,13,3,1,2,3,4,2, 15,3,10,5,11,7,13,6,1, 15,4,10,6,12,8,12,7,1), ncol=9, byrow=T )
2002 Feb 05
2
Measures of agreement
Greetings. I've been experimenting with some algorithms for document classification (specifically, a Naive Bayes classifier and a kNN classifier) and I would now like to calculate some inter-rater reliability scores. I have the data in a PostgreSQL database, such that for each document, each measure (there are 9) has three variables: ap_(measure), nb_(measure), and knn_(measure). ap is me
2004 Jul 13
2
e1071 question: what's the definition of performance in tune.* functions?
Basically, the `Detail' section of ?tune says it all: Details: As performance measure, the classification error is used for classification, and the mean squared error for regression. ... Andy > From: Tae-Hoon Chung > > Hi, all; > > Basically, the subject contains the all information I need to know. > In e1071 library, there are functions to tune parameters.
2005 May 27
1
logistic regression
Hi I am working on corpora of automatically recognized utterances, looking for features that predict error in the hypothesis the recognizer is proposing. I am using the glm functions to do logistic regression. I do this type of thing: * logistic.model = glm(formula = similarity ~., family = binomial, data = data) and end up with a model: > summary(logistic.model) Call:
2002 Aug 09
1
asking for help (was RE: nnet trouble, continued)
> From: Sirotkin, Alexander [mailto:demiurg at ti.com] > > P.S. Anybody who finds my postings insulting, please write a > mail filter > for my address. Although I can not understand why it may insult > anybody. > > P.P.S. Must be related to cultural differences, I guess... It's not insulting to me, personally, but quite possibly to the author of that
2006 Dec 11
1
cohen kappa for two-way table
Greetings, I am a bit confused by the results returned by the functions: cohen.kappa {concord} classAgreement {e1071} when using a two-way table. For example, if I have a matrix A, and a similar matrix B (same dimensions), then: matrix A and B can be found: http://casoilresource.lawr.ucdavis.edu/drupal/files/a_40.txt http://casoilresource.lawr.ucdavis.edu/drupal/files/b_40.txt A <-
2004 Nov 22
1
Questions of Significance Analysis of Microarrays(SAM){siggenes}
Dear All: Significance Analysis of Microarrays (SAM) As we know, SAM performs multiple t.test calls as follows: ## Default S3 method: t.test(x, y = NULL, alternative = c("two.sided", "less", "greater"), mu = 0, paired = FALSE, var.equal = FALSE, conf.level = 0.95, ...) var.equal: a logical variable indicating whether to treat the two variances as being equal. If 'TRUE'
2001 Aug 21
1
difference between trees in R?
Hi. I am wondering if anybody has studied and/or written code in R to calculate the distance between 2 "trees". For example, if one does a hierarchical agglomerative clustering and, say, a hierarchical divisive clustering (represented as trees) and wishes to compute a metric on them. I am thinking of something like the symmetric difference as mentioned in Margush and McMorris (1982).
2002 Jul 02
3
mean and array
In general this is what "apply" does. In your example you could use rowMeans(ar) as well, I think. Reid Huntsinger -----Original Message----- From: Olivier Martin [mailto:olivier.martin at inrialpes.fr] Sent: Tuesday, July 02, 2002 10:55 AM To: r-help Subject: [R] mean and array Hi all, I have an array, one say ar, with 3 dimensions. dim(ar) >200 3 4 I would like to
2006 Aug 24
3
internet explorer
Help! I tried nearly everything I could and still was not able to get IE working with wine 0.9.19. Manual install using App DB, winetools, ies4linux, etc... everything fails, with different problems and error messages of course. It's really frustrating. Is there a way to install IE that actually works ?
2005 Jul 14
2
QoS on receive
It appears that while Linux has plenty of traffic shaping mechanisms on transmit, there is nothing on the receive side. While generally it does make sense since transmit is a more CPU-intensive operation, after all receive also consumes CPU cycles. It is clear that it's best to drop the packet as soon as possible, i.e. on receive, if possible - by the driver itself. It may not be feasible in
2005 May 12
1
modifications to text.tree function
Hi. I have to make some minor modifications to the text.tree function - I don't like the way it prints the split labels (they are too long in my case and overlap). I tried to make a simple modification to the text.tree function so that it will limit the number of significant digits in tree labels, but could not - the original function uses some undocumented "treeco" function, which
2009 Mar 08
0
statistical question: confidence interval of regression weight - significance
hi, at first; thanks for the help on getting confidence intervals in R. now I have a pure statistical question. I hope you don't mind if I ask ... I have an expectation of how large my beta-weight in a regression should be - so I have an "ideal" or expected regression line. Now the real beta-weight is less than the expected and when I draw the confidence interval lines
2005 Aug 05
1
Statistical significance of a classifier
Hi, I have a bunch of data points x from two classes A & B, and I'm creating a classifier. So I have a function f(x) which estimates the probability that x is in class A. (I have an equal number of examples of each, so p(class) = 0.5.) One way of seeing how well this does is to compute the error rate on the test set, i.e. if f(x)>0.5 call it A, and see how many times I
2009 Mar 20
1
Is this sample size big enough to test for statistical significance?
Dear R community, Is this sample size large enough to study differences between two groups of the populations? Q1: do the body temperatures differ between the two groups of the overwintering turtles juveniles and adults? One group (adults) has 6 turtles Second group (juveniles) has 1 turtle. There are 3 replications, i.e. the experiment was repeated over the three years, but using
2008 Aug 21
3
Null and Alternate hypothesis for Significance test
Hi, I had a question about specifying the Null hypothesis in a significance test. Advance apologies if this has already been asked previously or is a naive question. I have two samples A and B, and I want to test whether A and B come from the same distribution. The default Null hypothesis would be H0: A=B But since I am trying to prove that A and B indeed come from the same distribution, I think