thr3ads.net - similar to: "statistical significance test for cluster agreement"

Displaying 20 results from an estimated 10000 matches similar to: "statistical significance test for cluster agreement"

2004 Feb 27

importing S-Plus data files

I have some data in the Linux version of S-Plus, which I cannot use anymore. The program is just broken and won't run. I'm trying to find a way to import that data to either the Windows version of S-Plus (which I have running on my other machine) or R (Linux or Windows, it doesn't matter). Unfortunately, nothing seems to work. Windows S-Plus seems to ignore files from Linux .Data

2007 Jun 26

inter-rater agreement index kappa

Is there a function that calculates the inter-rater agreement index (kappa) in R? Thanks ../Murli
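
For readers unfamiliar with the index asked about above, here is a minimal sketch of Cohen's kappa for two raters, written in plain Python rather than R. The function name `cohen_kappa` and the sample labels are hypothetical and not taken from the thread. The sketch assumes both raters labelled the same items in the same order.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of category labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need two non-empty label sequences of equal length")
    n = len(rater_a)
    # Observed agreement: share of items given the same label by both raters.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: sum over labels of the product of marginal frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    p_chance = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if p_chance == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_observed - p_chance) / (1.0 - p_chance)


# Hypothetical ratings, for illustration only.
ratings_a = ["yes", "yes", "no", "no", "yes", "no"]
ratings_b = ["yes", "no", "no", "no", "yes", "yes"]
kappa = cohen_kappa(ratings_a, ratings_b)
```

Kappa is 1 for perfect agreement and 0 when agreement is no better than the marginal label frequencies would produce by chance.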

2003 Oct 16

R memory and CPU requirements

Thanks for all the help on my previous questions. One more (hopefully the last one): I was very surprised when I tried to fit a model (using aov()) for a sample of size 200 and 10 variables and their interactions. It turns out that even 2GB of RAM is not enough for aov() with this sample size, which does not seem so big to me. Am I doing something wrong or this is considered a normal

2003 Oct 15

aov and non-categorical variables

It is unclear to me how aov() handles non-categorical variables. I mean it works and produces results that I would expect, but I was under the impression that ANOVA is only defined for categorical variables. In addition, help(aov) says that it "call to 'lm' for each stratum", which I presume means that it calls lm() for every group of the categorical variable, however I

2006 Jan 08

Clustering and Rand Index - VS-KM

Dear WizaRds, I have been trying to compute the adjusted Rand index as defined by Hubert/Arabie, and could not work out how to define a partition object as in my last request yesterday. With package fpc I try to work around the problem, using my original data: mat <- matrix( c(6,7,8,2,3,4,12,14,14, 14,15,13,3,1,2,3,4,2, 15,3,10,5,11,7,13,6,1, 15,4,10,6,12,8,12,7,1), ncol=9, byrow=T )
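
As an illustration of the quantity discussed in this thread, the following is a minimal sketch of the Hubert/Arabie adjusted Rand index computed from two label vectors. It is plain Python, not the fpc package's interface. The function name `adjusted_rand_index` and the example labels are hypothetical. It assumes each partition is given as one cluster label per object, in the same object order.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Hubert/Arabie adjusted Rand index for two partitions of the same objects."""
    if len(labels_a) != len(labels_b) or len(labels_a) < 2:
        raise ValueError("need two label sequences of equal length >= 2")
    n = len(labels_a)
    # Pairs of objects placed together in both partitions (contingency cells).
    pairs_joint = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    # Pairs placed together within each partition separately (marginals).
    pairs_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    pairs_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    # Expected joint pairs under random labelling with fixed marginals.
    expected = pairs_a * pairs_b / comb(n, 2)
    max_index = (pairs_a + pairs_b) / 2
    if max_index == expected:
        raise ValueError("index is undefined for two trivial partitions")
    return (pairs_joint - expected) / (max_index - expected)


# Hypothetical partitions: the same grouping with permuted labels, then a crossed one.
same_up_to_relabel = adjusted_rand_index([0, 0, 1, 1, 2, 2], [1, 1, 0, 0, 2, 2])
crossed = adjusted_rand_index([0, 0, 1, 1], [0, 1, 0, 1])
```

The index does not depend on how the clusters are numbered, so two identical groupings score 1 even when their labels differ.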

2002 Feb 05

Measures of agreement

Greetings. I've been experimenting with some algorithms for document classification (specifically, a Naive Bayes classifier and a kNN classifier) and I would now like to calculate some inter-rater reliability scores. I have the data in a PostgreSQL database, such that for each document, each measure (there are 9) has three variables: ap_(measure), nb_(measure), and knn_(measure). ap is me

2004 Jul 13

e1071 question: what's the definition of performance in tune.* functions?

Basically, the `Details' section of ?tune says it all: Details: As performance measure, the classification error is used for classification, and the mean squared error for regression. ... Andy > From: Tae-Hoon Chung > > Hi, all; > > Basically, the subject contains all the information I need to know. > In e1071 library, there are functions to tune parameters.

2005 May 27

logistic regression

Hi I am working on corpora of automatically recognized utterances, looking for features that predict error in the hypothesis the recognizer is proposing. I am using the glm functions to do logistic regression. I do this type of thing: * logistic.model = glm(formula = similarity ~., family = binomial, data = data) and end up with a model: > summary(logistic.model) Call:

2002 Aug 09

asking for help (was RE: nnet trouble, continued)

> From: Sirotkin, Alexander [mailto:demiurg at ti.com] > > P.S. Anybody who finds my postings insulting, please write a > mail filter > for my address. Although I can not understand why it may insult > anybody. > > P.P.S. Must be related to cultural differences, I guess... It's not insulting to me, personally, but quite possibly to the author of that

2006 Dec 11

cohen kappa for two-way table

Greetings, I am a bit confused by the results returned by the functions cohen.kappa {concord} and classAgreement {e1071} when using a two-way table. For example, if I have a matrix A, and a similar matrix B (same dimensions), then: matrices A and B can be found at: http://casoilresource.lawr.ucdavis.edu/drupal/files/a_40.txt http://casoilresource.lawr.ucdavis.edu/drupal/files/b_40.txt A <-

2004 Nov 22

Questions of Significance Analysis of Microarrays(SAM){siggenes}

Dear All: Significance Analysis of Microarrays (SAM). As we know, sam does multiple t.test as follows: ## Default S3 method: t.test(x, y = NULL, alternative = c("two.sided", "less", "greater"), mu = 0, paired = FALSE, var.equal = FALSE, conf.level = 0.95, ...) var.equal: a logical variable indicating whether to treat the two variances as being equal. If 'TRUE'

2001 Aug 21

difference between trees in R?

Hi. I am wondering if anybody has studied and/or written code in R to calculate the distance between 2 "trees". For example, if one does a hierarchical agglomerative clustering and, say, a hierarchical divisive clustering (represented as trees) and wishes to compute a metric on them. I am thinking of something like the symmetric difference as mentioned in Margush and McMorris (1982).

2002 Jul 02

mean and array

In general this is what "apply" does. In your example you could use rowMeans(ar) as well, I think. Reid Huntsinger -----Original Message----- From: Olivier Martin [mailto:olivier.martin at inrialpes.fr] Sent: Tuesday, July 02, 2002 10:55 AM To: r-help Subject: [R] mean and array Hi all, I have an array, one say ar, with 3 dimensions. dim(ar) >200 3 4 I would like to

2006 Aug 24

internet explorer

Help! I tried nearly everything I could and still was not able to get IE working with wine 0.9.19. Manual install using App DB, winetools, ies4linux, etc... everything fails, with different problems and error messages of course. It's really frustrating. Is there a way to install IE that actually works?

2005 Jul 14

QoS on receive

It appears that while Linux has plenty of traffic shaping mechanisms on transmit, there is nothing on the receive side. While generally it does make sense, since transmit is a more CPU-intensive operation, receive also consumes CPU cycles after all. It is clear that it's best to drop the packet as soon as possible, i.e. on receive, if possible - by the driver itself. It may not be feasible in

2005 May 12

modifications to text.tree function

Hi. I have to make some minor modifications to the text.tree function - I don't like the way it prints the split labels (they are too long in my case and overlap). I tried to make a simple modification to the text.tree function so that it will limit the number of significant digits in tree labels, but could not - the original function uses some undocumented "treeco" function, which

2009 Mar 08

statistical question: confidence interval of regression weight - significance

Hi, first of all, thanks for the help on getting confidence intervals in R. Now I have a purely statistical question. I hope you don't mind if I ask ... I have an expectation of how large my beta-weight in a regression should be - so I have an "ideal" or expected regression line. Now the real beta-weight is less than the expected one, and when I draw the confidence interval lines

2005 Aug 05

Statistical significance of a classifier

Hi, I have a bunch of data points x from two classes A & B, and I'm creating a classifier. So I have a function f(x) which estimates the probability that x is in class A. (I have an equal number of examples of each, so p(class) = 0.5.) One way of seeing how well this does is to compute the error rate on the test set, i.e. if f(x)>0.5 call it A, and see how many times I

2009 Mar 20

Is this sample size big enough to test for statistical significance?

Dear R community, Is this sample size large enough to study differences between two groups of the population? Q1: do the body temperatures differ between the two groups of overwintering turtles, juveniles and adults? One group (adults) has 6 turtles. The second group (juveniles) has 1 turtle. There are 3 replications, i.e. the experiment was repeated over the three years, but using

2012 Mar 14

How to test the statistical significance of the difference of two univariate Linear Regression betas?

How to test the statistical significance of the difference of two univariate Linear Regression betas? Hi all, There are two samples of data: D1 and D2. On data D1 we do a univariate Linear Regression and get the coefficient beta1. On data D2 we do a univariate Linear Regression and get the coefficient beta2. How do I test the statistical significance of (beta1-beta2)? Could you please
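
One common way to approach the question in this snippet is a z-test on the difference of the two slopes. The sketch below is a plain-Python illustration under stated assumptions, not an answer taken from the thread. It assumes D1 and D2 are independent samples and uses a large-sample normal approximation. With small samples, a t reference distribution or a single pooled model with an interaction term may be more appropriate. The function names and the data are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean


def slope_and_se(x, y):
    """OLS slope of y on x (with intercept) and its standard error."""
    n = len(x)
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    intercept = my - slope * mx
    # Residual sum of squares, with n - 2 degrees of freedom.
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    return slope, sqrt(rss / (n - 2) / sxx)


def slope_difference_z_test(x1, y1, x2, y2):
    """Two-sided z-test of beta1 - beta2 = 0 for two independent samples."""
    b1, se1 = slope_and_se(x1, y1)
    b2, se2 = slope_and_se(x2, y2)
    # Independence means the variances of the two slope estimates add.
    z = (b1 - b2) / sqrt(se1 ** 2 + se2 ** 2)
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical data standing in for D1 and D2.
x = [1, 2, 3, 4, 5]
d1_y = [2.1, 3.9, 6.2, 7.8, 10.1]
d2_y = [1.2, 1.9, 3.2, 3.8, 5.1]
beta1, _ = slope_and_se(x, d1_y)
beta2, _ = slope_and_se(x, d2_y)
z, p_value = slope_difference_z_test(x, d1_y, x, d2_y)
```

A small p-value here suggests that the two slopes differ by more than their combined sampling error would explain.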
