Displaying 20 results from an estimated 20000 matches similar to: "test for contingency table when there are many zeros"
2003 Jul 15
1
Why two chisq.test p values differ when the contingency table is transposed?
I'm using R1.7.0 runing with Win XP. Thanks,
...Tao
????????????????????????????????????????????????????????
>x
[,1] [,2]
[1,] 149 151
[2,] 1 8
>t(x)
[,1] [,2]
[1,] 149 1
[2,] 151 8
>chisq.test(x, simulate.p.value=T, B=100000)
Pearson's Chi-squared test with simulated p-value (based on 1e+05
replicates)
data: x
X-squared = 5.2001, df =
2012 Dec 01
4
Getting all possible contingency tables
Hello all,
Let say I have 2-way contingency table:
Tab <- matrix(c(8, 10, 12, 6), nr = 2)
and the Chi-squared test could not reject the independence:
> chisq.test(Tab)
Pearson's Chi-squared test with Yates' continuity correction
data: Tab
X-squared = 1.0125, df = 1, p-value = 0.3143
However I want to get all possible contingency tables under this
independence
2005 Jun 22
1
chisq test and fisher exact test
Hi,
I have a text mining project and currently I am working on feature
generation/selection part.
My plan is selecting a set of words or word combinations which have
better discriminant capability than other words in telling the group
id's (2 classes in this case) for a dataset which has 2,000,000
documents.
One approach is using "contrast-set association rule mining" while the
2005 May 26
1
Chi Square Test on two groups of variables
Dear R help
I have been trying to conduct a chi square test on two groups of variables
to test whether there is any relationship between the two sets of variables
chisq.test(oxygen, train)
Pearson's Chi-squared test
data: oxygen
X-squared = 26.6576, df = 128, p-value = 1
> chisq.test(oxygen)
Pearson's Chi-squared test
data: oxygen
X-squared = 26.6576, df = 128,
2010 Jul 08
1
mimic SPSS contingency table results
Dear all
Seems that puzzles always come in packs. I was asked to help with some
statistics in blood analysis. (You can not refuse your wife's asks :-).
She has contingency table for values IgVH mutation and ZAP expression. I
can do chi-square test (in R) and get a results, and with some literature
I can try explain them. However she found an article in which they use
SPSS and use
2009 Nov 26
2
Testing for strength of fit using R
Dear all,
I am trying to validate a model by comparing simulated output values against observed values. I have produced a simple X-y scatter plot with a 1:1 line, so that the closer the points fall to this line, the better the 'fit' between the modelled data and the observation data.
I am now attempting to quantify the strength of this fit by using a statistical test in R. I am no
2003 Jul 15
0
Why two chisq.test p values differ when the contingency
Hi Tao:
The P-values for 2x2 table are generated based on a random (discrete
uniform distribution) sampling of all possible 2x2 tables, conditioning
on the observed margin totals. If one of the cells is extremely small,
as in your case, you get a big difference in P-values. Suppose, you
changed the cell with value 1 to, say, 5 or 6, then the two P-values
are nearly the same. However, I
2006 Dec 02
1
Chi-squared approximation may be incorrect in: chisq.test(x)
I am getting "Chi-squared approximation may be incorrect in:
chisq.test(x)" with the data bleow.
Frequency distribution of number of male offspring in families of size 5.
Number of Male Offspring N
0 518
1 2245
2 4621
3 4753
4 2476
5
2012 Feb 20
1
chisq.test vs manual calculation - why are different results produced?
Hello,
I am trying to fit gamma, negative exponential and inverse power functions
to a dataset, and then test whether the fit of each curve is good. To do
this I have been advised to calculate predicted values for bins of data (I
have grouped a continuous range of distances into 1km bins), and then apply
a chi-squared test. Example:
> data <- data.frame(distance=c(1,2,3,4,5,6,7),
2005 Jun 26
2
chisq.test using amalgamation automatically (possible ?!?)
Dear List,
If any of observed and/or expected data has less than
5 frequencies, then chisq.test (Pearson's Chi-squared
Test for Count Data from package:stats) gives warning
messages. For example,
x<-c(10, 14, 10, 11, 11, 7, 8, 4, 1, 4, 4, 2, 1, 1, 2,
1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1)
y<-c(9.13112391745095, 13.1626482033341,
12.6623267638188, 11.0130706413029, 9.16415925139016,
2002 May 23
2
Find if there is independence
Hello
I have the matrix
a<-matrix(c(2,1,0,1,2,2,1,5,7,2,5,12),nrow=6)
a
[,1] [,2]
[1,] 2 1
[2,] 1 5
[3,] 0 7
[4,] 1 2
[5,] 2 5
[6,] 2 12
Suppose that in the first row we have 3 men of England, 2 with hair, and 1
no
In the second we have 6 italian men, 1 with hair and 5 no ...
I want to find if there is a dependence between men withouth hair and
2011 Oct 11
3
Chi-Square test and survey results
An organization has asked me to comment on the validity of their
recent all-employee survey. Survey responses, by geographic region,
compared
with the total number of employees in each region, were as follows:
> ByRegion
All.Employees Survey.Respondents
Region_1 735 142
Region_2 500 83
Region_3 897 78
2005 Feb 15
1
Tests on contingency tables
Dear all,
I have a dataset with qualitative variables (factors) and I want to test the
null hypothesis of independance between two variables for each pair by using
appropriate tests on contingency tables.
I first applied chisq.test and obtained dependance in almost all cases with
extremely small p-values and warning messages.
> chisq.test(table(data$ins.f, data$ins.st))$p.val
[1]
2008 Nov 16
3
chisq.test with simulate.p.value=TRUE (PR#13292)
Full_Name: Reginaldo Constantino
Version: 2.8.0
OS: Ubuntu Hardy (32 bit, kernel 2.6.24)
Submission from: (NULL) (189.61.88.2)
For many tables, chisq.test with simulate.p.value=TRUE gives a p value that is
obviously incorrect and inversely proportional to the number of replicates:
> data(HairEyeColor)
> x <- margin.table(HairEyeColor, c(1, 2))
>
2003 Dec 09
2
p-value from chisq.test working strangely on 1.8.1
Hello everybody,
I'm seeing some strange behavior on R 1.8.1 on Intel/Linux compiled
with gcc 3.2.2. The p-value calculated from the chisq.test function is
incorrect for some input values:
> chisq.test(matrix(c(0, 1, 1, 12555), 2, 2), simulate.p.value=TRUE)
Pearson's Chi-squared test with simulated p-value (based on 2000
replicates)
data: matrix(c(0, 1, 1,
2009 Jun 03
1
Validity of Pearson's Chi-Square for Large Tables
Is Pearson's Chi-Square test for contingency tables asymptotically unbiased
for large tables (large degrees of freedom) regardless of the expected
values in each cell? The rule of thumb is that Pearson's Chi-square should
not be used when large numbers of cells have expected values < 5. However,
I compared the results on 4x4 contingency tables for R's chisq.test using
chi-square
2003 Apr 22
4
fisher exact vs. simulated chi-square
Dear All,
I have a problem understanding the difference between the outcome of a
fisher exact test and a chi-square test (with simulated p.value).
For some sample data (see below), fisher reports p=.02337. The normal
chi-square test complains about "approximation may be incorrect",
because there is a column with cells with very small values. I
therefore tried the chi-square with
2006 Jun 30
2
Query : Chi Square goodness of fit test
I want to calculate chi square test of goodness of fit to test,
Sample coming from Poisson distribution.
please copy this script in R & run the script
The R script is as follows
########################## start
#########################################
No_of_Frouds<-
c(4,1,6,9,9,10,2,4,8,2,3,0,1,2,3,1,3,4,5,4,4,4,9,5,4,3,11,8,12,3,10,0,7)
N <- length(No_of_Frouds)
# Estimation of
2012 Mar 06
1
How to apply two parameter function in data frame
I know this is something simple that I cannot do because I do not yet "think"
in R.
I have a data frame has a variable participation (a factor), and several
other factors.
I want a chisq test (no contingency tables) for participation vs all of the
other factors.
In SPSS I would do:
CROSSTABS
/TABLES= (my other factors) BY participation
/FORMAT=NOTABLES
/STATISTICS=CHISQ
2008 Jan 17
1
'simulate.p.value' for goodness of fit
R Help on 'chisq.test' states that
"if 'simulate.p.value' is 'TRUE', the p-value is computed by Monte
Carlo simulation with 'B' replicates.
In the contingency table case this is done by random sampling from
the set of all contingency tables with given marginals, and works
only if the marginals are positive...
In the