Hi,
I want to do a chi-square test, and I have two tab-delimited text files with
Expected and Observed values to compare.  Each file contains only the values,
arranged as 48 rows by 116 columns.  I have managed to do something with them,
but I don't think it is right, as I got a p-value of 1.  In this case I used
the read.table() function to read the values from the files.  But I don't
know if this was right.

> x=read.table(file="C:/Program Files/R/R-2.2.1/Projects/Stats EU/Expected input.txt")

> y=read.table(file="C:/Program Files/R/R-2.2.1/Projects/Stats EU/Observed input.txt")

> chisq.test(x,y)

        Pearson's Chi-squared test
 
data:  x 
X-squared = 4.4602, df = 5405, p-value = 1
 
Warning message:
Chi-squared approximation may be incorrect in: chisq.test(x, y)

Maybe the scan() function is more appropriate?  Using this I got:

> x=scan(file="C:/Program Files/R/R-2.2.1/Projects/Stats EU/Observed input.txt")
Read 5568 items

> y=scan(file="C:/Program Files/R/R-2.2.1/Projects/Stats EU/Expected input.txt")
Read 5568 items
 
> chisq.test(x,y)
 
        Pearson's Chi-squared test
 
data:  x and y 
X-squared = 172306.4, df = 13880, p-value < 2.2e-16
 
Warning message:
Chi-squared approximation may be incorrect in: chisq.test(x, y)
 
 
 
Any help would be much appreciated.
Regards,
 
Carina
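
A note on the two results above, offered as a sketch and not a definitive
answer.  The first output reports "data:  x" with df = 5405 = (48-1)*(116-1),
which is what chisq.test() gives when its first argument is a table with at
least two rows and columns: it runs an independence test on that table alone
and ignores y.  In the second call both vectors are coerced to factors and
cross-tabulated, so it is not an observed-versus-expected comparison either.
A direct comparison can be written out by hand as below.  The small matrices
are made-up stand-ins for the real 48 x 116 files, and the choice of degrees
of freedom is an assumption that depends on how the expected values were
obtained.

```r
# Hypothetical stand-ins for the two 48 x 116 tables (NOT the real data).
# With the real files, each would presumably be read with something like
# as.matrix(read.table(<path>)) so both keep the same shape.
observed <- matrix(c(12, 18, 25, 15,  9, 21), nrow = 2, byrow = TRUE)
expected <- matrix(c(10, 20, 25, 15, 10, 20), nrow = 2, byrow = TRUE)

# The two tables must line up cell by cell, and no expected value may be zero.
stopifnot(all(dim(observed) == dim(expected)), all(expected > 0))

# Pearson statistic: sum over all cells of (O - E)^2 / E.
x_squared <- sum((observed - expected)^2 / expected)

# ASSUMPTION: the expected values were fixed in advance and have the same
# total as the observed values, so df = number of cells - 1.  If the expected
# values were estimated from the observed data, df would be smaller.
df <- length(observed) - 1
p_value <- pchisq(x_squared, df = df, lower.tail = FALSE)

# Cross-check with chisq.test() in goodness-of-fit form: a plain vector of
# counts plus the expected proportions passed as p.
gof <- chisq.test(as.vector(observed),
                  p = as.vector(expected) / sum(expected))

print(c(x_squared = x_squared, df = df, p_value = p_value))
print(gof)
```

The hand computation and the goodness-of-fit call agree only when the
expected values sum to the same total as the observed values.  If the totals
differ, passing proportions to p rescales the expected values implicitly.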