Displaying 20 results from an estimated 1000 matches similar to: "data import problem"
2006 Sep 15
3
graphics and 'layout' question
Hello,
I got stuck with a graphics question: I've 3 figures that I present on a single page (window) via 'layout'. The layout is
layout(matrix(c(1,1,2,3), 2, 2, byrow=TRUE));
so that the first plot spans both columns in row one. Now I'd like to magnify the first figure so that it takes 20% more vertical space (i.e. more space for the y-axis). How would I do this in R?
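One possible approach, not taken from the thread itself: layout() accepts a heights argument that sets the relative row heights. The sketch below assumes that "20% more vertical space" means the top row should be 1.2 times as tall as the bottom row; the placeholder plots and the names m, heights and n_fig are illustrative only.

```r
# Layout matrix from the question: figure 1 spans both columns of row one.
m <- matrix(c(1, 1, 2, 3), nrow = 2, ncol = 2, byrow = TRUE)

# Relative row heights (assumption: top row 20% taller than the bottom row).
heights <- c(1.2, 1)

# layout() returns the number of figures it has set up.
n_fig <- layout(m, heights = heights)

# Placeholder plots, only to make the effect visible.
plot(1:10, main = "Figure 1 (spans both columns)")
plot(1:10, main = "Figure 2")
plot(1:10, main = "Figure 3")
```

Calling layout.show(n_fig) right after layout() draws the outlines of the regions, which is a quick way to check the proportions before plotting.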
2005 Jul 21
4
RandomForest question
Hello,
I'm trying to find out the optimal number of splits (mtry parameter) for a randomForest classification. The classification is binary and there are 32 explanatory variables (mostly factors with up to 4 levels each, but also some numeric variables) and 575 cases.
I've seen that although there are only 32 explanatory variables the best classification performance is reached when
2006 Nov 01
4
splitting very long character string
Hello,
I've a very long character array (>500k characters) that I need to split by '\n', resulting in an array of about 60k numbers. The help on strsplit says to use perl=TRUE to get better performance, but it still takes several minutes to split this string.
The massive string is the return value of a call to xmlElementsByTagName from the XML library and looks like this:
...
12345
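A hedged sketch, not from the thread: when the separator is a literal string rather than a regular expression, strsplit() can be called with fixed = TRUE, which skips regular-expression matching and is often faster. The input x below is a small made-up stand-in for the real string returned by the XML call.

```r
# Made-up stand-in for the long newline-separated string (assumption).
x <- paste(seq_len(60000), collapse = "\n")

# Split on the literal newline; fixed = TRUE avoids regex matching.
parts <- strsplit(x, "\n", fixed = TRUE)[[1]]

# Convert the pieces to numbers.
nums <- as.numeric(parts)
```

Whether this removes the bottleneck depends on where the time is actually spent, so timing both variants with system.time() on the real data is worthwhile.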
2005 Jun 28
2
svm and scaling input
Dear All,
I've a question about scaling the input variables for an analysis with svm (package e1071). Most of my variables are factors with 4 to 6 levels but there are also some numeric variables.
I'm not familiar with the math behind svms, so my assumptions may be completely wrong ... or obvious. Will the svm automatically expand the factors into a binary matrix? If I add numeric
2006 Feb 02
2
calculating IC50
Hello,
I was wondering if there is an R-package to automatically calculate the IC50 value (concentration of a substance that inhibits cell growth to 50%) for some measurements.
kind regards,
Arne
2006 Jan 05
4
Q: R 2.2.1: Memory Management Issues?
Dear Developers:
I have a question about memory management in R 2.2.1 and am wondering if you would be kind enough to help me understand what is going on.
(It has been a few years since I have done software development on Windows, so I apologize in advance if these are easy questions.)
-------------
MY SYSTEM
-------------
I am currently using R (version 2.2.1) on a PC running Windows 2000
2005 May 23
2
Trouble with drplot
Hi, I am a newbie with R, so I hope my question isn't too stupid. I am trying to generate dose-response curves using the "drfit" package. I have formatted my CSV files to the correct format, and have no trouble running drfit to get a summary of my data. The problem is that when I try to use "drplot" to graph my data I get an error. The message is:
Error in
2006 Sep 29
3
if then else
What is the correct way to write a statement meaning:
if (a==1) {b=2; c=3}; else {b=0; c=0};
Thank you
Jue Wang, Biostatistician
Contracted Position for Preclinical & Research Biostatistics
PrO Unlimited
(908) 231-3022
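A minimal sketch of one valid form, wrapped in a hypothetical helper set_bc() so that both branches can be exercised. In R the else must follow the closing brace of the if block on the same line when the statement is entered at top level, and no semicolon may come between the closing brace and else.

```r
# Hypothetical helper (not from the post): returns b and c for a given a.
set_bc <- function(a) {
  if (a == 1) {
    b <- 2
    c <- 3
  } else {
    b <- 0
    c <- 0
  }
  list(b = b, c = c)
}

res1 <- set_bc(1)  # takes the if branch
res0 <- set_bc(5)  # takes the else branch
```

Note that if () expects a single logical value; for element-wise decisions on a vector, ifelse() is the usual alternative.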
2005 Mar 31
2
pictex graphics device and color
Is the pictex graphics device known not to support color?
In R 2.0.1 Patched (2004-11-17), it produced very pretty output:
## pictex(file = "modern-metatheonomy.tex", bg = "transparent");
## plotData(data);
## dev.off()
... but it appears to have ignored all color information.
__________
Steve Rowley <steve.rowley at sanofi-aventis.com> Cambridge, MA: (617) 768-4054
2007 Sep 05
1
question: randomization t-test function already defined in R?
Dear R Users,
I am hoping you can help me.
I have received code from a colleague who uses Matlab. I need to
translate it into R.
I am wondering if there is a randomization t-test (from non-parametric
statistics) function already defined in R.
(In Matlab the function is randtest.m.)
************************************************************************
**********
QUESTION: Is anyone
2006 Oct 13
1
side by side plot of Histogram and densityplot
Using "par", it seems easy to put a "hist" and a density plot side by side in the same output window.
I would like to use some features in "histogram" from Lattice, but how can I put
"histogram" and "densityplot" side by side on the same graph?
Thank you
par(mfrow=c(1,2))  # one row, two columns: the two plots sit side by side
hist(y)
plot(density(y))
Jue Wang, Biostatistician
Contracted Position for
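One way this is commonly done, offered as a sketch rather than as the answer given in the thread: lattice functions return trellis objects, and print() for such objects takes split and more arguments that position each object on the page. The data y below is made up for illustration.

```r
library(lattice)

set.seed(42)
y <- rnorm(200)  # placeholder data (assumption)

# Build the two trellis objects without drawing them yet.
h <- histogram(~ y)
d <- densityplot(~ y)

# split = c(column, row, number of columns, number of rows)
print(h, split = c(1, 1, 2, 1), more = TRUE)  # left half, more to come
print(d, split = c(2, 1, 2, 1))               # right half
```

par(mfrow = ...) does not affect lattice output, which is why the placement goes through print() here.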
2006 Oct 12
1
Draw a circle at the end of a line
I have a plot of a cumulative distribution function, which is a step function.
I'd like to put a circle at the right end of each line to indicate that the value there is not included in this line.
How can I do that?
Thank you.
cdf<-function(x){
do.call("rbind",lapply(1:nrow(as.matrix(x)), function(i){
a<-x[i]
if (a<0.5){b=0.1}
else if (a<1){b=0.3}
else if
2007 Mar 20
1
grid on a wireframe plot
Hello,
I want to do a surface plot with wireframe from the lattice package.
As for now I use the following command :
print(wireframe(m,main="% my title", colorkey=TRUE,
col.regions=rainbow(100), drape=TRUE,aspect =
c(1,1.0),ylab="y",xlab="x",zlab="z",
scales=list(arrows=FALSE)),split=c(1,1,3,2), more=TRUE)
How can I show the gridlines in a 3D surface
2007 Aug 02
2
two plots super-impose
Dear R users,
I used plot(x,y) to draw a scatter plot. I need to add another smaller
histogram on the same plot I just had. However when I used hist(y), the
histogram was on another page.
Any suggestion?
Best Regards,
Philip
2005 Oct 21
1
finite mixture model (2-component gaussian): plotting component gaussian components?
Dear Knowledgeable R Community Members,
Please excuse my ignorance, I apologize in advance if this is an easy question, but I am a bit stumped and could use a little guidance.
I have a finite mixture modeling problem -- for example, a 2-component gaussian mixture -- where the components have a large overlap, and
I am trying to use the "mclust" package to solve this problem.
I need
2005 May 13
1
error in plot.lmList
Hello,
in R-2.1.0 I'm trying to produce trellis plots from an lmList object as described in the help for plot.lmList. I can generate the plots from the help, but on my own data plotting fails with an error message that I cannot interpret (please see below). Any hints are greatly appreciated.
kind regards,
Arne
> dim(d)
[1] 575 4
> d[1:3,]
Level_of_Expression SSPos1 SSPos19
2005 Jun 30
1
randomForest error
Hello,
I'm using the random forest package. One of my factors in the data set contains 41 levels (I can't code this as a numeric value - in terms of linear models this would be a random factor). The randomForest call comes back with an error telling me that the limit is 32 categories.
Is there any reason for this particular limit? Maybe it's possible to recompile the module with a
2005 Jul 07
2
randomForest
> From: Weiwei Shi
>
> it works.
> thanks,
>
> but: (just curious)
> why i tried previously and i got
>
> > is.vector(sample.size)
> [1] TRUE
Because a list is also a vector:
> a <- c(list(1), list(2))
> a
[[1]]
[1] 1
[[2]]
[1] 2
> is.vector(a)
[1] TRUE
> is.numeric(a)
[1] FALSE
Actually, the way I initialize a list of known length is by
2008 Mar 21
4
difference between 2 ecdfs
hi,
a) i have something like:
ecdfgrp1<-ecdf(subset(mydata,TMT_GRP==1)$Y);
ecdfgrp2<-ecdf(subset(mydata,TMT_GRP==2)$Y);
how can i plot the difference between these 2 step functions?
i could begin with ecdfrefl<-function(x){ecdfgrp2(x)-ecdfgrp1(x);} ...
what next?
b) if i have a vector with repeated numeric values how can i get the
subset without repeated values, e.g. (0,4,0,2,2)
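A hedged sketch for both parts, using made-up data in place of mydata. For (a), the difference of two ECDFs only changes at observed values, so evaluating both functions at the sorted unique observations and plotting with type = "s" gives the step function. For (b), unique() is assumed to be what is wanted, since the example in the question is cut off.

```r
set.seed(1)
# Made-up stand-in for mydata (assumption).
mydata <- data.frame(TMT_GRP = rep(1:2, each = 50),
                     Y = c(rnorm(50), rnorm(50, mean = 0.5)))

ecdfgrp1 <- ecdf(subset(mydata, TMT_GRP == 1)$Y)
ecdfgrp2 <- ecdf(subset(mydata, TMT_GRP == 2)$Y)

# (a) Evaluate both ECDFs at every observed value and plot the difference.
xs <- sort(unique(mydata$Y))
d <- ecdfgrp2(xs) - ecdfgrp1(xs)
plot(xs, d, type = "s", xlab = "Y", ylab = "ECDF2 - ECDF1")
abline(h = 0, lty = 2)

# (b) Drop repeated values, keeping the first occurrence of each.
u <- unique(c(0, 4, 0, 2, 2))
```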
2005 Jul 01
1
p-values for classification
Dear All,
I'm classifying some data with various methods (binary classification). I'm interpreting the results via a confusion matrix from which I calculate the sensitivity and the fdr. The classifiers are trained on 575 data points and my test set has 50 data points.
I'd like to calculate p-values for obtaining <=fdr and >=sensitivity for each classifier. I was thinking about