On Jun 17, 2010, at 4:46 PM, weller wrote:
>
> I have a csv file that has approximately 50k rows. In the first value of
> each row, a file name is listed, and there are 162 different file names. At
> the end of each row, there is a number value. What I would like to be able
> to do for the 162 different files (or we could call them categories) is
> compute the ecdf for the values within that category, then plot the ecdf
> for each file on the same graph. Essentially, it would look something like
> http://www-stat.stanford.edu/~jtaylo/courses/stats202/R/chap3_data_exploration/iris_ecdf.png
> but instead of the 3 lines, it would show 162. They don't have to be
> different colors, and the number of records in each file category changes.
> I was considering using a matrix and adding to it via a loop, but couldn't
> quite get it to work. This is what I have right now.
>
> thwop <- read.csv("real_unmod_estimated_pI.csv", header=TRUE)
> filelist <- levels(thwop$Source)
> rig=matrix(nrows=162)
>
> for (i in filelist)
> {
> thug <- subset(thwop, == i)
> rig[i,length(ecdf(thug$Estimated.pI))]=ecdf(thug$Source)
> }
> rig
Perhaps:
rng <- range(thwop$Estimated.pI, na.rm=TRUE)   # shared x-range; column name as in your code
plot(rng, c(0.0, 1.0), type="n", xlab="", ylab="")   # empty frame with axes
invisible(tapply(thwop$Estimated.pI, thwop$Source, function(x) { par(new=TRUE) ;
   # overlay each ecdf on the same coordinates without redrawing axes or titles
   plot(ecdf(x), verticals=TRUE, xlim=rng, ylim=c(0.0, 1.0), axes=FALSE,
        main="", xlab="", ylab="") }))
(With 162 curves on one plot, it is going to be a mess.)
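A minimal self-contained sketch of the same idea, using simulated data in place of your csv. The five groups and their values are made up for illustration; only the column names Source and Estimated.pI come from your post. It builds one ecdf per category with split() and adds each to a shared frame through the add=TRUE argument of the stepfun plot method, which avoids par(new=TRUE):

```r
# Simulated stand-in for the real data (hypothetical values and group sizes).
set.seed(1)
thwop <- data.frame(
  Source = factor(rep(paste0("file", 1:5), times = c(30, 50, 20, 40, 60))),
  Estimated.pI = rnorm(200, mean = 7, sd = 1.5)
)

# One ecdf per category; split() copes with unequal group sizes.
ecdfs <- lapply(split(thwop$Estimated.pI, thwop$Source), ecdf)

# Empty frame spanning all values, then add each step function to it.
rng <- range(thwop$Estimated.pI)
plot(rng, c(0, 1), type = "n", xlab = "Estimated pI", ylab = "Fn(x)")
for (nm in names(ecdfs)) {
  plot(ecdfs[[nm]], add = TRUE, verticals = TRUE, do.points = FALSE)
}
```

The list ecdfs also keeps the fitted functions around, so something like ecdfs[["file1"]](7) gives the proportion of that category's values at or below 7.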
>
> Any help would be appreciated
>
David Winsemius, MD
West Hartford, CT