Displaying 2 results from an estimated 2 matches for "mybigdata".
2008 Oct 19
3
pairs plots in R
Hi,
is there a way to take a data frame with 100+ columns and large data set to do efficient exploratory analysis in R with pairs?
I find using pairs on the whole matrix is slow and the resulting matrix is tiny.
Also the variable of interest for me is a binary var Y or N .
Is there an efficient way to graphically view many variable relationships that does not look teeny ?
I could do
2010 Jun 30
2
anyone know why package "RandomForest" na.roughfix is so slow??
Hi all,
I am using the package "random forest" for random forest predictions. I
like the package. However, I have fairly large data sets, and it can often
take *hours* just to go through the "na.roughfix" call, which simply goes
through and cleans up any NA values to either the median (numerical data) or
the most frequent occurrence (factors).
I am going to start