Displaying 2 results from an estimated 2 matches for "mybigdata".
2008 Oct 19
3
pairs plots in R
Hi,
  Is there a way to do efficient exploratory analysis with pairs in R on a large data set held in a data frame with 100+ columns?
 
I find that calling pairs on the whole data frame is slow, and each panel in the resulting plot matrix is tiny.
 
Also, the variable of interest for me is a binary variable (Y or N).
 
Is there an efficient way to view many variable relationships graphically without the panels looking teeny?
 
I could do
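
One possible way to approach the question above, as a rough sketch only: plot a random sample of rows, split the columns into small blocks so each pairs call stays readable, and colour the points by the binary variable. The data frame `df`, the column name `y`, the sample size of 500, and the block size of 5 are all assumptions made for illustration; none of them come from the post.

```r
# Hypothetical data: 'df' and the response column 'y' are illustrative only.
set.seed(1)
n <- 5000
df <- as.data.frame(matrix(rnorm(n * 20), ncol = 20))
df$y <- factor(sample(c("Y", "N"), n, replace = TRUE))

# Draw only a random sample of rows so each plot renders quickly.
rows <- sample(nrow(df), 500)

# Split the predictor names into blocks of 5 columns each.
predictors <- setdiff(names(df), "y")
blocks <- split(predictors, ceiling(seq_along(predictors) / 5))

# One pairs plot per block; colour follows the factor level order (N, then Y).
cols <- c("red", "blue")[as.integer(df$y[rows])]
for (b in blocks) {
  pairs(df[rows, b], col = cols, pch = ".")
}
```

This only shows relationships within each block, so pairs of variables that fall in different blocks are not plotted; whether that trade-off is acceptable depends on the data.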
2010 Jun 30
2
anyone know why package "randomForest" na.roughfix is so slow??
Hi all,
    I am using the package "randomForest" for random forest predictions.  I
like the package.  However, I have fairly large data sets, and it can often
take *hours* just to get through the "na.roughfix" call, which simply goes
through and replaces any NA values with either the median (numerical data) or
the most frequent value (factors).
    I am going to start
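
For reference, the imputation described above (median for numeric columns, most frequent value for factors) can be sketched column by column in base R. The function name `fast_roughfix` and the small example data frame are hypothetical and are not part of the randomForest package; this is not a claim about how na.roughfix is implemented, and whether it is actually faster on a given data set would need to be timed.

```r
# Hypothetical helper, not part of the randomForest package.
# Fills NAs column by column: median for numeric columns,
# most frequent level for factor columns.
fast_roughfix <- function(df) {
  for (j in seq_along(df)) {
    x <- df[[j]]
    miss <- is.na(x)
    if (!any(miss)) next                    # nothing to fill in this column
    if (is.numeric(x)) {
      x[miss] <- median(x, na.rm = TRUE)
    } else if (is.factor(x)) {
      counts <- table(x)                    # table() ignores NA by default
      x[miss] <- names(counts)[which.max(counts)]  # ties: first level wins
    }
    df[[j]] <- x
  }
  df
}

# Small made-up example.
d <- data.frame(a = c(1, NA, 3, 10),
                b = factor(c("u", "v", NA, "v")))
fixed <- fast_roughfix(d)
```

Columns that are neither numeric nor factor are left unchanged in this sketch.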