Supriya Jain
2014-Jul-01 22:46 UTC
[R] Data visualization: overlay columns of train/test/validation datasets
Hello, Given two different datasets (having the same number and type of columns, but different observations, as commonly encountered in data-mining as train/test/validation datasets), is it possible to overlay plots (histograms) and compare the different attributes from the separate datasets, in order to check how similar the different datasets are? Is there a package available for such plotting together of similar columns from different datasets? Thanks, SJ [[alternative HTML version deleted]]
David Winsemius
2014-Jul-01 23:42 UTC
[R] Data visualization: overlay columns of train/test/validation datasets
On Jul 1, 2014, at 3:46 PM, Supriya Jain wrote:> Hello, > > Given two different datasets (having the same number and type of columns, > but different observations, as commonly encountered in data-mining as > train/test/validation datasets), is it possible to overlay plots > (histograms) and compare the different attributes from the separate > datasets, in order to check how similar the different datasets are? > > Is there a package available for such plotting together of similar columns > from different datasets?Possible. Assuming you just want frequency histograms (or ones using counts for that matter) it can be done in any of the three major plotting paradigms supported in R. No extra packages needed if using just base graphics.> > Thanks, > SJ > > [[alternative HTML version deleted]]Oh, you must have missed the parts of the Posign Guide where plain text was requyested. See below.> PLEASE do read the posting guide http://www.R-project.org/posting-guide.htmlAnd you missed that section, as well.> and provide commented, minimal, self-contained, reproducible code.-- David Winsemius Alameda, CA, USA