Displaying 2 results from an estimated 2 matches for "dataguru".
Did you mean:
dattaguru
2009 Dec 16
0
R graphics
Graphics about...
Bayesian
ChemPhys
Cluster
Distributions
Econometrics
Environmetrics
ExperimentalDesign
Finance
Genetics
gR
Graphics
HighPerformanceComputing
MachineLearning
MedicalImaging
Multivariate
NaturalLanguageProcessing
Optimization
Pharmacokinetics
Psychometrics
Robust
SocialSciences
Spatial
Survival
TimeSeries
Other
URL: http://bm2.genes.nig.ac.jp/RGM2/index.php?clear=all
--
Share
2009 Sep 27
1
Clustering with R - efficient processing of large sparse data sets (text data)
I checked the R procedure HCLUST (hierarchical clustering) but it
looks like it requires a full triangular n x n similarity matrix as
input, where n = number of observations. The number of variables is
200.
My data set has n = 50,000 observations (keywords), and I use ad-hoc
similarity measures, not available in R, to measure keyword
similarity. Here, the vast majority of the n x n similarities