Hello,

Suppose the problem is to cluster each document in a set of 'n' documents (.csv files) separately, using some clustering method. If 'n' is very large, is there a way to improve speed and efficiency by treating the whole set as a single document vector, or as a set of document vectors?

Thanks

--
A. Mani
Member, Cal. Math. Soc