search for: clusterset

Displaying 4 results from an estimated 4 matches for "clusterset".

2016 May 05
2
GSoC 2016 - Introduction
Hello, Thanks James for the reply. That cleared a few things out. Apologies for replying late because of exams going on. I was going through the previous clustering API to understand how it worked and it seems like the the approach for construction of the termlists which are used for distance metrics use TF-IDF weighting with cosine similarity, which is very similar to the approach I would need
2017 Mar 09
2
GSoC 2017 Project Proposal
...Analysis I had roughly implemented 2 evaluation techniques previously (Distance b/w document and centroids within clusters and Silhouette coefficient) but I hadn't implemented them within Xapian, and thus it wasn't possible to automate the process of evaluating the clustering results in the ClusterSet. It is thus important to implement cluster evaluation techniques within a module (as a ClusterEvaluation class) so that users can get output on how they can improve their clustering by passing in the ClusterSet (and the labels if necessary). The cluster evaluation techniques that I would like to c...
2016 Aug 18
3
KMeans - Evaluation Results
> > > > Actually, you're doing something slightly unusual there: making the > internal member public. Protected would be better, and private is I think > most usual; library clients aren't going to have access to the Internal > class declaration, so they can't call things on it. This means it's > actually difficult right now to subclass Feature. > > I
2016 Aug 19
2
KMeans - Evaluation Results
...of an equality / assignment error, and under gcc because there are warnings about members shadowed by local variables. You should have got one or the other while compiling yourself; what version or which compiler are you using? > Currently, the main classes which have data to hide are Cluster, ClusterSet and Clusterer subclass (currently KMeans). Thus, if we can use PIMPL with these classes, it could hide quite a lot of the non-public data. As a plus point, I'm having problems with shifting PointType, Point and Centroid classes to a private header because of forward referencing problems, so eve...