Reetesh Ranjan
2014-Mar-21 18:29 UTC
[Xapian-devel] Xapian-devel Digest, Vol 107, Issue 31
Hello Everyone, I have uploaded my proposal on weighting scheme. https://www.google-melange.com/gsoc/proposal/review/student/google/gsoc2014/reetu11/5707702298738688 Your comments will prove fruitful to me, so please kindly review it and let me know if any change is required. Kind Regrads, Reetesh Reetesh Ranjan Junior Undergraduate IIT(BHU), Varanasi contact no: +917275115929 Skype : reetesh.ranjan11 On Fri, Mar 21, 2014 at 5:30 PM, <xapian-devel-request at lists.xapian.org>wrote:> Send Xapian-devel mailing list submissions to > xapian-devel at lists.xapian.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://lists.xapian.org/mailman/listinfo/xapian-devel > or, via email, send a message with subject or body 'help' to > xapian-devel-request at lists.xapian.org > > You can reach the person managing the list at > xapian-devel-owner at lists.xapian.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of Xapian-devel digest..." > > > Today's Topics: > > 1. GSoC 2014: Clustering of Search Results (Abhishek Gupta) > 2. Proposal update (Anish Kanchan) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Fri, 21 Mar 2014 01:37:31 +0530 > From: Abhishek Gupta <a.gupps at gmail.com> > To: xapian-devel at lists.xapian.org > Subject: [Xapian-devel] GSoC 2014: Clustering of Search Results > Message-ID: > <CAPfSG7gVoO9Lj94V=bPTWceFeP40JiwBAQ2uZ> 6-fXCKe4nJ+A at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Sir, > > I am Abhishek Gupta. I know I am quite late for the project discussion > because I came to know about GSoC a bit lately but still I would like to > discuss this project which interests me a lot. I know I have to submit some > code so as to show my skill set but as the deadline is quite near I will > submit the patches or exercises after the deadline to strengthen my > application and show my coding skill. > I read your existing source code for the clustering which is quite slow > because of the hierarchical based clustering which is not required at > all.*You > have already provided with the number of clusters you should have in the > end*. So for this we can employ K-means algorithm which can perform far > better than the current algorithm. > > 1) Hierarchical clustering have high memory requirements *O(n*n)* in > comparison to *O(n+K)* space complexity of K-means algorithm, where*n* is > the number of elements and* K *is the number of clusters. > 2) Hierarchical clustering running time is *O(n*n*n)* on the other hand > K-means algorithm has time complexity of *O(n)*. > 3) K-means improves the clustering iteratively, more you run the code more > better you will get the results. > > One thing that K-means lacks is its non-deterministic outcome. Every time > it will produce different clusters. But we can always run the algorithm > 10-12 times and then take the average even then it will perform far better > than the hierarchical one. > > So I would like to propose this algorithm which can perform better than the > hierarchical one. After that to improve the clustering more we can also > implement K-medoids/K-means++ clustering methods. > > I would you give some reviews regarding the proposal, so that I can submit > the proposal at time. > > Thanks and Regards > Abhishek Gupta > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < > http://lists.xapian.org/pipermail/xapian-devel/attachments/20140321/7bda8301/attachment-0001.html > > > > ------------------------------ > > Message: 2 > Date: Fri, 21 Mar 2014 09:59:06 +0530 > From: Anish Kanchan <anishgkanchan at gmail.com> > To: Xapian Development <xapian-devel at lists.xapian.org> > Subject: [Xapian-devel] Proposal update > Message-ID: > <CAJiBCaNXA77qcmcmciidGEGdXF6d> JvL3JK1ADiHzLtSjvALWg at mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Hello, > > I have updated my proposal as per Olly's suggestion. Kindly review it and > let me know if any changes are to be made. > > Thanks and Regards, > Anish Kanchan > Student, University of Mumbai > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < > http://lists.xapian.org/pipermail/xapian-devel/attachments/20140321/347d3b44/attachment-0001.html > > > > ------------------------------ > > Subject: Digest Footer > > _______________________________________________ > Xapian-devel mailing list > Xapian-devel at lists.xapian.org > http://lists.xapian.org/mailman/listinfo/xapian-devel > > > ------------------------------ > > End of Xapian-devel Digest, Vol 107, Issue 31 > ********************************************* >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20140321/19164a7c/attachment-0002.html>