similar to: Contribute in the Xapian project throgh GSoC.

Displaying 20 results from an estimated 2000 matches similar to: "Contribute in the Xapian project throgh GSoC."

2014 Mar 18
2
Considering Parallel computing for Letor
Hi everyone, My name is Shreedhar Pawar. I have already introduced myself on Xapian-discuss... I feel that the Xapian Search/Letor Algorithm, can speed up using Parallel computing. Techniques like Map-reduce, 'compact n split', radix sort, scan, parallel hashing n much more can be used to speed up the learning algorithms as well as the search... support vector machines in the Letor
2012 Apr 01
1
[GSoC2012] Learning to Rank: few thoughts/issues
Hello, I would like to work with Orange as part of GSoC 2012(and continue henceforth). Apologies for joining in a bit late- i was waiting to get a proper grasp of things before discussing it here. Currently I am a Masters students in Mathematics with my bachelors in Computer Science[integrated dual degree]. Over the last year and a half, I have worked on a few ML projects and have a couple of
2012 Mar 24
3
Learning to rank
Dear Sir, I am Pankaj Singhal from Jaipur, India. I am very much interested and strongly looking forward in getting involved in this project Learning-to-Rank. My previous experience in this field is good. Last semester I did a similar job of ranking the URLs of the given huge dataset based on their attribute values. The dataset consisted hundreds of thousands of URLs and each url
2014 Mar 04
4
Questions on letor module
Hi, I have several questions regarding the letor module,I looked at the framework of learning to rank in xapian http://rishabhmehrotra.com/gsoc/17.png, I am a little confused. Why using deep learning to find unsupervised features in test data? Since in my understanding, learning to rank model usually learn features from the training data then apply the model to the test data? Why test set and
2014 Mar 11
3
Proposal Outline
Hi, Before starting my proposal, I wanted to know what is the expected output of Letor module. Is it for transfer learning (i.e you learn from one dataset and leverage it to predict the rankings of other dataset) or is it for supervised learning? For instance - Xapian currently powers the Gmane search which is by default based on BM25 weighting scheme and now suppose we want to use LETOR to rank
2014 Mar 22
2
[GSOC 2014] Indexing INEX dataset
For unsupervised approaches like BM25 this approach works well but letor does not need special weighting for title in this form as it itself assigns weights to title features separately. But I see your concern it would be a problem when BM25 is used on the index with this setup. Hence its preferable to take a note of this uplift in title weight for xapian-letor and normalize it everywhere
2014 Mar 17
2
[GSOC 2014] Indexing INEX dataset
Hi Olly, Wouldn't setting the weight of terms in title back to normal (e.g. 5 to 1) by below line, automatically adjust the wdfs and field lengths? indexer.index_text(title, 5, "S"); -> indexer.index_text(title, 1, "S"); if it does not then we should include that part in the patch too. I like to create a patch for xapian-letor for resolving common code of xapian.
2011 Apr 01
2
New Idea on Ranking in IR
Hello, I want to discuss my idea on ranking in IR system which I think can be good extension to Xapian. If I am not too late to discuss it then please consider it. I first give you brief background of me, I am a Masters student working on my thesis in the Information Retrieval. I today only got a mail from one of the professor from Europe whom i am going to join for Ph.D about GSoC and more
2012 Dec 08
2
Want to contribute code to the Xapian project
Hey guys,I am a 3rd year Computer Science undergrad student.I a extremely interested in contributing code to the XAPIAN project. The work you people do sounds extremely fascinating and interesting.Can someone just give me a brief overview of how to proceed ?. I Can code in C,C++ and Python and have experience in Natural Lanuage Processing.Am also quite comfortable with NLTK and using Wordnet.Am
2010 May 26
3
cluster analysis and supervised classification: an alternative to knn1?
Hi, I have a 1.000 observations with 10 attributes (of different types: numeric, dicotomic, categorical ecc..) and a measure M. I need to cluster these observations in order to assign a new observation (with the same 10 attributes but not the measure) to a cluster. I want to calculate for the new observation a measure as the average of the meausures M of the observations in the cluster
2014 Dec 05
3
Contributing to Xapian
On Thu, Dec 04, 2014 at 08:28:32PM -0500, Manu Gupta wrote: > I was trying to build Han's code and was stuck here. It seems to me that > his code is trying to generate pdf for them. > > Is it possible to modify the make files to stop building the documentation. You can run the top-level configure with --disable-documentation. Cheers, Olly
2014 Dec 07
2
Contributing to Xapian
I use Ubuntu12.04, GNU Make 3.81 and I am not sure if I use some special configure options. The commands I use after clone the repo are as follows: cd xapian ./bootstarp ./configure make cd xapian-letor aclocal -I ../xapian-core/m4 libtoolize --force autoconf autoheader automake --add-missing ./configure make Is this normal? 2014-12-07 22:37 GMT+08:00 James Aylett <james-xapian at
2014 Dec 08
2
Contributing to Xapian
Hi I am on Debian verson 7.7, GCC 4.7.2 and make 3.8.1. I cloned the code : git clone https://github.com/v-hasu/xapian git checkout gsoc2014-evaluation rm xapian-letor/nobootstrap ./bootstrap ./configure --no-documentation make CXXFLAGS='-w' and I get a different set of errors --- ranker.cc: In member function 'std::string Xapian::Ranker::get_cwd()': ranker.cc:80:36: error:
2014 Dec 06
2
Contributing to Xapian
Hi~ Manu I was glad to see you were interested in Xapian, especially the learning to rank project. There are two branch in my repo. The one named gsoc2014 is followed the thought of Parth's work in which we not assign the test set for each ranking and in another one gsoc2014-evaluation we assign the training set and test set explicitly so that we can get the evaluation score of each ranking
2007 Nov 22
1
Cluster Analysis:build a classifier?
Dear All, I'm currently doing a project about unsupervised learning, and I'll be using R to analyse a few network traffic datasets downloaded off Andrew Moore's website (http://www.cl.cam.ac.uk/Research/SRG/netos/nprobe/data/papers/sigmetrics/index.html). Could anyone shed some light on how to build a classifier from the training set A, and then use a validation set B to assess the
2007 Jan 11
2
Account is not authorized to log in from this station
Hello All, When I try to connect to the samba installed linux machine from windows I get the message "Account is not authorized to log in from this station". I have made three entries(user name and password) in the smbpasswd file. But if I log in the windows machine using these user names and passwords I get the message "Account is not authorized to log in from this station".
2018 Feb 14
2
GSOC Proposal
Hey! I am Mehak, final year CSE student at IIT, India. I am keen on the project: Learning to Rank Stabilisation (Letor). I completed a course in Information Retrieval, passing with the highest grade at the university. I am proficient with C++. Also, I have research ML projects to my credit. I feel I fit the requirements for the project. I am exploring the project specific page of Letor. I
2008 Aug 05
0
[LLVMdev] Contributing to LLVM
On Aug 4, 2008, at 10:29 PM, Shrutarshi Basu wrote: > Hello all, > I'm a rising sophomore CS/Engineering student in the US and I'm > interested in programming languages/ compilers.I think the LLVM would > be a great project for me to contribute to, both to learn something > and help the project. Unfortunately, I don't know much about > compilers, much less anything
2010 May 05
5
Dynamic clustering?
Are there R packages that allow for dynamic clustering, i.e. where the number of clusters are not predefined? I have a list of numbers that falls in either 2 or just 1 cluster. Here an example of one that should be clustered into two clusters: two <- c(1,2,3,2,3,1,2,3,400,300,400) and here one that only contains one cluster and would therefore not need to be clustered at all. one <-
2014 Mar 10
2
A few more question about LETOR
1.Could you explain why are these libraries included in all the xapian-letor headers? #include<xapian/intrusive_ptr.h> #include<xapian/types.h> #include<xapian/visibility.h> Or just provide me with the documentation of these header. I looked into these header files but couldn't anything substantial. 2.