Displaying 20 results from an estimated 3000 matches similar to: "Modify Eset to allow specification of weighing scheme"
2013 Feb 07
0
Ideas for allowing specification of weighing scheme for Eset
Hey guys, hi :) I am working on a hack which will allow the user to
specify a weighting scheme (along with its parameters, if he does not
want to use the default values) to build the Eset, rather than using the
hard-coded TradWeight scheme with default k=1, as Olly had suggested
that we can probably get better terms (a more relevant Eset) for query
expansion if we use say something
2013 Jan 09
2
Explanation of how Eset works
Hey guys, hi. I am trying to understand how Xapian works. I read the
Theoretical Background to Xapian doc
and the report by Salton and Jones. I still can't seem to understand how Eset
works. How exactly does Xapian add terms to expand a query? Assuming we
have a list of the k most important terms, how do we decide which term to
add to the query, and whether it will be in context with the query?
And to decide r
2013 Mar 27
1
Need help as Pl2 tests not performing as expected
Hello guys. I just ran the updated tests for PL2 and they are not giving
the mset order I expect. Now, the thing is, DFR's behavior is a bit hard to
predict, so even if I expect a particular order, it may give another
order and still be correct. So the only way to write correct tests for PL2
is to manually calculate the weight of the documents to decide the expected
order. For that, I need to
2013 Mar 03
0
Added code and tests for the tf-idf weighting scheme.
Hello guys. I have sent a pull request for the code and tests of the Tf-Idf
weighting scheme.
Please do let me know if any changes are required. Meanwhile, I'll begin
working on implementing normalizations which require additional statistics
and on the DFR schemes.
https://github.com/xapian/xapian/pull/6
2018 Nov 12
0
Fwd: GoXapian
Hi Olly,
I set up the environment for the golang branch and built it without
problems. Cgo complains about the generated cc file and doesn't recognize it as
a SWIG-generated file... curious. There are no more problems with the MSet and
ESet, and all classes seem to be generated in Go correctly. I will try
tomorrow to manually invoke swig over your .i file and see if the code is
generated properly. The file
2014 Apr 13
2
Adding an external library to Xapian
We are using the --enable-maintainer-mode and will move to git soon.
The diff file is attached.
*Siddhant Mutha*
Undergraduate Student
Department of Computer Science and Engineering
IIT Madras
Chennai
http://www.siddhantmutha.com/
On Sun, Apr 13, 2014 at 8:26 PM, James Aylett <james-xapian at tartarus.org> wrote:
> On 13 Apr 2014, at 15:48, Pallavi
2018 Nov 11
2
Fwd: GoXapian
On Mon, Nov 05, 2018 at 09:04:23PM +0000, Olly Betts wrote:
> Dave Courtois wrote:
> > Hi there, just a little word to tell you that Xapian 1.4.6 works in
> > Golang 1.9. There are some issues with the MSet and ESet: the namespace
> > part of the name is dropped by SWIG. There are also some errors with define
> > macros in .i files, but with minor changes all works perfectly.
I had a
2013 Mar 03
0
Sent a pull request for testing TradWeight using an Rset.
Hello guys. As discussed on IRC, I have sent a pull request for a test for
testing TradWeight with an Rset.
2014 May 14
2
Starting work on Perf Test Module
Hello,
I am beginning work on the perf test module. The initial steps that I aim
to accomplish are :-
-> Download the Wikipedia dumps for multiple languages.
-> Write Python scripts to tokenize the dump (will probably use something
like NLTK, which has powerful built-in tokenizers)
-> Discuss and finalize the design of the search and query expansion perf
tests as I want to complete them
2013 Mar 26
1
Merging of the TfIdf patch
Hello guys. I have updated the code, tests, documentation, makefile entries
and the registry entry of the TfIdf patch as per the feedback. Please do
let me know if any additional changes are required before the patch can be
merged.
-Regards
-Aarsh
On Sun, Mar 3, 2013 at 2:50 PM, aarsh shah <aarshkshah1992 at gmail.com> wrote:
> Hello guys.I have sent a pull request for the code and
2018 Oct 28
2
Fwd: GoXapian
This is probably more useful to the general list. Thanks for the update, Dave — was this based on the previous golang bindings?
J
> Begin forwarded message:
>
> From: Dave Courtois <dave.courtois60 at gmail.com>
> Subject: GoXapian
> Date: 27 October 2018 at 15:27:09 BST
>
> Hi there just a little word to tell you that
2008 Feb 27
2
problem with creation of eSet
Hi,
I am having trouble creating an eSet and would appreciate any help on
the following problem.
I am trying to create an eSet using the following code:
pd <- read.table(file="pdata.txt",header =TRUE,row.names=1);
colnames(pd) <- c("type","tumor","time","id");
pdN <- list(type =
2013 Feb 25
0
Sent a pull request for the Tf-Idf Weighting scheme
Hello guys :) I have sent a pull request for the Tf-Idf weighting scheme,
incorporating as many normalizations as I could with the help of statistics
currently available from Xapian::Weight. Please let me know what you all
think about it.
I used the weighting scheme in a simple searcher and it did a fine job with
it. I have no experience with writing tests for features like this. Please
give me
2013 Feb 19
2
Implementing tf-idf weighting scheme in Xapian
Hello guys. I just read up about tf-idf schemes and want to implement one in
Xapian (with some frequently used normalizations), as it will also give me a
good hang of implementing a weighting scheme before I start working on
implementing DFR schemes.
I read the following as references and I think I've understood them well and
can write the hack :-
1.)
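For readers skimming this thread, the basic tf-idf idea can be sketched as follows. This is only a minimal Python illustration, not the code from the pull request and not Xapian's API: the function name tf_idf, the toy documents, and the choice of raw term frequency with idf = ln(N / df) are all assumptions made for the example, and none of the normalizations mentioned in the post are shown.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumed (illustrative) variant: weight = tf * ln(N / df), where tf is the
    raw count of the term in the document, N is the number of documents and
    df is the number of documents containing the term.
    """
    n_docs = len(docs)
    # Document frequency: count each term at most once per document.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights


# Hypothetical toy collection, already tokenized.
docs = [
    ["xapian", "weight", "scheme"],
    ["xapian", "query"],
    ["weight", "weight", "query"],
]
scores = tf_idf(docs)
```

A term that occurs in only one document ("scheme" here) gets the largest idf factor, while repeating a term within a document ("weight" in the third one) scales its weight linearly under this un-normalized variant.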
2013 Mar 15
1
DFR framework as a GSOC project
Hey guys, hi :) I've finished implementing the PL2 scheme. The bounds I
have implemented for it are as good as I could manage, given the nature of the
scheme and my mathematical skills. However, tight bounds for other named DFR
schemes will be easier to implement because their formulas are much
simpler compared to PL2. Will send in a pull request in a couple of days
once I'm done with the tests
2007 Apr 20
1
ExpandDecider and MatchDecider operator() return type
Currently ExpandDecider::operator() and MatchDecider::operator() return
`int' for no very good reason that I can see. It would be more natural to
return `bool', since these classes are making a "yes/no" decision about
whether to include a term in an ESet or a document in an MSet.
The problem is that this can't be done without breaking existing user
code which defines
2013 Feb 28
0
Implemented test coverage for using TradWeight with an Rset
Hello guys. Olly had pointed out on IRC that there is no test for using the
TradWeight scheme with an Rset and had directed me to the test coverage of
the TradWeight scheme :-
http://lcov.xapian.org/latest/weight/tradweight.cc.gcov.html
I have added a test to api_anydb.cc which tests the TradWeight scheme using
an Rset. I simply modified the rset2 test in api_anydb.cc so that the Mset
is made
2013 Jan 27
1
Added a python example to the community page
Hey guys, I have added a Python indexer example to the SampleCode page of
our wiki. Please do have a look. The code can also be found here :-
https://github.com/aarshkshah1992/xapian/blob/efcf443527b74326119bbc0935fc41a002ce60db/xapian-bindings/python/docs/examples/simpleindexgrep.py/
Thanks :)
-Regards
-Aarsh
2013 Jan 24
1
Integrating a PaiceHusk stemmer into the library
Hey guys, hi :) I've implemented a PaiceHusk stemmer externally. So what I
am doing right now is passing a pointer to my StemPaiceHusk class (which in
turn has been subclassed from StemImplementation) to the
Stem::Stem(StemImplementation *p) constructor. So basically, I have to
include "paicehusk.h" in my indexer. However, I now want to make it a part
of the Xapian library so that I
2014 Mar 04
2
Test Dataset for performance and accuracy analysis
Hi Parth,
I implemented DFR algorithms in Xapian as
a part of GSOC last year under the mentorship of Olly. This year, I want to
work on analyzing and optimizing the performance of the DFR algorithms and
comparing them with BM25. I also want to work on profiling the query
expansion schemes and test the relevance (precision and recall) / speed (time
taken) of the