Displaying 20 results from an estimated 1000 matches similar to: "Updates on Evaluation Module"
2016 Jul 25
3
Weighting Schemes: Evaluation results
Hi James,
> We probably don't want them committed in git where they're evaluation
> runs (because we can recreate them); a gist might be more appropriate.
Sorry, I have moved the results files over to a gist for each individual
weighting scheme.
Link: https://gist.github.com/ivmarkp/secret
> I can't tell, but are some of those files from FIRE? If so, they
> shouldn't be
2016 Jul 24
2
Weighting Schemes: Evaluation results
Hi all,
I have evaluated the new weighting schemes along with their existing
counterparts in Xapian to compare them and see which one does a better job.
Also, I have put together all the results files for easy access here:
https://github.com/ivmarkp/xapian-evaluation/tree/evaluation/run
and a README for getting started with the xapian-evaluation module. Hopefully,
it might be of help to those who are new to
2011 Apr 29
2
read.csv fails to read a CSV file from google docs
Hello all,
I wish to use read.csv to read a google doc spreadsheet.
I try using the following code:
data_url <- "http://spreadsheets0.google.com/spreadsheet/pub?hl=en&hl=en&key=0AgMhDTVek_sDdGI2YzY2R1ZESDlmZS1VYUxvblQ0REE&single=true&gid=0&output=csv"
read.csv(data_url)
Which results in the following error:
Error in file(file, "rt") : cannot open
2013 Feb 07
0
Ideas for allowing specification of weighing scheme for Eset
Hey guys, hi :) I am working on a hack which will allow the user to
specify a weighting scheme (along with the parameters, if they do not
want to use the default values) to build the Eset (rather than using the
hard-coded TradWeight scheme with default k=1), as Olly had suggested
that we can probably get better terms (a more relevant Eset) for query
expansion if we use say something
2017 Mar 16
2
GSoC-2017 Introduction and Project Discussion
Hello,
I'm Shivang Bansal, a 3rd year Computer Science Engineering undergraduate
at the Institute of Engineering & Technology in Lucknow, India. This mail is an
expression of my interest in this year's Google Summer of Code program. I
want to apologize for getting in touch so late. I would have made contact
earlier, but the sudden demise of my grandfather prevented me from doing so.
I am
2013 Feb 04
1
is it possible to create a trellis object with multiple colorkeys/z-scale axis?
Dear R users-
Do the lattice/latticeExtra packages support multiple tileplots with unique
colorkeys? I am trying to create a visualization of the encounter rate of
certain stocks of salmon across two years. The issue I am having is that
some stocks are encountered at much higher rates than others. The high
encounter rate for one stock washes out the patterns I'm hoping to visualize
across the
2016 Jul 28
2
Weighting Schemes: Evaluation results
> Ah. If FIRE doesn't have something that can show this suitably, then
> maybe Parth can advise on access to TREC, as I know he's used some of
> them in the past.
>
I can say FIRE is also a reliable source, but INEX/TREC are better. INEX
can give you free access, whereas TREC is not freely available. I had used INEX
for Xapian in the past and some details are here:
2014 Nov 23
2
GSoc Project Idea Weighting Schemes (Ranking)
Hi,
I am Abhishek.
Currently Xapian::Weight follows the BM25 scheme; many models, such as the
Divergence from Randomness (DfR) family of models, the Unigram Language Model
and the Bi-gram Language Model, were implemented two years ago in GSoC 2012 yet
are not merged to master.
The new weighting schemes or improvement in implementing the previous models
to change the default scheme of BM25 from SMART with
2012 Dec 04
0
latticeExtra tileplot question - tiles are not all the same size, need help.
Hello,
I have been creating many tileplots to try and illustrate the relative
abundance of fish through space and time. My issue is that the tiles that
border the plot are smaller than those in the center of the plot. In the
example I've provided the effect is pretty minor (I'm hoping this will be an
adequate example as I had the code already created/data uploaded). However,
I have other
2018 Apr 27
3
GSOC 2018: Diversification of Search Results
We are equally excited about working with you over the summer.
I think you missed the reply by Olly on IRC; you can find it in the logs here:
https://botbot.me/freenode/xapian/2018-04-24/?msg=99336093&page=1
- olly
icebyte[m]: i think that probably needs to go through SFC (https://sfconservancy.org/) as the "legal entity"
- 2:05 am
2005 Sep 06
2
Password expiration
Hello,
I still can't figure out why my users' passwords are expiring. Ever since we
switched to LDAP, passwords expire every 14 days. Here is an example LDAP user
entry:
version: 1
# LDIF Export for: uid=sgoodrich,ou=Users,dc=trec,dc=us
# Generated by phpLDAPadmin ( http://phpldapadmin.sourceforge.net/ ) on September 6, 2005 10:56 am
# Server: TREC (localhost)
# Search Scope: base
#
2014 Mar 04
2
Test Dataset for performance and accuracy analysis
Hi Parth,
I implemented DFR algorithms in Xapian as
a part of GSOC last year under the mentorship of Olly. This year, I want to
work on analyzing and optimizing the performance of the DFR algorithms and
comparing them with BM25. I also want to work on profiling the query
expansion schemes and test the relevance (precision and recall) / speed (time
taken) of the
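
The two kinds of measurement mentioned in this snippet, relevance (precision and recall) and speed (time taken), could be sketched roughly as follows. This is only an illustration: the function names and the toy document ids are assumptions, and nothing here uses the Xapian API or the poster's actual test setup.

```python
import time


def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: ids returned by the search (hypothetical input)
    relevant:  ids judged relevant for the query (hypothetical input)
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    # Guard against empty sets so the ratios are always defined.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start
```

For instance, `timed(precision_recall, [1, 2, 3, 4], [2, 4, 6])` returns the pair of scores together with the wall-clock time the call took.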
2005 Aug 25
2
LDAP and password expiry
Hello,
We are using Samba 3.0.14a-6, slapd 2.2.26-4 and smbldap-tools 0.9.1-2 on
Debian. My users are complaining about warnings that their password is about to
expire and that they are told "You do not have permission to change your
password" when they try to change it. sambaAcctFlags includes the X flag which
I thought meant "don't expire passwords." The password changing
2013 Jul 15
0
Xapian now has Divergence from Randomness schemes
Hello guys, you'll be happy to know that the current codebase now
includes the divergence from randomness weighting schemes which are known
to outperform a lot of known weighting schemes such as BM25. Thanks to the
amazing mentorship of Olly Betts and Dan Colish, our search results will
now be better than before and Xapian will be more preferred in the research
community than it was
2017 Apr 08
2
Omega: Missing support for newer weighting schemes
On Sat, Apr 08, 2017 at 09:11:22PM +0100, James Aylett wrote:
> On 8 Apr 2017, at 19:15, Vivek Pal <vivekpal.dtu at gmail.com> wrote:
>
> >> and the details of which weighting schemes were available in which version
> >> isn't a key part of the $set command itself.
> >
> > Do you suggest dropping that piece of information out? Since the reason behind
2011 Feb 18
1
Is it possible to reset the parameters in BM25 each time a new query enters?
Hi guys,
I'm trying to improve the search results of our collection by tuning the parameters in the BM25 weighting scheme. Since our collection includes several databases, such as for pictures, websites, etc., I would like to use different values of the same scheme to calculate the weights. Yet, rebuilding each time after a change is made to the header file does not seem an optimal approach and
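
As a rough illustration of the idea in this question, the sketch below passes BM25 parameters at call time, so each database can use its own values without rebuilding anything. It uses a common textbook form of BM25, which is not necessarily the exact formula Xapian implements, and the collection names and parameter values are made up for the example.

```python
import math


def bm25_term_weight(tf, doc_len, avg_doc_len, n_docs, doc_freq, k1=1.2, b=0.75):
    """Textbook BM25 weight of one term in one document (illustrative only)."""
    # Inverse document frequency, smoothed so it never goes negative.
    idf = math.log(1.0 + (n_docs - doc_freq + 0.5) / (doc_freq + 0.5))
    # Length normalisation: b=0 ignores document length, b=1 applies it fully.
    norm = 1.0 - b + b * doc_len / avg_doc_len
    return idf * tf * (k1 + 1.0) / (tf + k1 * norm)


# Hypothetical per-collection settings, chosen only to show the mechanism.
PARAMS_BY_COLLECTION = {
    "pictures": {"k1": 0.8, "b": 0.3},
    "websites": {"k1": 1.6, "b": 0.75},
}


def score(collection, **stats):
    """Score with the parameters registered for the given collection."""
    return bm25_term_weight(**stats, **PARAMS_BY_COLLECTION[collection])
```

A call such as `score("pictures", tf=3, doc_len=200, avg_doc_len=100, n_docs=1000, doc_freq=10)` then picks up the picture-specific `k1` and `b` for that query only.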
2017 Apr 09
3
Omega: Missing support for newer weighting schemes
On Sun, Apr 09, 2017 at 11:34:07PM +0530, Vivek Pal wrote:
> > Each scheme already has a human-readable name, and Xapian::Registry
> > can map that to an "exemplar" object of the right type, so we
> > could take a string like "bm25 1 0.8", see the first word is "bm25"
> > and get a BM25Weight object, then call parse_params("1 0.8") on
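
The dispatch described in this snippet (take the first word as the scheme name, look up an exemplar object, then hand the rest of the string to that object) might look something like the sketch below. It is a toy model in Python, not Xapian code: `BM25Sketch`, `REGISTRY` and `weight_from_string` are hypothetical names, the two parameters and their defaults are illustrative, and only the `parse_params` name and the "bm25 1 0.8" string come from the thread.

```python
from dataclasses import dataclass


@dataclass
class BM25Sketch:
    """Stand-in for a weighting scheme; not the real Xapian class."""
    k1: float = 1.0
    b: float = 0.5

    def parse_params(self, params: str) -> "BM25Sketch":
        # Build a new object of the same type from a parameter string;
        # any values left out keep this exemplar's settings.
        values = [float(v) for v in params.split()]
        defaults = [self.k1, self.b]
        if len(values) > len(defaults):
            raise ValueError("too many parameters for bm25")
        merged = values + defaults[len(values):]
        return BM25Sketch(*merged)


# Hypothetical registry: human-readable name -> exemplar object.
REGISTRY = {"bm25": BM25Sketch()}


def weight_from_string(spec: str):
    """Turn a string like "bm25 1 0.8" into a configured scheme object."""
    # First word selects the scheme; the remainder is its parameter string.
    name, _, params = spec.strip().partition(" ")
    try:
        exemplar = REGISTRY[name.lower()]
    except KeyError:
        raise ValueError(f"unknown weighting scheme: {name!r}") from None
    return exemplar.parse_params(params)
```

Keeping the parameter parsing on the scheme object itself means the caller never needs to know how many parameters a given scheme takes, which seems to be the point of the suggestion quoted above.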
2014 Mar 22
2
[GSOC 2014] Indexing INEX dataset
For unsupervised approaches like BM25 this approach works well, but letor
does not need special weighting for the title in this form, as it assigns
weights to title features separately itself.
But I see your concern: it would be a problem when BM25 is used on the index
with this setup. Hence it's preferable to take note of this uplift in
title weight for xapian-letor and normalize it everywhere
2013 Aug 26
2
Backend for Lucene format indexes-How to get doclength
On Mon, Aug 26, 2013 at 09:41:07AM +0800, jiangwen jiang wrote:
> > For now, using weighting schemes which don't use document length is
> > probably the simplest answer.
>
> There's tf-idf weighting scheme on svn master, is it suitable for lucene
> backend?
Yes - TfIdfWeight doesn't ever use the document length (at least with
the normalisations currently