thr3ads.net - similar to: "Updates on Evaluation Module"

Displaying 20 results from an estimated 1000 matches similar to: "Updates on Evaluation Module"

2016 Jul 25

Weighting Schemes: Evaluation results

Hi James, > We probably don't want them committed in git where they're evaluation > runs (because we can recreate them); a gist might be more appropriate. Sorry, I have moved results files over to gist for each individual weighting scheme. Link: https://gist.github.com/ivmarkp/secret > I can't tell, but are some of those files from FIRE? If so, they > shouldn't be

Weighting Schemes: Evaluation results

2016 Jul 24

Weighting Schemes: Evaluation results

Hi all, I have evaluated new weighting schemes along with their existing counterparts in Xapian to compare and see which one does better job. Also, I have put together all the results files for easy access here: https://github.com/ivmarkp/xapian-evaluation/tree/evaluation/run and a README for getting started with xapian-evaluation module. Hopefully, it might be of help to those who are new to

read.csv fails to read a CSV file from google docs

2011 Apr 29

read.csv fails to read a CSV file from google docs

Hello all, I wish to use read.csv to read a google doc spreadsheet. I try using the following code: data_url <- " http://spreadsheets0.google.com/spreadsheet/pub?hl=en&hl=en&key=0AgMhDTVek_sDdGI2YzY2R1ZESDlmZS1VYUxvblQ0REE&single=true&gid=0&output=csv " read.csv(data_url) Which results in the following error: Error in file(file, "rt") : cannot open

Ideas for allowing specification of weighing scheme for Eset

2013 Feb 07

Ideas for allowing specification of weighing scheme for Eset

Hey guys ,Hi :) I am working on a hack which will allow the user to specify a weighing scheme (along with the parameters , if he does not not want to use the default values) to build the Eset (rather than using the hard coded TradWeight scheme with default k=1 ) as Olly had suggested that we can probably get better terms (a more relevant Eset) for query expansion if we use say something

GSoC-2017 Introduction and Project Discussion

2017 Mar 16

GSoC-2017 Introduction and Project Discussion

Hello, I'm Shivang Bansal, a 3rd year Computer Science Engineering undergraduate at Institute of Engineering & Technology in Lucknow, India. This mail is an expression of my interest for Google Summer of Code program of this year. I want to apologize for getting in so late. Actually I would have contacted earlier, but sudden demise of my Grandfather disabled me in doing so. I am

is it possible to create a trellis object with multiple colorkeys/z-scale axis?

2013 Feb 04

is it possible to create a trellis object with multiple colorkeys/z-scale axis?

Dear R users- Do the lattice/latticeExtra packages support multiple tileplots with unique colorkeys? I am trying to create a visualization of the encounter rate of certain stocks of salmon across two years. The issue I am having is that some stocks are encountered at much higher rates than others.the high encounter rate for one stock washes out the patterns I'm hoping to visualize across the

Weighting Schemes: Evaluation results

2016 Jul 28

Weighting Schemes: Evaluation results

Ah. If FIRE doesn't have something that can show this suitably, then > maybe Parth can advise on access to TREC, as I know he's used some of > them in the past. > ?I can say FIRE is also a reliable source but INEX/TREC are better. INEX can give you free access and TREC is not freely available. I had used INEX for xapian in the past and some details are here:

GSoc Project Idea Weighting Schemes (Ranking)

2014 Nov 23

GSoc Project Idea Weighting Schemes (Ranking)

Hi, I am Abhishek Currently Xapian::Weight follows BM25 scheme, many models such as the Divergence from Randomness (DfR) family of models, Unigram Language Model and the Bi-gram Language Model implemented two years ago in GSoc 2012 yet not merged to the master. The new weighing schemes or improvement in implementing the previous models to change the default scheme of BM25 from SMART with

latticeExtra tileplot question - tiles are not all the same size, need help.

2012 Dec 04

latticeExtra tileplot question - tiles are not all the same size, need help.

Hello, I have been creating many tileplots to try and illustrate the relative abundance of fish through space and time. My issue is that the tiles that border the plot are smaller than those in the center of the plot. In the example I've provided the effect is pretty minor (I'm hoping this will be an adequate example as I had the code already created/data uploaded). However, I have other

GSOC 2018: Diversification of Search Results

2018 Apr 27

GSOC 2018: Diversification of Search Results

We are equally excited about working with you over summer. I think you missed reply by Olly on IRC, you can find it in logs here: https://botbot.me/freenode/xapian/2018-04-24/?msg=99336093&page=1 - olly icebyte[m]: i think that probably needs to go through SFC ( https://sfconservancy.org/) as the "legal entity" - 2:05 am

Password expiration

2005 Sep 06

Password expiration

Hello, I still can't figure out why my users' passwords are expiring. Ever since we switched to LDAP, passwords expire every 14 days. Here is an example LDAP user entry: version: 1 # LDIF Export for: uid=sgoodrich,ou=Users,dc=trec,dc=us # Generated by phpLDAPadmin ( http://phpldapadmin.sourceforge.net/ ) on September 6, 2005 10:56 am # Server: TREC (localhost) # Search Scope: base #

Test Dataset for performance and accuracy analysis

2014 Mar 04

Test Dataset for performance and accuracy analysis

Hi Parth, I implemented DFR algorithms in Xapian as a part of GSOC last year under the mentorship of Olly. This year, I want to work on analyzing and optimizing the performance of the DFR algorithms and comparing them with BM25.I also want to work on profiling the query expansion schemes and test the relevance(precision and recall) / speed(time taken) of the

LDAP and password expiry

2005 Aug 25

LDAP and password expiry

Hello, We are using Samba 3.0.14a-6, slapd 2.2.26-4 and smbldap-tools 0.9.1-2 on Debian. My users are complaining about warnings that their password is about to expire and that the are told "You do not have permission to change your password" when they try to change it. sambaAcctFlags includes the X flag which I thought meant "don't expire passwords." The password changing

Xapian now has Divergence from Randomness schemes

2013 Jul 15

Xapian now has Divergence from Randomness schemes

Hello guys, you'll will be happy to know that the current codebase now includes the divergence from randomness weighting schemes which are known to outperform a lot of known weighting schemes such as BM25. Thanks to the amazing mentorship of Olly Betts and Dan Colish, our search results will now be better than before and Xapian will be more preferred in the research community than it was

Xapian now has Divergence from Randomness schemes

2013 Jul 15

Xapian now has Divergence from Randomness schemes

Omega: Missing support for newer weighting schemes

2017 Apr 08

Omega: Missing support for newer weighting schemes

On Sat, Apr 08, 2017 at 09:11:22PM +0100, James Aylett wrote: > On 8 Apr 2017, at 19:15, Vivek Pal <vivekpal.dtu at gmail.com> wrote: > > >> and the details of which weighting schemes were available in which version > >> isn't a key part of the $set command itself. > > > > Do you suggest dropping that piece of information out? Since the reason behind

Is it possible to reset the parameters in BM25 each time a new query enters?

2011 Feb 18

Is it possible to reset the parameters in BM25 each time a new query enters?

Hi guys, I'm trying to improve the search results of our collection by tuning the parameters in the BM25 weighting schema. Since our collection includes several databases, such as for pictures, websites, etc., I would like to use different values of the same schema to calculate the weights. Yet, rebuilding each time after the change was done to the head file seems not an optimal approach and

Omega: Missing support for newer weighting schemes

2017 Apr 09

Omega: Missing support for newer weighting schemes

On Sun, Apr 09, 2017 at 11:34:07PM +0530, Vivek Pal wrote: > > Each scheme already has a human-readable name, and Xapian::Registry > > can map that to an "examplar" object of the right type, so we > > could take a string like "bm25 1 0.8", see the first word is "bm25" > > and get a BM25Weight object, then call parse_params("1 0.8") on

[GSOC 2014] Indexing INEX dataset

2014 Mar 22

[GSOC 2014] Indexing INEX dataset

For unsupervised approaches like BM25 this approach works well but letor does not need special weighting for title in this form as it itself assigns weights to title features separately. But I see your concern it would be a problem when BM25 is used on the index with this setup. Hence its preferable to take a note of this uplift in title weight for xapian-letor and normalize it everywhere

Backend for Lucene format indexes-How to get doclength

2013 Aug 26

Backend for Lucene format indexes-How to get doclength

On Mon, Aug 26, 2013 at 09:41:07AM +0800, jiangwen jiang wrote: > > For now, using weighting schemes which don't use document length is > > probably the simplest answer. > > There's tf-idf weighting scheme on svn master, is it suitable for lucene > backend? Yes - TfIdfWeight doesn't ever use the document length (at least with the normalisations currently

similar to: Updates on Evaluation Module