similar to: GSOC 2018: Diversification of Search Results

Displaying 20 results from an estimated 2000 matches similar to: "GSOC 2018: Diversification of Search Results"

2018 Apr 27
3
GSOC 2018: Diversification of Search Results
We are equally excited about working with you over summer. I think you missed reply by Olly on IRC, you can find it in logs here: https://botbot.me/freenode/xapian/2018-04-24/?msg=99336093&page=1 - olly icebyte[m]: i think that probably needs to go through SFC ( https://sfconservancy.org/) as the "legal entity" - 2:05 am
2016 Jul 31
2
Letor: returning MSet after re-ranking
On Sun, Jul 31, 2016 at 12:44:16AM +0100, Olly Betts wrote: > Would a method which swapped two elements of an MSet provide what you > need? That would provide a more generic way to adjust the ranking of > an MSet which for example could be used to implement a diversification > feature or something like SQL "GROUP BY". Isn't the most common use going to be that the
2016 Jul 30
2
Letor: returning MSet after re-ranking
> > > I'd prefer to avoid adding things to the public API that don't get > used by end users. However because LTR is outside the Xapian build > tree, we can't easily give it privileged access to Xapian internals. > Sorry for a delayed response. The way I was thinking of performing reranking with updated weights was to add a class MSetRanker (basically containing a
2012 Mar 22
1
GSOC : Language Modelling for information retrieval with Diversified Search results
Hello, I am a undergraduate student at DA-IICT,India pursuing Btech in Information and Communication Technology.Major field of my Research is Information Retrieval and Natural Language processing. xapain being an powerful Information retrieval library have attracted me towards implementing stuff learned in class for this project.I have worked on entity search on RDF data,SMS based FAQ
2010 Oct 08
1
tools:::fetchRdDB can't allocate memory block?
I'm developing an R package, and when I recompile (using Roxygen), detach, and re-load the package, I can't seem to look at some of its documented functions. When I try '?function.name', I get the following error: Error in tools:::fetchRdDB(RdDB, basename(file)) : cannot allocate memory block of size 2.6 Gb I'm using R.app's built-in documentation viewer. I
2018 Mar 09
2
GSoC aspirant - guruprasad hegde
Dear All, I'm guruprasad hegde. I would like to contribute to Xapian through GSOC-2018. Thank you for this wonderful opportunity. My Introduction: I study MSc in Computer science at the University of Saarland. I finished my 4th semester. Some of the courses I took include NLP, Information Retrieval & Data mining, statistical learning. These courses helped me develop the interest in
2007 Oct 16
1
Matches estimate varies with sorting method
Hi all, I found that the figure returned by MSet::get_matches_estimated() varies depending on how results are to be sorted. For instance, in my index, value 4 contains date and time in the format "yyyymmddhhmmss". For the same query, the number of results will be estimated to 20000+ when results are first sorted by date and time with set_sort_by_value_then_relevance(4) and to only 100
2009 Sep 09
4
undefined method `attr_accessor' for #<CustomersControl
pleas help -- Posted via http://www.ruby-forum.com/.
2012 Apr 13
1
R: Colouring phylogenetic tip labels and/or edges
Hi, I have reconstructed ancestral character states on a phylogeny using MuSSE in the diversitree package and plotted the character state probabilities as pie charts on the nodes. I would, however, like to colour the character states of my extant species, i.e. the tip labels, the same colours as my pie charts, such that all species in state 1 are e.g. blue, species in state 2 red and species in
2011 Jun 11
1
NDCG in R?
Dose R have a function to calculate NDCG? http://en.wikipedia.org/wiki/Discounted_cumulative_gain#Normalized_DCG Thanks!
2010 Aug 28
1
Matchspy and faceting
Have been working recently on a site that classifies posts using tags in taxonomies, so a post about the Oil spill in the Gulf of Mexico might be tagged 'Subscribers only' (access level), 'Barack Obama' (person), 'Tony Hayward' (person), 'BP' (company), 'Transocean' (company), 'Gulf of Mexico' (location). With some advice from Richard Boulton I
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote: > On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote: > > I'm already retrying the ->get_mset operations; but now I'm > > wondering where I'd hit DatabaseModifiedErrors while inside a > > Xapian::MSetIterator loop. > > > > I assume ->get_document is a place where it gets thrown; > > but
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote: > In other words, is it possible to avoid duplicates if new > documents are inserted into the DB by another process in-between > ->get_mset calls when reusing Xapian::Enquire objects? The Database object itself effectively does (it works in a snapshot of the state of the database when you open it, or last called reopen() which
2023 Aug 27
1
DatabaseModifiedError while iterating on mset
On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote: > I'm already retrying the ->get_mset operations; but now I'm > wondering where I'd hit DatabaseModifiedErrors while inside a > Xapian::MSetIterator loop. > > I assume ->get_document is a place where it gets thrown; > but once a document is retrieved, can iterating through > terms in one document
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new documents are inserted into the DB by another process in-between ->get_mset calls when reusing Xapian::Enquire objects? I do some expensive processing on each mset window, so I always limit the results to limit heap usage even if I'm planning on going through a big chunk of the DB: $mset = $enq->get_mset(0, 1000);
2016 Jul 29
3
Pull requests: CJK words and Snippet generator
Hi James, thanks for the feedback. On Thu, Jul 28, 2016, at 00:22, James Aylett wrote: > This sounds great! I know sufficiently little about CJK that I won't > try to comment on that at all :) I've just opened a pull request for the CJK tokenizer: https://github.com/xapian/xapian/pull/114 > I wonder if we can arrange suitable defaults to use your > implementation with the
2010 Feb 11
2
RAID, temperature and FAN status manage and monitoring tool for RHEL4 Update 4 ia64 AS Linux
Hi, Currently I've been using an RHEL4 Update 4 ia64 AS Linux version and running in HP rx6600 server. I tried to google-out the RAID, temperature and FAN status manage and monitoring tool for RHEL4 Update 4 ia64 AS Linux But i can't able to findout the same for RHEL4 Update 4 ia64 AS Linux server Can anyone suggest me the write status monitoring and manage tool for
2014 May 21
2
Some questions about Letor project
Hi all, Thank you for giving me the opportunity to work with Xapian :) I am Jiarong Wei, a third year undergraduate student in Zhejiang University, China. In GSoC 2014, I will work on Letor module with Hanxiao Sun. Here are some questions I encountered these days, 1. In letor.cc, we have two parts of functions: the training part and the ranking part. I?ll use SVMRanker as an example. The
2020 Feb 19
2
prioritizing aggregated DBs
Olly Betts <olly at survex.com> wrote: > On Sat, Feb 08, 2020 at 06:04:42PM +0000, Eric Wong wrote: > > Olly Betts <olly at survex.com> wrote: > > > On Fri, Feb 07, 2020 at 09:33:08PM +0000, Eric Wong wrote: > > > > Or would I fiddle with wdf_inc for all ->index_text and ->add_term > > > > calls on a per-DB basis? > > > >
2010 Jan 09
1
Using HASH() and REALTIME_HASH()
Hi, I'm playing around with asterisk 1.6.2.0 and the first try was to replace my now non-functionning 'app-realtime' macro which emulated RealTime with REALTIME_HASH() There is very few documentation on the subject except for this bug report: https://issues.asterisk.org/view.php?id=13651#c94998 However when i try this syntax: