Displaying 20 results from an estimated 2000 matches similar to: "GSOC 2018: Diversification of Search Results"
2018 Apr 27
3
GSOC 2018: Diversification of Search Results
We are equally excited about working with you over summer.
I think you missed reply by Olly on IRC, you can find it in logs here:
https://botbot.me/freenode/xapian/2018-04-24/?msg=99336093&page=1
- olly
icebyte[m]: i think that probably needs to go through SFC (https://sfconservancy.org/) as the "legal entity"
- 2:05 am
2016 Jul 31
2
Letor: returning MSet after re-ranking
On Sun, Jul 31, 2016 at 12:44:16AM +0100, Olly Betts wrote:
> Would a method which swapped two elements of an MSet provide what you
> need? That would provide a more generic way to adjust the ranking of
> an MSet which for example could be used to implement a diversification
> feature or something like SQL "GROUP BY".
Isn't the most common use going to be that the
2016 Jul 30
2
Letor: returning MSet after re-ranking
>
>
> I'd prefer to avoid adding things to the public API that don't get
> used by end users. However because LTR is outside the Xapian build
> tree, we can't easily give it privileged access to Xapian internals.
>
Sorry for a delayed response.
The way I was thinking of performing reranking with updated weights was to
add a class MSetRanker (basically containing a
2012 Mar 22
1
GSOC : Language Modelling for information retrieval with Diversified Search results
Hello,
I am an undergraduate student at DA-IICT, India, pursuing a BTech in
Information and Communication Technology. My major field of research is
Information Retrieval and Natural Language Processing. Xapian, being a
powerful information retrieval library, has attracted me towards
implementing things learned in class for this project. I have worked on
entity search on RDF data, SMS based FAQ
2010 Oct 08
1
tools:::fetchRdDB can't allocate memory block?
I'm developing an R package, and when I recompile (using Roxygen),
detach, and re-load the package, I can't seem to look at some of its
documented functions. When I try '?function.name', I get the
following error:
Error in tools:::fetchRdDB(RdDB, basename(file)) :
cannot allocate memory block of size 2.6 Gb
I'm using R.app's built-in documentation viewer.
I
2018 Mar 09
2
GSoC aspirant - guruprasad hegde
Dear All,
I'm guruprasad hegde. I would like to contribute to Xapian through
GSOC-2018.
Thank you for this wonderful opportunity.
My Introduction:
I study MSc in Computer science at the University of Saarland. I finished
my 4th semester. Some of the courses I took include NLP, Information
Retrieval & Data mining, statistical learning. These courses helped me
develop the interest in
2007 Oct 16
1
Matches estimate varies with sorting method
Hi all,
I found that the figure returned by MSet::get_matches_estimated() varies
depending on how results are to be sorted.
For instance, in my index, value 4 contains date and time in the format
"yyyymmddhhmmss". For the same query, the number of results will be
estimated to 20000+ when results are first sorted by date and time
with set_sort_by_value_then_relevance(4) and to only 100
2009 Sep 09
4
undefined method `attr_accessor' for #<CustomersControl
please help
--
Posted via http://www.ruby-forum.com/.
2012 Apr 13
1
R: Colouring phylogenetic tip labels and/or edges
Hi,
I have reconstructed ancestral character states on a phylogeny using MuSSE in the diversitree package and plotted the character state probabilities as pie charts on the nodes. I would, however, like to colour the character states of my extant species, i.e. the tip labels, the same colours as my pie charts, such that all species in state 1 are e.g. blue, species in state 2 red and species in
2011 Jun 11
1
NDCG in R?
Does R have a function to calculate NDCG?
http://en.wikipedia.org/wiki/Discounted_cumulative_gain#Normalized_DCG
Thanks!
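For readers unfamiliar with the metric being asked about, here is a minimal sketch of NDCG as defined on the linked Wikipedia page, using the common rel_i / log2(i + 1) discount. It is written in Python rather than R, and the function names `dcg` and `ndcg` and the parameter `k` are illustrative assumptions, not part of any existing R or Python package.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain: sum of rel_i / log2(i + 1) for ranks i = 1, 2, ..."""
    # enumerate() starts at 0, so rank i corresponds to index + 1, hence log2(index + 2).
    return sum(rel / math.log2(index + 2) for index, rel in enumerate(relevances))


def ndcg(relevances, k=None):
    """Normalized DCG: DCG of the given ranking divided by DCG of the ideal ranking.

    `relevances` lists graded relevance scores in the order the system ranked them.
    `k` (hypothetical parameter) truncates the evaluation to the top k positions.
    """
    ranked = list(relevances)[:k]
    # The ideal ranking sorts all judged items by relevance, best first.
    ideal = sorted(relevances, reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    # With no relevant items the ratio is undefined; this sketch returns 0.0.
    return dcg(ranked) / ideal_dcg if ideal_dcg > 0 else 0.0
```

A ranking that already lists items in decreasing order of relevance scores 1.0, and any other ordering of the same items scores lower.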
2010 Aug 28
1
Matchspy and faceting
Have been working recently on a site that classifies posts using tags in
taxonomies, so a post about the Oil spill in the Gulf of Mexico might be
tagged 'Subscribers only' (access level), 'Barack Obama' (person), 'Tony
Hayward' (person), 'BP' (company), 'Transocean' (company), 'Gulf of
Mexico' (location).
With some advice from Richard Boulton I
2023 Aug 28
1
DatabaseModifiedError while iterating on mset
Olly Betts <olly at survex.com> wrote:
> On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> > I'm already retrying the ->get_mset operations; but now I'm
> > wondering where I'd hit DatabaseModifiedErrors while inside a
> > Xapian::MSetIterator loop.
> >
> > I assume ->get_document is a place where it gets thrown;
> > but
2023 Aug 18
1
does Xapian::Enquire hold an MVCC revision?
On Thu, Aug 17, 2023 at 09:28:26PM +0000, Eric Wong wrote:
> In other words, is it possible to avoid duplicates if new
> documents are inserted into the DB by another process in-between
> ->get_mset calls when reusing Xapian::Enquire objects?
The Database object itself effectively does (it works in a snapshot of
the state of the database when you open it, or last called reopen()
which
2023 Aug 27
1
DatabaseModifiedError while iterating on mset
On Wed, Aug 23, 2023 at 01:53:27PM +0000, Eric Wong wrote:
> I'm already retrying the ->get_mset operations; but now I'm
> wondering where I'd hit DatabaseModifiedErrors while inside a
> Xapian::MSetIterator loop.
>
> I assume ->get_document is a place where it gets thrown;
> but once a document is retrieved, can iterating through
> terms in one document
2023 Aug 17
1
does Xapian::Enquire hold an MVCC revision?
In other words, is it possible to avoid duplicates if new
documents are inserted into the DB by another process in-between
->get_mset calls when reusing Xapian::Enquire objects?
I do some expensive processing on each mset window, so I always
limit the results to limit heap usage even if I'm planning on
going through a big chunk of the DB:
$mset = $enq->get_mset(0, 1000);
2016 Jul 29
3
Pull requests: CJK words and Snippet generator
Hi James,
thanks for the feedback.
On Thu, Jul 28, 2016, at 00:22, James Aylett wrote:
> This sounds great! I know sufficiently little about CJK that I won't
> try to comment on that at all :)
I've just opened a pull request for the CJK tokenizer:
https://github.com/xapian/xapian/pull/114
> I wonder if we can arrange suitable defaults to use your
> implementation with the
2010 Feb 11
2
RAID, temperature and FAN status manage and monitoring tool for RHEL4 Update 4 ia64 AS Linux
Hi,
I'm currently using RHEL4 Update 4 ia64 AS Linux,
running on an HP rx6600 server.
I tried to google for a RAID, temperature and fan status management
and monitoring tool for RHEL4 Update 4 ia64 AS Linux,
but I wasn't able to find one for an RHEL4 Update 4 ia64 AS
Linux server.
Can anyone suggest the right status monitoring and management tool
for
2014 May 21
2
Some questions about Letor project
Hi all,
Thank you for giving me the opportunity to work with Xapian :) I am Jiarong
Wei, a third year undergraduate student in Zhejiang University, China. In
GSoC 2014, I will work on Letor module with Hanxiao Sun.
Here are some questions I encountered these days,
1. In letor.cc, we have two parts of functions: the training part and
the ranking part. I'll use SVMRanker as an example. The
2020 Feb 19
2
prioritizing aggregated DBs
Olly Betts <olly at survex.com> wrote:
> On Sat, Feb 08, 2020 at 06:04:42PM +0000, Eric Wong wrote:
> > Olly Betts <olly at survex.com> wrote:
> > > On Fri, Feb 07, 2020 at 09:33:08PM +0000, Eric Wong wrote:
> > > > Or would I fiddle with wdf_inc for all ->index_text and ->add_term
> > > > calls on a per-DB basis?
> > >
>
2010 Jan 09
1
Using HASH() and REALTIME_HASH()
Hi,
I'm playing around with Asterisk 1.6.2.0, and my first attempt was to
replace my now non-functioning
'app-realtime' macro, which emulated RealTime, with REALTIME_HASH().
There is very little documentation on the subject except for this bug report:
https://issues.asterisk.org/view.php?id=13651#c94998
However when i try this syntax: