search for: get_termfreq

Displaying 10 results from an estimated 10 matches for "get_termfreq".

2012 Jul 09
1
Question about Document and TermIterator.get_termfreq()
Hi, While porting the unit tests from perl for the node binding I noticed a test failed. I basically create a document, add a few terms, add the document to a database and then call doc->termlist_begin().get_termfreq(). This throws "Can't get term frequency from a document termlist which is not associated with a database." What I think this means is that I can not call get_termfreq from a TermIterator obtained from a document (this is only available for a TermIterator obtained from a Database obj...
2006 Jan 30
1
More than one Index?
Morning All, I use scriptindex to build my database and the PHP bindings to pull it all out. Is it possible to have more than one index but select what the bindings search on? So at the moment I index property addresses, I would also like to index property descriptions for more advanced searching but only as an optional extra...probably in an extra search box. Also I would like to analyse the
2010 Jan 16
1
PHP XapianTermIterator/XapianPositionIterator usage
Hello again, /thanks to Peter for previous response. I've been digging around trying to find sample usage of XapianTermIterator/XapianPositionIterator in PHP. The idea is to code up a test case in PHP to perform snippet extraction (with a possible view to coding a pecl extension in C). I found a C++ sample, but that wasn't much help. I must be dense this morning though, since I
2009 Feb 12
1
problem when using xapian's static libs in windows
...bj) : error LNK2001: ????????? "public: virtual double __thiscall RemoteDatabase::get_doclength(unsigned int)const " (?get_doclength at RemoteDatabase@@UBENI at Z) libbackend.lib(dbfactory_remote.obj) : error LNK2001: ????????? "public: virtual unsigned int __thiscall RemoteDatabase::get_termfreq(class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char> > const &)const " (?get_termfreq at RemoteDatabase@@UBEIABV?$basic_string at DU?$char_traits at D@std@@V?$allocator at D@2@@std@@@Z) libnet.lib(progclient.obj) : error LNK2001: ?????????...
2010 Jul 26
2
related documents
Hi All, I would like to take a doc in the xapian DB and find all related documents by relevance e.g. so when you view one document it says "Related entries X Y Z". I'm aware of the "Morelikethis" Lucene plugin that is supposed to do something like this, by generating a query from a document based on term frequency. Has anyone developed a tool to generate a query from a
2005 Aug 12
1
error building xapian
...CntPtr<const InMemoryDatabase>, Xapian::docid, const InMemoryDoc&, Xapian::doclength)': inmemory_database.cc:184: error: class 'InMemoryTermList' does not have any field named 'db' inmemory_database.cc: In member function 'virtual Xapian::doccount InMemoryTermList::get_termfreq() const': inmemory_database.cc:205: error: 'db' was not declared in this scope inmemory_database.cc: In member function 'virtual OmExpandBits InMemoryTermList::get_weighting() const': inmemory_database.cc:223: error: 'db' was not declared in this scope inmemory_database....
2010 Aug 28
1
Matchspy and faceting
Have been working recently on a site that classifies posts using tags in taxonomies, so a post about the Oil spill in the Gulf of Mexico might be tagged 'Subscribers only' (access level), 'Barack Obama' (person), 'Tony Hayward' (person), 'BP' (company), 'Transocean' (company), 'Gulf of Mexico' (location). With some advice from Richard Boulton I
2009 Jan 27
1
Segmentation fault in MSetIterator get_weight
Hi, I'm using xapian with c# and mono and i'm having a segfault in get_weight. When i print the index variable, the value is clearly too high. I think something write over it. Do you have any idea on how i could trace the beginning of the segmentation fault ? Thanks, -- Yann
2020 Aug 23
2
MultiDatabase shard count limitations
...2.28.so [.] malloc 0.21% /mnt/btr/public libc-2.28.so [.] __memchr_sse2 0.21% /mnt/btr/public perl [.] Perl_pad_alloc 0.21% perl perl [.] Perl_Slab_Alloc 0.20% script/public-i libxapian.so.30.8.0 [.] OrPostList::get_termfreq_est 0.19% /mnt/btr/public perl [.] Perl_op_lvalue_flags 0.19% script/public-i libc-2.28.so [.] cfree at GLIBC_2.2.5 0.19% /mnt/btr/public libc-2.28.so [.] __libc_calloc 0.19% /mnt/btr/public perl [.] Perl_pad_leavemy 0...
2020 Aug 21
2
MultiDatabase shard count limitations
Going back to the "prioritizing aggregated DBs" thread from February 2020, I've got 390 Xapian shards for 130 public inboxes I want to search against(*). There's more on the horizon (we're expecting tens of thousands of public inboxes). After bumping RLIMIT_NOFILE and running ->add_database a bunch, the actual queries seem to be taking ~30s (not good :x). Now I'm