Displaying 20 results from an estimated 7000 matches similar to: "Xapian 1.0.7 released"
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
Xapians,
I am maintaining about two indexes for my search engines which
approximately is each the same size. I would like to share this
knowledge with you, since many of you have never seen Xapian index of
this size. And of course you can search the index by yourself at
- http://myhealthcare.com/
- http://find1friend.com/
I need 2 x 100 million more documents into each index, and I hope it
will
2007 Jul 17
1
BUG IN XAPIAN_FLUSH_THRESHOLD
There is is bug when setting XAPIAN_FLUSH_THRESHOLD=20000000
When trying for force Xapian flush documents to flush after 20 million
documents Xapian ignores the size and flush it after only 10,000
documents.
Data captured from delve after 60 seconds interval when has been set as follow:
XAPIAN_FLUSH_THRESHOLD=20000000
perl -e ' while(1) { system("delve ."); sleep(60); } '
2007 Jul 09
7
Xapian pubmeet
Hi all,
A few of us have been discussing whether we should have a Xapian social
gathering of some kind. The current idea is meeting up in a pub in
London some time in autumn for drinks and food. However all of this
really depends on who might be able to come! It would be a chance to
meet other Xapian enthusiasts in an informal social setting and talk
about all things search-related (and
2012 Nov 14
4
xapian-replicate errors
Hi,
While trying to setup xapian replication (initially for backup
purposes), I'm encountering some errors.
Our "fresh" index starts replication, and ends up with an index size
that matches the replication master (4.5GB), but then throws :
"Getting update for fresh from fresh
xapian-replicate: NetworkError: Unable to fully synchronise: Database
changing too fast"
I
2011 May 16
2
Xapian.org design
Hi guys
We were talking a little while ago about putting together some content
to provide more examples of operations for first-time users.
One of the other thoughts that's come up is about the design of the
Xapian site; we've a bit of spare designer capacity at work and were
wondering if we could do you a jazz-ed up site design (for free) and
wondered if this'd be something the
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
Hi,
I am looking for Chinese Japanese and Korean tokenizer that could can
be use to tokenize terms for CJK languages. I am not very familiar
with these languages however I think that these languages contains one
or more words in one symbol which it make more difficult to tokenize
into searchable terms.
Lucene has CJK Tokenizer ... and I am looking around if there is some
open source that we
2007 Sep 20
3
Incorrect get_matches_estimated() of Xapian::Mset
Hello, As I know, get_matches_estimated() return an estimate for the number of documents with matches the query.
But now, I found it get a disparity between the return value and real mathced number. For an example: the real matched number is 58, but the return value is 458; so when the users click the hinder page, get a blank page ... so they often complain to me.
I found that the main reason is
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
Xapians!
If tomorrow Xapian search engine would achieved the same performance
and result in searches as Google we would not be able to beat Google,
because we would create only a copy of the searches that already
exists from Google search engine. However there is a way to beat
anyone, and there is a way to beat Google successfully as well just do
not give up. Some see it as implementing Ajax, or
2010 Aug 23
2
NetBeans and Java Bindings
Hello,
I was wondering if anyone has succeeded in getting the Java bindings to work
with NetBeans, in order to make use of NetBeans's GUI developer. I've had no
luck so far, does anyone know how to do that?
Many thanks.
2007 Feb 07
2
My new record: Indexing 20 millions docs = 79m9.378s
Gentoo Linux 2.6
8 AMD Opteron 64-bit Processors
32GB Memory
--------------------------------------------------------------------------------
Environment:
------------------
XAPIAN_FLUSH_THRESHOLD=21000000
XAPIAN_FLUSH_THRESHOLD_LENGTH=16000000
XAPIAN_PREFER_FLINT=True
Indexing 20 million documents:
--stemmer=none
-------------------------------------------
real 79m9.378s
user 77m28.696s
2018 Mar 20
2
how to build 64bit xapian using MSVC2017?
On Tue, Mar 20, 2018 at 06:30:07PM +0000, Olly Betts wrote:
> https://lists.xapian.org/pipermail/xapian-discuss/2018-January/009585.html
Related to this, the appveyor build is currently failing on git master.
Unfortunately the change at which is started to fail was the addition
of the new "honey" backend, which doesn't narrow things down to a
useful degree.
I've checked over
2016 Jul 06
2
Xapian 1.4.0 released
I have installed the new Xapian 1.4.0 , during the installation, I
haven't seen any problems, however, when I execute commands quest and
delve I get different versions, and my Perl-based searches return
Exception: Couldn't detect type of database ... and what are these
glass things in the index directories? There is a no new version of
Perl Search::Xapian.
$ quest -version
quest -
2012 Dec 08
2
Want to contribute code to the Xapian project
Hey guys,I am a 3rd year Computer Science undergrad student.I a extremely
interested in contributing code to the XAPIAN project. The work you people
do sounds extremely fascinating and interesting.Can someone just give me a
brief overview of how to proceed ?. I Can code in C,C++ and Python and
have experience in Natural Lanuage Processing.Am also quite comfortable
with NLTK and using Wordnet.Am
2011 May 12
2
Xapian support for huge data sets?
Hello,
I?m currently using another open source search engine/indexer and am
having performance issues, which brought me to learn about Xapian. We
have approximately 350 million docs/10TB data that doubles every 3
years. The data mostly consists of Oracle DB records, webpage-ish
files (HTML/XML, etc.) and office-type docs (doc, pdf, etc.). There
are anywhere from 2 to 4 dozen users on the
2007 Sep 24
5
Deprecation policy question
Olly and I have been discussing details of our policy for deprecating
features, and we thought it would be useful to canvas users opinions on
a particular aspect.
From time to time, we remove features from the library (usually because
we have a better replacement). To make life easier for users, we've
been following a deprecation policy, so that users are given plenty of
warning of
2016 Jul 12
3
Xapian 1.4.0 released
On Mon, Jul 11, 2016 at 02:02:56PM -0700, Kevin Duraj wrote:
> You are saying that when I search for "delve Xapian 1.4" on Google, a
> company worth of 491 Billion of Dollars and you saying that their top
> of the search result has nothing to do with Xapian.
>
> https://www.google.com/search?q=xapian+delve&ie=utf-8&oe=utf-8#q=delve+xapian+1.4
Well, I'm not
2018 Jul 10
2
Xapian 1.4.5 "Db block overwritten - are there multiple writers?" with Glass
On Mon, Jul 09, 2018 at 10:29:18AM +0100, Olly Betts wrote:
> The attached patch reset this cursor each time commit() is called, and
> that fixes my C++ reproducer, though I think this ought to work as-is
> and the real bug is at a lower level.
I've dug deeper and that was indeed the case. Here's a patch which
addresses the root cause:
2010 Sep 01
8
FIXMEs in Search::Xapian
Carrying on this conversation:
http://lists.tartarus.org/pipermail/xapian-discuss/2007-March/003513.html
void
TermGenerator::set_stopper(stopper)
Stopper * stopper
CODE:
// FIXME: no corresponding SvREFCNT_dec(), but a leak seems better
than
// a SEGV!
SvREFCNT_inc(ST(1));
THIS->set_stopper(stopper);
It would be good to fix these FIXMEs.
A class-level HASH could be
2018 Apr 03
3
how to build 64bit xapian using MSVC2017?
hi, Olly Betts,
Thank you for you answer.
My xapian version is 1.2.8.
Need I upgrade it to 1.4 if I want to build it with MSVC2017 x64?
lishu at fiberhome.com
From: Olly Betts
Date: 2018-03-29 11:10
To: 李澍; xapian-discuss
Subject: Re: how to build 64bit xapian using MSVC2017?
On Tue, Mar 20, 2018 at 08:28:24PM +0000, Olly Betts wrote:
> Related to this, the appveyor build is
2007 Jun 17
2
Flint failed to deliver indexing performance to Quartz.
Flint failed to deliver indexing performance to Quartz.
I am proposing to remove Flint as default database and place Quartz
database back as default. The catch is not that Flint database is
smaller and faster during searches then Quartz database as developers
were concerning when were measuring and neglecting to measure
performance when creating the large indexes.
The truth is that Flint