Displaying 20 results from an estimated 44 matches for "duraj".
Did you mean:
juraj
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
...tors and we could add one more
OP_NLP.
What we can do now is to implement OP_NLP to tagged nouns,
adjectives, adverbs, punctuations, foreign words etc. Calculate
relation between them and assign boost value to the most occurred
terms in query for example noun.
Search query example: What is Kevin Duraj doing?
OP_NLP would analyze sentence as follow:
[what = pronoun, question|is =
werb|kevin=noun|duraj=noun|doing=verb|?=punctuation]
We have nouns dominating the question. Therefore in Xapian search
engine we look first for dominating nouns in this case my name Kevin
Duraj and then within the r...
2007 Jul 17
1
BUG IN XAPIAN_FLUSH_THRESHOLD
...ngth = 13.5549
number of documents = 8580000
average document length = 13.5563
number of documents = 8590000
average document length = 13.5568
PS: Please do not ask me create smaller index and then merge them. I
am indexing 500 million documents. 20 million is my small index.
--
Cheers,
Kevin Duraj
2016 Jul 12
3
Xapian 1.4.0 released
On Mon, Jul 11, 2016 at 02:02:56PM -0700, Kevin Duraj wrote:
> You are saying that when I search for "delve Xapian 1.4" on Google, a
> company worth of 491 Billion of Dollars and you saying that their top
> of the search result has nothing to do with Xapian.
>
> https://www.google.com/search?q=xapian+delve&ie=utf-8&...
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
...difficult to tokenize
into searchable terms.
Lucene has CJK Tokenizer ... and I am looking around if there is some
open source that we could use with Xapian.
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/analysis/cjk/package-summary.html
Cheers
-Kevin Duraj
2010 Aug 23
2
NetBeans and Java Bindings
Hello,
I was wondering if anyone has succeeded in getting the Java bindings to work
with NetBeans, in order to make use of NetBeans's GUI developer. I've had no
luck so far, does anyone know how to do that?
Many thanks.
2009 Sep 30
2
C++ parser for doc.get_data() result.
...and
retrieve it as: doc.get_value(1), but I think that this has a overhead
and would show to be slower when indexing and retrieving millions of
documents. Please correct me if I am wrong, thank you.
PS: I don't have any friends so I started http://find1friend.com/
*lol* runs on Xapian!
Kevin Duraj
2007 Jul 09
7
Xapian pubmeet
Hi all,
A few of us have been discussing whether we should have a Xapian social
gathering of some kind. The current idea is meeting up in a pub in
London some time in autumn for drinks and food. However all of this
really depends on who might be able to come! It would be a chance to
meet other Xapian enthusiasts in an informal social setting and talk
about all things search-related (and
2012 Nov 14
4
xapian-replicate errors
Hi,
While trying to setup xapian replication (initially for backup
purposes), I'm encountering some errors.
Our "fresh" index starts replication, and ends up with an index size
that matches the replication master (4.5GB), but then throws :
"Getting update for fresh from fresh
xapian-replicate: NetworkError: Unable to fully synchronise: Database
changing too fast"
I
2007 Oct 11
2
Xapian 1.0.3 installation issues.
...stalled Xapian 1.0.3 and the search would not execute when run as
Apache user. I could run the search fine inside ssh. I rolled Xapian
to previous version 1.0.2 and the search still does not work even when
I put back the old index made by Xapian 1.0.2
... my search engine is out of work ...
Kevin Duraj
http://myhealthcare.com
2007 Feb 07
2
My new record: Indexing 20 millions docs = 79m9.378s
...e /home/kevin/index
---------------------------------------
number of documents = 19999995
average document length = 8.18631
PS: In my scenario after 25 million records the indexing significantly slows
down (2x-4x)
I do not know why? Could it be because of the B-Tree become very complex?
- Kevin Duraj
2016 Jul 10
3
Xapian 1.4.0 released
On Fri, Jul 08, 2016 at 06:42:23PM -0700, Kevin Duraj wrote:
> The issue is that delve was renamed to xapian-delve but documentation
> is still saying that delve is delve. Who has access to update the
> documentation?
>
> http://www.linuxfromscratch.org/blfs/view/svn/general/xapian.html
That website has nothing to do with Xapian, so...
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
...in kevin 1.2M 2010-12-18 12:19 termlist.baseB
-rw-r--r-- 1 kevin kevin 76G 2010-12-18 12:18 termlist.DB
$ delve .
number of documents = 169346678
average document length = 230970
document length lower bound = 1
document length upper bound = 3585385
highest document id ever used = 169346678
Kevin Duraj
http://pacificair.com/
2016 Jul 06
2
Xapian 1.4.0 released
I have installed the new Xapian 1.4.0 , during the installation, I
haven't seen any problems, however, when I execute commands quest and
delve I get different versions, and my Perl-based searches return
Exception: Couldn't detect type of database ... and what are these
glass things in the index directories? There is a no new version of
Perl Search::Xapian.
$ quest -version
quest -
2016 Jul 24
3
Xapian 1.4.0 released
On Fri, Jul 22, 2016 at 07:19:43PM -0700, Kevin Duraj wrote:
> I would like to propose to change the following code while indexing a
> term that is larger than 245 characters and then crashing and aborting
> the entire index, we could rather truncate the term to 245 characters
> and continue with indexing.
Kevin -- I wonder what others a...
2007 Jun 17
2
Flint failed to deliver indexing performance to Quartz.
...00000 in less then 16 hours.
Please provide settings to remove Flint and add Quartz as default
database. Unless the unacceptable indexing performance using Flint
database will be resolved.
Do not even think about to removing support for Quartz database from Xapian.
Thank you,
--
Cheers,
Kevin Duraj
2007 Feb 02
1
Working demo of search engine using boolean query.
...an queries for search
engine but I haven't seen any complete working demo. Therefore I put
together very simple working demo of search engine using boolean query. Feel
free to suggest any performance improvement or error while keeping it as
simple as possible for understanding.
Thanks,
-Kevin Duraj
http://myhealthcare.com
#------------------------------------------------------------------------------#
# Sample Data #
#------------------------------------------------------------------------------#
url=webmd.com
text=fitness health cancer
url=health.com
text=diseases health calorie disability...
2007 Oct 16
1
Xapian 1.0.3_svn9466 - OK!
...roximately
21 hours on Intel 8 core CPU with 12 GB memory
XAPIAN_FLUSH_THRESHOLD=1000000
number of documents = 52746432
average document length = 89.6394
You can visit and test Xapian 1.0.3_svn9466 search engine with 52
million of indexed web sites on http://pacific-design.com
--
Cheers
Kevin Duraj
http://pacific-design.com
Los Angeles, California
2016 Jul 09
0
Xapian 1.4.0 released
...talled Programs: copydatabase, delve, quest, simpleexpand,
simpleindex, simplesearch, xapian-check, xapian-chert-update,
xapian-compact, xapian-config, xapian-inspect, xapian-metadata,
xapian-progsrv, xapian-replicate, xapian-replicate-server and
xapian-tcpsrv
On Wed, Jul 6, 2016 at 1:15 PM, Kevin Duraj <kevin.duraj at zefr.com> wrote:
> I have installed the new Xapian 1.4.0 , during the installation, I
> haven't seen any problems, however, when I execute commands quest and
> delve I get different versions, and my Perl-based searches return
> Exception: Couldn't detect t...
2011 Apr 01
0
Xapian-discuss Digest, Vol 83, Issue 1
...r bit (mostly on
www.reportbuyer.com) together with a new wrapper for our CMS and have a
bit of spare time. I'd be happy to write up examples of how to use some
of the bindings, particularly PHP as that's my area.
> Message: 1
> Date: Thu, 31 Mar 2011 11:55:32 -0700
> From: Kevin Duraj <kevinduraj at gmail.com>
> Subject: [Xapian-discuss] Xapian Index: 607GB = 219 million of unique
> documents
> To: xapian-discuss at lists.xapian.org
> Message-ID:
> <AANLkTiku6tA06=s9hmX7nTcBHWSDfxdDgnHJuLUKhRBN at mail.gmail.com>
> Content-Type: text/plain; charse...
2008 Jul 16
3
Xapian 1.0.7 released
I've uploaded Xapian 1.0.7, which as usual you can download from:
http://xapian.org/download
This release fixes an assortment of bugs, and improves efficiency in a few
cases. It's intended to be a relatively safe incremental update over 1.0.6.
For a more detailed overview see:
http://trac.xapian.org/wiki/ReleaseOverview/1.0.7
The full lists of user-visible changes are linked to from