search for: duraj

Displaying 20 results from an estimated 44 matches for "duraj".

Did you mean: juraj
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
...tors and we could add one more OP_NLP. What we can do now is to implement OP_NLP to tagged nouns, adjectives, adverbs, punctuations, foreign words etc. Calculate relation between them and assign boost value to the most occurred terms in query for example noun. Search query example: What is Kevin Duraj doing? OP_NLP would analyze sentence as follow: [what = pronoun, question|is = werb|kevin=noun|duraj=noun|doing=verb|?=punctuation] We have nouns dominating the question. Therefore in Xapian search engine we look first for dominating nouns in this case my name Kevin Duraj and then within the r...
2007 Jul 17
1
BUG IN XAPIAN_FLUSH_THRESHOLD
...ngth = 13.5549 number of documents = 8580000 average document length = 13.5563 number of documents = 8590000 average document length = 13.5568 PS: Please do not ask me create smaller index and then merge them. I am indexing 500 million documents. 20 million is my small index. -- Cheers, Kevin Duraj
2016 Jul 12
3
Xapian 1.4.0 released
On Mon, Jul 11, 2016 at 02:02:56PM -0700, Kevin Duraj wrote: > You are saying that when I search for "delve Xapian 1.4" on Google, a > company worth of 491 Billion of Dollars and you saying that their top > of the search result has nothing to do with Xapian. > > https://www.google.com/search?q=xapian+delve&ie=utf-8&...
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
...difficult to tokenize into searchable terms. Lucene has CJK Tokenizer ... and I am looking around if there is some open source that we could use with Xapian. http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/analysis/cjk/package-summary.html Cheers -Kevin Duraj
2010 Aug 23
2
NetBeans and Java Bindings
Hello, I was wondering if anyone has succeeded in getting the Java bindings to work with NetBeans, in order to make use of NetBeans's GUI developer. I've had no luck so far, does anyone know how to do that? Many thanks.
2009 Sep 30
2
C++ parser for doc.get_data() result.
...and retrieve it as: doc.get_value(1), but I think that this has a overhead and would show to be slower when indexing and retrieving millions of documents. Please correct me if I am wrong, thank you. PS: I don't have any friends so I started http://find1friend.com/ *lol* runs on Xapian! Kevin Duraj
2007 Jul 09
7
Xapian pubmeet
Hi all, A few of us have been discussing whether we should have a Xapian social gathering of some kind. The current idea is meeting up in a pub in London some time in autumn for drinks and food. However all of this really depends on who might be able to come! It would be a chance to meet other Xapian enthusiasts in an informal social setting and talk about all things search-related (and
2012 Nov 14
4
xapian-replicate errors
Hi, While trying to setup xapian replication (initially for backup purposes), I'm encountering some errors. Our "fresh" index starts replication, and ends up with an index size that matches the replication master (4.5GB), but then throws : "Getting update for fresh from fresh xapian-replicate: NetworkError: Unable to fully synchronise: Database changing too fast" I
2007 Oct 11
2
Xapian 1.0.3 installation issues.
...stalled Xapian 1.0.3 and the search would not execute when run as Apache user. I could run the search fine inside ssh. I rolled Xapian to previous version 1.0.2 and the search still does not work even when I put back the old index made by Xapian 1.0.2 ... my search engine is out of work ... Kevin Duraj http://myhealthcare.com
2007 Feb 07
2
My new record: Indexing 20 millions docs = 79m9.378s
...e /home/kevin/index --------------------------------------- number of documents = 19999995 average document length = 8.18631 PS: In my scenario after 25 million records the indexing significantly slows down (2x-4x) I do not know why? Could it be because of the B-Tree become very complex? - Kevin Duraj
2016 Jul 10
3
Xapian 1.4.0 released
On Fri, Jul 08, 2016 at 06:42:23PM -0700, Kevin Duraj wrote: > The issue is that delve was renamed to xapian-delve but documentation > is still saying that delve is delve. Who has access to update the > documentation? > > http://www.linuxfromscratch.org/blfs/view/svn/general/xapian.html That website has nothing to do with Xapian, so...
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
...in kevin 1.2M 2010-12-18 12:19 termlist.baseB -rw-r--r-- 1 kevin kevin 76G 2010-12-18 12:18 termlist.DB $ delve . number of documents = 169346678 average document length = 230970 document length lower bound = 1 document length upper bound = 3585385 highest document id ever used = 169346678 Kevin Duraj http://pacificair.com/
2016 Jul 06
2
Xapian 1.4.0 released
I have installed the new Xapian 1.4.0 , during the installation, I haven't seen any problems, however, when I execute commands quest and delve I get different versions, and my Perl-based searches return Exception: Couldn't detect type of database ... and what are these glass things in the index directories? There is a no new version of Perl Search::Xapian. $ quest -version quest -
2016 Jul 24
3
Xapian 1.4.0 released
On Fri, Jul 22, 2016 at 07:19:43PM -0700, Kevin Duraj wrote: > I would like to propose to change the following code while indexing a > term that is larger than 245 characters and then crashing and aborting > the entire index, we could rather truncate the term to 245 characters > and continue with indexing. Kevin -- I wonder what others a...
2007 Jun 17
2
Flint failed to deliver indexing performance to Quartz.
...00000 in less then 16 hours. Please provide settings to remove Flint and add Quartz as default database. Unless the unacceptable indexing performance using Flint database will be resolved. Do not even think about to removing support for Quartz database from Xapian. Thank you, -- Cheers, Kevin Duraj
2007 Feb 02
1
Working demo of search engine using boolean query.
...an queries for search engine but I haven't seen any complete working demo. Therefore I put together very simple working demo of search engine using boolean query. Feel free to suggest any performance improvement or error while keeping it as simple as possible for understanding. Thanks, -Kevin Duraj http://myhealthcare.com #------------------------------------------------------------------------------# # Sample Data # #------------------------------------------------------------------------------# url=webmd.com text=fitness health cancer url=health.com text=diseases health calorie disability...
2007 Oct 16
1
Xapian 1.0.3_svn9466 - OK!
...roximately 21 hours on Intel 8 core CPU with 12 GB memory XAPIAN_FLUSH_THRESHOLD=1000000 number of documents = 52746432 average document length = 89.6394 You can visit and test Xapian 1.0.3_svn9466 search engine with 52 million of indexed web sites on http://pacific-design.com -- Cheers Kevin Duraj http://pacific-design.com Los Angeles, California
2016 Jul 09
0
Xapian 1.4.0 released
...talled Programs: copydatabase, delve, quest, simpleexpand, simpleindex, simplesearch, xapian-check, xapian-chert-update, xapian-compact, xapian-config, xapian-inspect, xapian-metadata, xapian-progsrv, xapian-replicate, xapian-replicate-server and xapian-tcpsrv On Wed, Jul 6, 2016 at 1:15 PM, Kevin Duraj <kevin.duraj at zefr.com> wrote: > I have installed the new Xapian 1.4.0 , during the installation, I > haven't seen any problems, however, when I execute commands quest and > delve I get different versions, and my Perl-based searches return > Exception: Couldn't detect t...
2011 Apr 01
0
Xapian-discuss Digest, Vol 83, Issue 1
...r bit (mostly on www.reportbuyer.com) together with a new wrapper for our CMS and have a bit of spare time. I'd be happy to write up examples of how to use some of the bindings, particularly PHP as that's my area. > Message: 1 > Date: Thu, 31 Mar 2011 11:55:32 -0700 > From: Kevin Duraj <kevinduraj at gmail.com> > Subject: [Xapian-discuss] Xapian Index: 607GB = 219 million of unique > documents > To: xapian-discuss at lists.xapian.org > Message-ID: > <AANLkTiku6tA06=s9hmX7nTcBHWSDfxdDgnHJuLUKhRBN at mail.gmail.com> > Content-Type: text/plain; charse...
2008 Jul 16
3
Xapian 1.0.7 released
I've uploaded Xapian 1.0.7, which as usual you can download from: http://xapian.org/download This release fixes an assortment of bugs, and improves efficiency in a few cases. It's intended to be a relatively safe incremental update over 1.0.6. For a more detailed overview see: http://trac.xapian.org/wiki/ReleaseOverview/1.0.7 The full lists of user-visible changes are linked to from