similar to: Positive experiences with Xapian

2005 Nov 19
14
FULLTEXT search in MySQL on rails
I tried adding a FULLTEXT search index to a table of mine in MySQL, only to discover that the InnoDB table format doesn't seem to support this feature. Switching to the MyISAM table type seemed to work, but I seem to have some recollection that ActiveRecord transactions aren't fully atomic on MyISAM tables. Is this true or am I just remembering wrong? If so, does anyone have
2019 Jul 04
2
solr vs fts
>> A few clients have 25K or more e-mails >> >> I am thinking about using solr like: >> fts = solr >> fts_solr = debug url=http://IP:8983/solr/ (solr on an external machine) >> >> Does it make sense to use dovecot_indexes and fts? >> What is the difference in performance? >> > Hi! > > Dovecot indexes are not actually related to FTS that
2008 Sep 03
4
delta index in Sphinx
Hello, all! Please help me solve a problem with Sphinx and its delta index. The configuration file is attached to this topic. ------------------------------------------------------------------- mysql> select id, e_mail from users where e_mail LIKE '%test%'; -------------------------------------------------------------------
2006 Feb 07
15
So, this search thing...
I am using ferret right now, and it works great for all my regular text documents/information. My problem arises when I want to index/search all of our assets (mostly pdf files). Currently, there is no way to READ pdfs from Ruby. Because of this I have to resort to using Java to read the PDFs and then Lucene to index them. My problem here is a couple of things. One, to index an asset I have
2008 Dec 18
5
distributing a rails application
What are the standard best practices for sharing your rails application on the web? As far as I can tell, you just delete the logs directory and clear out the database.yml What am I forgetting?
2020 May 19
5
FTS-lucene errors : language not available for stemming
I'm getting some log errors with clucene that I am having no luck tracking down on the interwebs. Errors: May 19 05:05:16 indexer-worker(gessel at blackrosetech.com)<62971><aPAEI3zLw17A/QAA0J78UA:EF25M3zLw1779QAA0J78UA>: Error: lucene index /mail/blackrosetech.com/gessel//lucene-indexes: IndexWriter::addDocument() failed (#4): language not available for stemming May 19 05:05:16
2015 Oct 27
2
Dovecot frequently full-text reindexes the whole maildir
Hi! I'm using Dovecot 2.2.13 (Debian Jessie package 1:2.2.13-12~deb8u1) on my personal mail server (the address I'm writing from is on this server). I use Maildirs, I have fts + fts_squat enabled, and I have had a problem with it for a long time - dovecot does not always seem to update the index "incrementally". Yesterday I finally ran a test by telnetting to the imap port and
2009 Mar 13
6
Using LIKE
I've heard that using LIKE is very slow, but I see it being used a lot in examples, blogs etc. Is it really that bad? Since Rails doesn't directly support Fulltext search, this is the easiest way to get searching done, right? Or are there any other easier ways? I'm using acts_as_indexed right now, but it still doesn't do what LIKE does.
2007 Nov 16
18
Multithreading / multiprocessing woes
I've been running some multithreaded tests on Ferret. Using a single Ferret::Index::Index inside a DRb server, it definitely behaves for me as if all readers are locked out of the index when writing is going on in that index, not just optimization -- at least when segment merging happens, which is when the writes take the longest and you can therefore least afford to lock out all reads.
2009 Apr 12
2
Indexing speed benchmark - Xapian, Solr
I came across this benchmark between Xapian & Solr: http://www.anur.ag/blog/2009/03/xapian-and-solr/ According to the benchmark, a doc set that took Solr 34 min to index took Xapian 7 hours. Solr's index is also much smaller - 2.5GB to Xapian's 8.9GB. I'm new to Xapian. Just wondering if results like these are typical? Is indexing speed & size a known issue in Xapian? Or is
2019 Jan 12
2
Solr -> Xapian ?
Thank you. Now, for the results I see the member of fts_result is: ARRAY_TYPE(seq_range) definite_uids; I have the UIDs as an array of uint32_t *. How do I put my UIDs into this "definite_uids"? Obviously this is not a simple array/pointer. How do I say something similar to result->definite_uids[1]=my_uid ? On 2019-01-12 10:25, Timo Sirainen wrote: > On 11 Jan 2019, at 21.23,
2011 Oct 24
1
reindexing
Hello I just jumped into xapian, so my question could be quite basic, but I haven't found anything really useful in the list. I'm using xapian to make a fulltext search on our local servers, including some pdf's and doc files, but also our own filetypes. Anyway, I managed to write an indexer in php and it works fine. My problem is however, that I have to reindex everything when just
2020 Jun 23
2
dovecot fts configuration without solr
Hello All, I always used dovecot and I'm pretty happy with it, but recently I changed OS (to ubuntu 18) and to get dovecot LTS with lucene (the default I was using on my previous arch installation) I needed to build from source the version 2.3.10 (the one that came with the repos was older and clucene was not actually supported) The problem is that the Clucene is not behaving well, It
2006 Jun 01
8
Windows progress
Hi there, What's the current status of the Windows port? I may be in a position to lend a hand over the next couple of weeks - where should I start looking? And what's the best way to get SVN HEAD? This happens: $ svn checkout svn://www.davebalmain.com/ferret/trunk ferret svn: Can't connect to host 'www.davebalmain.com': Connection refused --
2019 Jan 12
2
Solr -> Xapian ?
I somehow fixed the folder issue. (seems some unix rights after too many tests) Getting back on the "fts_results" structure: I am trying: i_array_init(&(result->definite_uids),r->size); i_array_init(&(result->maybe_uids),0); uint32_t uid; for(i=0;i<r->size;i++) { try {
2020 Feb 04
5
Strategy for fts and Replication
Hi Philon, Thanks a lot for your thoughts! Can I ask you if using Solr improved things for you? I have a mailbox with 15 years of e-mail and searching things takes a long time. On 04.02.2020 09:39, Philon wrote: > Hi Francis, > > next to fts-solr there was fts-lucene. But that Lucene there seems > heavily outdated why the Dovecot docs also suggest using Solr. > Elasticsearch
2009 Apr 30
2
how to search all fields for a string in SQL ?
Using SQL or activerecord I am not sure how to search all fields for a specific value. I know how to do this by just writing ruby code. To do it in SQL or an activerecord find w/condition, I don't know how. I actually am supposed to combine a find w/ condition and a search of all fields for a value and AND those together. I think perhaps it can't be done with a single find?
2012 Jun 16
3
question about fts_squat
Just installed dovecot with fts_squat, config attached after message. When I telnet to the imap server and execute by hand 1 login user password select foldername search body "someword" it works fine, and at blazing speed except on the first run (indexing). I already indexed everything with doveadm index offline to prevent server overload if multiple users (after I tell them) will try fulltext
2007 Nov 15
8
Ferret/AAF Stability?
Hello. I'm the author of DataMapper (http://datamapper.org), and am trying to choose what Full-Text-Indexing engine/plugin I want to include by default. I was hoping you guys could help. :-) Sphinx comes highly recommended, but without live index updates, it just doesn't seem practical for most of my work. I'm most experienced with Solr, but the whole HTTP::Request and
2013 Jun 17
2
Backend for Lucene format indexes-How to get doclength
*Or do you mean that it's one number per document whereas the other stats are per database, so it's harder to store it?* Yes, that is what I mean. It's a huge amount of data. If I add a new doclength list myself (containing all the doclengths in one list, like chert), I am concerned about: 1. This doclength list may be the bottleneck in this backend, http://trac.xapian.org/ticket/326 2. Change too much