Displaying 20 results from an estimated 10000 matches similar to: "distinct search"
2006 Jan 23
7
Search functionality and CMS
Hello,
I am planning to build a larger Internet platform and am currently evaluating
Java EE and Rails. I have a lot of Java experience and I am quite new to
Rails. After playing with Rails for some weeks I am sure that it is a mature web
framework and I really like the productivity of Rails. One of the key
advantages is that new developers will understand this platform much quicker
than all the Java
2013 Jun 17
2
Backend for Lucene format indexes-How to get doclength
*Or do you mean that it's one number per document whereas the other stats
are per database, so it's harder to store it?*
Yes, that is what I mean. It's a huge amount of data. If I add a new doclength
list myself (one that contains all the doclengths in a list, like chert),
I am concerned about:
1. This doclength list may be the bottleneck in this backend,
http://trac.xapian.org/ticket/326
2. Change too much
2013 Jun 16
3
Backend for Lucene format indexes-How to get doclength
Hi, all:
I have written a demo patch for a backend for Lucene format indexes; the Lucene
version is 3.6.2.
http://lucene.apache.org/core/3_6_2/fileformats.html
For now, this demo patch supports only the basic features of Lucene. Compound
files (.cfs/.cfe), term vectors (.tvx/.tvd/.tvf), and
deleted documents (.del) are not supported, and the skip list in .fdx is not
supported either.
example/quest.cc is used to test this demo.
2013 Oct 30
2
Lucene 3.6.2 backend for xapian (#25)
[Replying to xapian-devel, as I think a wider audience would be useful]
On Mon, Oct 21, 2013 at 11:24:51PM +0800, jiangwen jiang wrote:
> Yes, it's less efficient. A Lucene database has multiple segments, and each
> segment can be treated as an independent database. The same term may exist in >=
> 1 segments.
Sorry for taking a while to respond - I've been both busy and mulling
this
2009 Apr 15
2
does xapian have these disadvantages?
Hi all,
I have read an article commenting on Lucene:
http://www.jroller.com/melix/entry/why_lucene_isn_t_that
The article gave me a better understanding of Lucene, especially its disadvantages and limitations.
I would like to ask whether Xapian has similar disadvantages.
Any advice would be appreciated.
baijl
2013 Aug 25
2
Backend for Lucene format indexes-How to get doclength
On Tue, Aug 20, 2013 at 07:28:42PM +0800, jiangwen jiang wrote:
> I think norm(t, d) in Lucene can be used to calculate a number which is
> similar to doc length (see norm(t,d) in
> http://lucene.apache.org/core/3_5_0/api/all/org/apache/lucene/search/Similarity.html#formula_norm).
It sounds similar (especially if document and field boosts aren't in use),
though some places may rely on
2005 Dec 14
4
Fuzzy search on a phrase
I'm trying to use Ferret to do fuzzy searches. If I use fuzzy search for
just one word, it works fine:
index.search('name:gogle~0.4')
However, if I try to use a phrase, it doesn't work:
index.search('name:"gogle search engine"~0.4')
On the other hand, I could do:
index.search('name:gogle~0.4 AND
2018 May 09
2
Search problem
Hello,
when doing a simple search through the lucene indexes in some mailboxes
(with actually many subfolders) the search terminates after a few
seconds with no result and the dovecot log shows this error:
Fatal: master: service(imap): child 15433 killed with signal 6 (core not
dumped)
Any clue about the reasons? The Lucene indexes have already been rebuilt
and look OK.
Regards
-federico
2005 Dec 14
4
Is it possible to highlight search keywords in results?
I'm wondering if Ferret has any built-in search/replace mechanism that
I might be able to use to highlight the query data in each search
result. The reason I think this would be a good idea is that I could
end up having to practically duplicate the ferret query parser just to
interpret the query so that I can figure out how to highlight the
keywords in the search results. Just in case
2011 Feb 14
1
Idea: Backend for Lucene format indexes
Hi,
I'm interested in implementing the idea of using search indexes built by
Lucene. I have some ideas about how to do that. I'm currently studying the index
formats of both. I have written a search application using Lucene and now
I'll do the same using Xapian. Then I'll check the details of those index
formats and work out how to convert a Lucene index to a Xapian one. Is there
somebody
2006 Feb 22
2
Chinese search support
I need to decide whether our site will go with Java or Ruby on Rails. The
major factor is whether Ferret supports Lucene's ChineseAnalyzer or
CJKAnalyzer.
Can anybody shed some light on Ferret's Chinese search support?
I would really appreciate it.
--
Posted via http://www.ruby-forum.com/.
2006 Dec 10
1
Full text search indexing
There are now two full text search backends in CVS HEAD:
- squat: My own full text search index based on ideas from Cyrus Squat
indexes. Supports substring searches.
- lucene: Uses CLucene library.
It should be pretty easy to add support for more backends.
As I've explained a couple of times already, IMAP RFC says that searches
are done by matching substrings, so the Lucene backend
2003 Aug 29
2
better body and text search?
(this is a resend, sorry if duplicate but my first post didn't get through).
I was wondering what plans dovecot has for text and body search?
This is one of those issues that *no* imap implementation ever
seems to document :(.
Not cyrus, courier, bincimap, or dovecot....
I speed-read some of the sources and found src/lib-mail/message-body-search.c
After a quick scan, it seems:
- it uses no
2004 Oct 28
1
Lucene ranking
Kevin Burton has posted about poor ranking in Lucene preferring
shorter documents over longer ones[1]. A similar search in Xapian
returns documents in the expected order:
Performing query `Xapian::Query(foo)'
3 results found
ID 3 99% [foo foo foo]
ID 2 94% [foo foo]
ID 1 80% [foo]
Anyone know what Lucene is doing here? Their FAQ doesn't mention what
weighting scheme they use, and I
2023 Mar 28
1
uncaught C++ exception from Perl Search::Xapian XS?
On Mon, Mar 27, 2023 at 11:46:04AM +0000, Eric Wong wrote:
> I'm using Search::Xapian XS from Debian stable and I'm getting what
> appears to be an unwrapped C++ exception:
>
> terminate called after throwing an instance of 'Xapian::DatabaseModifiedError'
>
> Using `eval' from Perl5 doesn't seem effective in catching it.
>
> I'm using
2012 Oct 15
1
[BUG] Lucene plugin breaks header substring search
According to the IMAP spec if I do a search for "TO isocpp.org" it
should find all the messages whose To: field contains the string
"isocpp.org", but dovecot is returning me an empty list. However, a
search for "TO tm at isocpp.org" produces a long list of messages. This
behavior is present if I *even load* the lucene fts plugin.
Note that lucene isn't in use
2015 May 08
1
Full text search indexes not used for header/body OR queries?
I've noticed that when using Lucene full text search, most queries use
the indexes and/or header cache and are fast:
. SEARCH BODY test
. OK Search completed (0.001 secs).
. SEARCH SUBJECT test
. OK Search completed (0.053 secs).
. SEARCH BODY test SUBJECT test
. OK Search completed (0.002 secs).
. SEARCH OR SUBJECT test FROM test
. OK Search completed (0.093 secs).
But an OR query that
2011 Aug 02
2
Positive experiences with Xapian
Hi Guys,
I just wanted to take a moment to give some positive feedback regarding my
experiences with Xapian recently.
I've been doing a fair amount of research into search engines recently, as
we have some fairly specific requirements with what we're attempting to do
with them. Long story short, after a few weeks of playing around with just
about everything under the sun (or at least,
2007 Apr 02
5
any support for wordsegment search?
Does anybody know whether Ferret or acts_as_ferret supports word-segment
search,
like Lucene does?
I want to know; if not, I will use Lucene instead.
I can't find relevant documentation on this in Ruby.
--
Posted via http://www.ruby-forum.com/.