Displaying 20 results from an estimated 1000 matches similar to: "Google launches social search copied idea from Find1Friend.com running on Xapian!"
2010 Mar 29
0
Optimal usage of xapian-compact for merging
On Tue, March 23, 2010 19:46, Kevin Duraj wrote:
> I am merging 300 indexes at once, it takes less than a day for merge
> to happen for 100 million documents, during merging I notice very heavy IO.
That IO sounds pretty normal. To help with IO load, we have a dedicated
index store cluster, dedicated source data cluster, dedicated indexing
cluster, etc. Sigh. Each time I think we have
2009 Sep 30
2
C++ parser for doc.get_data() result.
Xapians!
Did anybody wrote and would like to share a routines that parse result
from doc.get_data() into some key and pair values in C++ ?
Code:
Xapian::Document doc = i.get_document();
string data = doc.get_data();
mymap = parse_result(data);
As you know the data string contain all the data within the document
delimited by "=" sign and "\n" new line and needs to be parse
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
Xapians,
I am maintaining about two indexes for my search engines which
approximately is each the same size. I would like to share this
knowledge with you, since many of you have never seen Xapian index of
this size. And of course you can search the index by yourself at
- http://myhealthcare.com/
- http://find1friend.com/
I need 2 x 100 million more documents into each index, and I hope it
will
2011 Feb 15
1
Anybody succeeded to run PhotoInstrument
Hi,,
Anybody succeeded to run photoinstrument in wine,, I've been struggling with it for so long no good news, the screen keeps disappearing,
This is the link,,
http://www.photoinstrument.com/index.html
for those who do not know photoinstrument
PhotoInstrument is a powerful and easy-to-learn raster graphics editor used to process digital photographs. With only a few clicks, anyone can
2010 Aug 23
2
NetBeans and Java Bindings
Hello,
I was wondering if anyone has succeeded in getting the Java bindings to work
with NetBeans, in order to make use of NetBeans's GUI developer. I've had no
luck so far, does anyone know how to do that?
Many thanks.
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
Xapians!
If tomorrow Xapian search engine would achieved the same performance
and result in searches as Google we would not be able to beat Google,
because we would create only a copy of the searches that already
exists from Google search engine. However there is a way to beat
anyone, and there is a way to beat Google successfully as well just do
not give up. Some see it as implementing Ajax, or
2007 Jul 17
1
BUG IN XAPIAN_FLUSH_THRESHOLD
There is is bug when setting XAPIAN_FLUSH_THRESHOLD=20000000
When trying for force Xapian flush documents to flush after 20 million
documents Xapian ignores the size and flush it after only 10,000
documents.
Data captured from delve after 60 seconds interval when has been set as follow:
XAPIAN_FLUSH_THRESHOLD=20000000
perl -e ' while(1) { system("delve ."); sleep(60); } '
2007 Feb 07
2
My new record: Indexing 20 millions docs = 79m9.378s
Gentoo Linux 2.6
8 AMD Opteron 64-bit Processors
32GB Memory
--------------------------------------------------------------------------------
Environment:
------------------
XAPIAN_FLUSH_THRESHOLD=21000000
XAPIAN_FLUSH_THRESHOLD_LENGTH=16000000
XAPIAN_PREFER_FLINT=True
Indexing 20 million documents:
--stemmer=none
-------------------------------------------
real 79m9.378s
user 77m28.696s
2009 Apr 03
2
how to sort and plot data?
hi,
Previously my email very simple and less information. Let say I have raw
data as below:-
user_id website time
20 google 0930
21 yahoo 0935
20 facebook 1000
25 facebook 1015
61 google 0940
60 yahoo 1050
40 friendster 1115
61 google 1200
40
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
Hi,
I am looking for Chinese Japanese and Korean tokenizer that could can
be use to tokenize terms for CJK languages. I am not very familiar
with these languages however I think that these languages contains one
or more words in one symbol which it make more difficult to tokenize
into searchable terms.
Lucene has CJK Tokenizer ... and I am looking around if there is some
open source that we
2009 May 02
1
Re: no internet
ok... i ran the program like it said to... and here's what it did...
btw...sorry to be a noob... been ten years since i was introduced to linux, so i'm having to work with trial and error...lol...
~/.wine/drive_c/Program Files/MySpace/IM$ wine MySpaceIM.exe
fixme:win:EnumDisplayDevicesW ((null),0,0x32f7ec,0x00000000), stub!
fixme:wtsapi:WTSRegisterSessionNotification Stub 0x10030
2008 May 06
16
Run-Time Error 445
Application: Friend Blaster Pro 10.0.1
Download Link: http://www.addnewfriends.com/download/friendblasterpro_v10_0_1.zip
Wine Version: 0.9.61
OS: Ubuntu 8.04 Hardy
Kernel: 2.6.24-16-generic
Laptop: HP DV9220us 1.6 Dual Core, 2gb Ram, GeFrorce Go 7600
I tried running $ wine friendblasterpro.exe &> /tmp/log.txt 2>&1 and it appears to do nothing, unless the text file it's supposed
2006 Mar 13
0
Adoppt Ruby on Rails: A social Networking Platform
So some of you here may have been wondering just what the heck I have been working on and asking all of these questions, some really basic quesitons, other very advanced.
Well, I''m writing this post to tell you what I have been working on.
I am an experienced PHP developer who with this project has totally converted to Rails.
Despite my years of experience with PHP, I
2008 Dec 19
0
MySpace.com clone - is it possible with Ruby on Rails?
Hi, I wanna know wich of the well known frameworks is the best choice
for myspace-clone community website.
CakePHP ? Zend Framework? Symfony?
Ruby on Rails :) ?
Django?
or maybe
Kohana? Prado? or something else?
Is it better to use PHP or Ruby or Python? I will pay a webdesigner
for this job, but I want to know which lang and framework is in your
opinion the best for this kind of application.
2013 Mar 20
0
[LLVMdev] Debug info for namespaces and "using"
I've noticed the work going on to support namespace and "using" directives.
Awesome that this is happening, I've had a couple users wonder about it.
(The main issue actually seems to be with anonymous namespaces; gdb will
auto-import those declarations, but our debugger doesn't.)
I thought I should also bring up another aspect of how Clang/LLVM handles
namespaces, in case
2011 Feb 03
1
RE : a server that offers free icecast includes bandwidth for free
You should contact them on their irc channel , they will open the
mountpoint for you I guess .
Le 3 f?vr. 2011 00:18, Timothy Clark <timothyclark113 at gmail.com> a ?crit :
> how would I get around the code also known as a capsha as I am a blind
> individual.Thanks
> On Wed, Feb 2, 2011 at 6:05 PM, michel memeteau
> michel.memeteau at gmail.com> wrote:
>
2016 Jul 09
0
Xapian 1.4.0 released
The issue is that delve was renamed to xapian-delve but documentation
is still saying that delve is delve. Who has access to update the
documentation?
http://www.linuxfromscratch.org/blfs/view/svn/general/xapian.html
Installed Programs: copydatabase, delve, quest, simpleexpand,
simpleindex, simplesearch, xapian-check, xapian-chert-update,
xapian-compact, xapian-config, xapian-inspect,
2007 Jan 06
1
Hosting Music for Free
I would like to be able to host music not by myself for free. The only
problem is, I can't find a place that will do it. When I want to share a
song with a forum, I am currently forced to use RapidShare, MegaUpload, or
UploadingIt, all of which causes problems for somebody. I would like to be
able to host a song, have a link to click on it, and it doesn't have to be
downloaded and cause
2011 Apr 01
0
Xapian-discuss Digest, Vol 83, Issue 1
I think this is a shining example of how well Xapian works with large
document collections. I was just discussing this with my colleagues here
and one of the issues that came up is that we'd love Xapian to become
really lot more popular but have found that the documentation's a bit
difficult to get into, as is the API.
So I was wondering: do you have any thoughts on improving this and
2006 Oct 03
1
recipe for a myspace type clone?
loginGenrator for accounts and custom views
ferret for search engine
acts_as_attachment with imagick
backroundRb for DB cleanup
anything else one would use for a social networking site? and would the
above plugins be ''good enough'' for a much smaller instance of myspace?
looking to have around 1000 or so members.
--
Posted via http://www.ruby-forum.com/.