Displaying 20 results from an estimated 21 matches for "myhealthcare".
2007 Nov 08
0
Xapian Search Websites Listings
Xapian Search Websites Listings,
I come across Xapian Search Websites Listings for Xapian search engines.
http://xapian.org/users.php
Can you please ad MyHealthcare.com search engine to section: Search Websites
MyHealthcare.com using Xapian to crawl and search 50 million web sites
on single 1U server.
MyHealthcare.com
Url: http://myhealthcare.com
General web search engine with 50 million websites.
Thank you,
Kevin Duraj
2007 Feb 02
1
Working demo of search engine using boolean query.
...r search
engine but I haven't seen any complete working demo. Therefore I put
together very simple working demo of search engine using boolean query. Feel
free to suggest any performance improvement or error while keeping it as
simple as possible for understanding.
Thanks,
-Kevin Duraj
http://myhealthcare.com
#------------------------------------------------------------------------------#
# Sample Data #
#------------------------------------------------------------------------------#
url=webmd.com
text=fitness health cancer
url=health.com
text=diseases health calorie disability
url=healthfinder.g...
2011 Mar 31
0
Xapian Index: 607GB = 219 million of unique documents
...documents, while testing
Lucene, Solr, MySQL, Hadoop and Oracle. Probably that would be the
real reason why Xapian was not approved last year, for Google's Summer
of Code. Xapian is the type of open source that they don't want you to
know about.
Following index can be search from: http://myhealthcare.com/
total 607G
-rw-r--r-- 1 kevin kevin 28 2011-03-31 06:09 iamchert
-rw-r--r-- 1 kevin kevin 14 2011-03-31 01:50 position.baseA
-rw-r--r-- 1 kevin kevin 622K 2011-03-31 06:09 position.baseB
-rw-r--r-- 1 kevin kevin 311G 2011-03-31 06:09 position.DB
-rw-r--r-- 1 kevin kevin 14 2011-03-30 17...
2011 May 13
0
Xapian Index 253 million documents = 704G
...le processor 2.0 GHz. I do not see any search performance
decreases in searching my indexes between 100 million and 250 million,
which indicates a good scalability of Xapian and it looks like, I can
push it easily forwards 300 million documents on single Index.
You can check it yourself at: http://myhealthcare.com/
number of documents = 253717716
average document length = 35670.3
document length lower bound = 1
document length upper bound = 181656
highest document id ever used = 253717716
total 704G
-rw-r--r-- 1 kevin kevin 28 2011-05-13 08:30 iamchert
-rw-r--r-- 1 kevin kevin 14 2011-05-13 03:28 p...
2011 Apr 01
0
Xapian-discuss Digest, Vol 83, Issue 1
...Lucene, Solr, MySQL, Hadoop and Oracle. Probably that would be the
> real reason why Xapian was not approved last year, for Google's Summer
> of Code. Xapian is the type of open source that they don't want you to
> know about.
>
> Following index can be search from: http://myhealthcare.com/
>
> total 607G
> -rw-r--r-- 1 kevin kevin 28 2011-03-31 06:09 iamchert
> -rw-r--r-- 1 kevin kevin 14 2011-03-31 01:50 position.baseA
> -rw-r--r-- 1 kevin kevin 622K 2011-03-31 06:09 position.baseB
> -rw-r--r-- 1 kevin kevin 311G 2011-03-31 06:09 position.DB
> -rw-r--r...
2007 Oct 01
3
How to beat Google aka Xapian & Natural Language Processing.
...erb|?=punctuation]
We have nouns dominating the question. Therefore in Xapian search
engine we look first for dominating nouns in this case my name Kevin
Duraj and then within the result we search for next dominant verb and
pronoun.
PS: Can you see the future?
--
Cheers
Kevin Duraj
http://MyHealthcare.com
Los Angeles, California
2012 Nov 14
4
xapian-replicate errors
Hi,
While trying to setup xapian replication (initially for backup
purposes), I'm encountering some errors.
Our "fresh" index starts replication, and ends up with an index size
that matches the replication master (4.5GB), but then throws :
"Getting update for fresh from fresh
xapian-replicate: NetworkError: Unable to fully synchronise: Database
changing too fast"
I
2010 Dec 18
1
Xapian index size 475GB = 170 million documents (URLs)
Xapians,
I am maintaining about two indexes for my search engines which
approximately is each the same size. I would like to share this
knowledge with you, since many of you have never seen Xapian index of
this size. And of course you can search the index by yourself at
- http://myhealthcare.com/
- http://find1friend.com/
I need 2 x 100 million more documents into each index, and I hope it
will fit on one hard disk of 2TB, and I will soon beat single handedly
the largest Xapian BrightStation's Webtop search engine implementation
(archive.org snapshot), which offered a sub-second s...
2011 Apr 02
1
Xapian docs (was Re: Xapian-discuss Digest, Vol 83, Issue 2)
..., Hadoop and Oracle. Probably that would be the
>> real reason why Xapian was not approved last year, for Google's Summer
>> of Code. Xapian is the type of open source that they don't want you to
>> know about.
>>
>> Following index can be search from: http://myhealthcare.com/
>>
>> total 607G
>> -rw-r--r-- 1 kevin kevin 28 2011-03-31 06:09 iamchert
>> -rw-r--r-- 1 kevin kevin 14 2011-03-31 01:50 position.baseA
>> -rw-r--r-- 1 kevin kevin 622K 2011-03-31 06:09 position.baseB
>> -rw-r--r-- 1 kevin kevin 311G 2011-03-31 06:09 p...
2007 Oct 11
2
Xapian 1.0.3 installation issues.
...n 1.0.3 and the search would not execute when run as
Apache user. I could run the search fine inside ssh. I rolled Xapian
to previous version 1.0.2 and the search still does not work even when
I put back the old index made by Xapian 1.0.2
... my search engine is out of work ...
Kevin Duraj
http://myhealthcare.com
2012 Dec 08
2
Want to contribute code to the Xapian project
Hey guys,I am a 3rd year Computer Science undergrad student.I a extremely
interested in contributing code to the XAPIAN project. The work you people
do sounds extremely fascinating and interesting.Can someone just give me a
brief overview of how to proceed ?. I Can code in C,C++ and Python and
have experience in Natural Lanuage Processing.Am also quite comfortable
with NLTK and using Wordnet.Am
2011 May 12
2
Xapian support for huge data sets?
Hello,
I?m currently using another open source search engine/indexer and am
having performance issues, which brought me to learn about Xapian. We
have approximately 350 million docs/10TB data that doubles every 3
years. The data mostly consists of Oracle DB records, webpage-ish
files (HTML/XML, etc.) and office-type docs (doc, pdf, etc.). There
are anywhere from 2 to 4 dozen users on the
2010 Aug 23
2
NetBeans and Java Bindings
Hello,
I was wondering if anyone has succeeded in getting the Java bindings to work
with NetBeans, in order to make use of NetBeans's GUI developer. I've had no
luck so far, does anyone know how to do that?
Many thanks.
2008 Jul 16
3
Xapian 1.0.7 released
I've uploaded Xapian 1.0.7, which as usual you can download from:
http://xapian.org/download
This release fixes an assortment of bugs, and improves efficiency in a few
cases. It's intended to be a relatively safe incremental update over 1.0.6.
For a more detailed overview see:
http://trac.xapian.org/wiki/ReleaseOverview/1.0.7
The full lists of user-visible changes are linked to from
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
Hi,
I am looking for Chinese Japanese and Korean tokenizer that could can
be use to tokenize terms for CJK languages. I am not very familiar
with these languages however I think that these languages contains one
or more words in one symbol which it make more difficult to tokenize
into searchable terms.
Lucene has CJK Tokenizer ... and I am looking around if there is some
open source that we
2007 Jul 09
7
Xapian pubmeet
Hi all,
A few of us have been discussing whether we should have a Xapian social
gathering of some kind. The current idea is meeting up in a pub in
London some time in autumn for drinks and food. However all of this
really depends on who might be able to come! It would be a chance to
meet other Xapian enthusiasts in an informal social setting and talk
about all things search-related (and
2008 Oct 09
3
Sorting results by a "sort expression"
Olly,
We currently use Sphinx for our website search function, but we're planning
on using Xapian instead for a few of the extra features it has. Our website
is written in Ruby on Rails, so of course we're using Xapian with Ruby
bindings. I don't know if you're familiar with Sphinx but Sphinx allows you
to pass a sort expression when you execute the search that will be evaluated
2008 Apr 24
3
how to delete all document from the DB (without deleting the DB itself)
Hello,
I'm still testing PHP5 bindings and I could'nt find a way to delete
all documents from a DB without deleting other informations stored in
the DB such as synonyms.
Since the process of adding synonyms is time consuming, I would like
to use the same DB but restart my test without any document in the DB,
is this possibile?
I could'nt find a delete_all or a method like that, nor
2012 Mar 09
3
128 bit Document IDs (Please don't hurt me)
I apologize for what may be a sore subject. 4 billion documents is a
heck of a lot. 64 bit vs 32 bit would be an incredibly large database
with an average document and term size. Why 128 bit? Simply for
address space.
Mapping a UUID (128 bit) or MongoDB ObjectID (96 bit) directly into
the Xapian document space removes the need for referencing one or the
other from one or both. I see a common
2007 Jun 17
2
Flint failed to deliver indexing performance to Quartz.
Flint failed to deliver indexing performance to Quartz.
I am proposing to remove Flint as default database and place Quartz
database back as default. The catch is not that Flint database is
smaller and faster during searches then Quartz database as developers
were concerning when were measuring and neglecting to measure
performance when creating the large indexes.
The truth is that Flint