Hello,
I'm working on a document manager using Xapian. If you are familiar
with, it will hopefully be something similar to Devonthink on Mac.
There are two crucial features, finding similar documents and
suggesting an existing folder for a document. Xapian website claims to
do that:
"Relevance feedback - given one or more documents, Xapian can suggest
the most relevant index terms to expand a query, suggest related
documents, categorise documents, etc."
Currently to find related documents I get a document, create a one
item rset, get eset of max size 100 of it and search those terms to
get a list of documents. Here is the code:
Xapian::RSet rSelection=Xapian::RSet();
rSelection.add_document(currentDocID);
Xapian::ESet eset = enquire.get_eset(100, rSelection);
Xapian::Query query(Xapian::Query::OP_OR, eset.begin(), eset.end());
enquire.set_query(query);
Xapian::MSet matches = enquire.get_mset(0, 10);
and here are my questions:
1) Is there a better way to get similar documents for a given document?
2) Is there way to suggest a folder for a given document to be
classified in?
Thank you very much in advance.
--
Serkan Cabi
MIT Center for Theoretical Physics