Hello,

Is there a chance to determine similar documents like
Google's "Similar pages" feature?

Thank you very much
double
2008/6/19 double <ninive at gmx.at>:
> Hello,
>
> Is there a chance to determine similar documents like
> Google's "Similar pages" feature?

Hi,

I had a similar requirement and ended up using the whole document as the search string; the top results retrieved are the most "similar" documents. Of course, what counts as "similar" is fuzzy, and that could make the difference.

Regards
--
Alessandro Pasotti
w3: www.itopen.it
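
To make the "whole document as a search string" idea concrete, here is a rough sketch in plain Python. It does not use the Xapian API; the names `tokenize` and `find_similar`, and the simple rare-term weighting, are assumptions for illustration only. Every term of the source document is treated as one big OR query, and the other documents are ranked by the terms they share with it.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def find_similar(docs, query_id, top_n=3):
    """Rank the other documents by the terms they share with docs[query_id].

    docs is a dict mapping a document id to its text. This is a
    hypothetical helper, not part of any library.
    """
    term_sets = {doc_id: set(tokenize(text)) for doc_id, text in docs.items()}
    n_docs = len(docs)

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for terms in term_sets.values():
        df.update(terms)

    query_terms = term_sets[query_id]
    scores = {}
    for doc_id, terms in term_sets.items():
        if doc_id == query_id:
            continue
        # Each shared term contributes more when it is rare in the collection.
        score = sum(math.log(1 + n_docs / df[t]) for t in query_terms & terms)
        if score > 0:
            scores[doc_id] = score

    # Highest score first; ties are broken by document id for stable output.
    return sorted(scores, key=lambda d: (-scores[d], d))[:top_n]
```

Calling `find_similar(docs, some_id)` returns the ids of the best-matching documents. How terms are weighted, and whether very common words are dropped first, is where the fuzzy meaning of "similar" shows up in practice.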
On Thu, Jun 19, 2008 at 10:03:02AM +0200, double wrote:
> Is there a chance to determine similar documents like
> Google's "Similar pages" feature?

I've just added an FAQ entry for this:

http://trac.xapian.org/wiki/FAQ/FindSimilar

Cheers,
    Olly