Rafael "SDM" Sierra
2007-Jan-22 21:43 UTC
[Xapian-discuss] (Very)Wrong number in get_matches_estimated
Hi again, when I do "mset = enq.get_mset(0,10)" and after "mset.get_matches_estimated()", it returns 32 results, but, if I do "enq.get_mset(11,10)" in the same query, I got only 11 results, there's some reason for it?? The database was not modified The system are: Xapian-0.9.9 Python-2.4 FreeBSD (amd64) -- SDM Underlinux http://stiod.wordpress.com Membro da equipe UnderLinux -- PEP-8 There is only 2 kinds of peoples in the world, who know English, and me. oO
Rafael "SDM" Sierra
2007-Jan-22 22:21 UTC
[Xapian-discuss] Re: (Very)Wrong number in get_matches_estimated
Sorry, it's a stupid mail, I'm using match decider, and have some documents that don't pass on it, so, the number will be really different On 1/22/07, Rafael SDM Sierra <sdm@underlinux.com.br> wrote:> > Hi again, > when I do "mset = enq.get_mset(0,10)" and after " > mset.get_matches_estimated()", it returns 32 results, but, if I do " > enq.get_mset(11,10)" in the same query, I got only 11 results, there's > some reason for it?? The database was not modified > > The system are: > Xapian-0.9.9 > Python-2.4 > FreeBSD (amd64) > > -- > SDM Underlinux > http://stiod.wordpress.com > Membro da equipe UnderLinux > -- > PEP-8 > There is only 2 kinds of peoples in the world, who know English, and me. > oO-- SDM Underlinux http://stiod.wordpress.com Membro da equipe UnderLinux -- PEP-8 There is only 2 kinds of peoples in the world, who know English, and me. oO
Sam Liddicott
2007-Jan-22 22:30 UTC
[Xapian-discuss] (Very)Wrong number in get_matches_estimated
That's the nature of estimation for you. The search is optimized not to find all matches every time - in most cases the user only wants a few results and isn't harmed by the guess being wrong - this is more true when there are billions of results. One of the eqnuire options will let you set how many hits the search should check the actual existance of. Maybe you would set this to 100 beyond the set of hits you want to show, but in cases where there are only a few dozen hits you might make xapian work harder proving that there are no more. Perhaps it's worth xapian or a caller holding some data on recent queries to avoid it repeatedly slowly and exhaustively proving that there are only 3 hits and not 100 (out of a database of 10 Million). Sam -----Original Message----- From: "Rafael "SDM" Sierra" <sdm@underlinux.com.br> To: "Xapian Discussion" <xapian-discuss@lists.xapian.org> Sent: 22/01/07 21:43 Subject: [Xapian-discuss] (Very)Wrong number in get_matches_estimated Hi again, when I do "mset = enq.get_mset(0,10)" and after "mset.get_matches_estimated()", it returns 32 results, but, if I do "enq.get_mset(11,10)" in the same query, I got only 11 results, there's some reason for it?? The database was not modified The system are: Xapian-0.9.9 Python-2.4 FreeBSD (amd64) -- SDM Underlinux http://stiod.wordpress.com Membro da equipe UnderLinux -- PEP-8 There is only 2 kinds of peoples in the world, who know English, and me. oO _______________________________________________ Xapian-discuss mailing list Xapian-discuss@lists.xapian.org http://lists.xapian.org/mailman/listinfo/xapian-discuss