aarsh shah
2013-Jan-10 18:47 UTC
[Xapian-devel] Add an example to the community page and contribute more code
Hi guys.I've finished an example indexer which acts like a grep replacement for a file.It indexes each line containing a proper noun in a given text file.The line containing the proper noun will be displayed upon searching for that noun.I would like to add it to the community code examples.I'm planning to write more examples which demonstrate some advanced features of Xapian along similar lines soon. Can anyone please help me on how to add my examples to the community ? I now want to develop a Paice/Husk stemmer for Xapian.The stemmer itself is not that hard to implement and I already read the page mentioned in the resources for the stemming project.But Ive yet to fiddle with the code base and so don't know how to add the stemmer to the codebase.Il be extremely grateful if you could guide me here Moreover,I wold also love to write a patch for multiple spelling suggestions as Xapian only suggests one right now.Any help/suggestions ? My basic problem is that given any feature I want to fiddle with/improve/change,I don't know which part of the code base to access .Thank you for the awesome docs and the resources.In just two weeks, I feel like I know a lot more about IR and Xapian than I used to :) -Regards -Aarsh -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20130111/c45b7d0b/attachment-0001.html>
Olly Betts
2013-Jan-12 06:49 UTC
[Xapian-devel] Add an example to the community page and contribute more code
On Fri, Jan 11, 2013 at 12:17:07AM +0530, aarsh shah wrote:> Hi guys.I've finished an example indexer which acts like a grep replacement > for a file.It indexes each line containing a proper noun in a given text > file.The line containing the proper noun will be displayed upon searching > for that noun.I would like to add it to the community code examples.I'm > planning to write more examples which demonstrate some advanced features of > Xapian along similar lines soon. Can anyone please help me on how to add my > examples to the community ?Cool. There's a page on the wiki for examples here: http://trac.xapian.org/wiki/SampleCode Anyone can edit the wiki - you just need to register first. You can attach files to the wiki page, but I'd probably suggest using a code hosting site (that's certainly a better option if it's more than a single file).> I now want to develop a Paice/Husk stemmer for Xapian.The stemmer itself is > not that hard to implement and I already read the page mentioned in the > resources for the stemming project.But Ive yet to fiddle with the code base > and so don't know how to add the stemmer to the codebase.Il be extremely > grateful if you could guide me hereYou can subclass Xapian::StemImplementation and then wrap that in a Xapian::Stem object to use where you'd use the built-in stemmers: http://xapian.org/docs/apidoc/html/stem_8h.html I'd probably suggest you just do it as an external stemmer first, and once you have that working, then work on including it as part of the library.> Moreover,I wold also love to write a patch for multiple spelling > suggestions as Xapian only suggests one right now.Any help/suggestions ?There's actually already a patch for that, along with a number of other enhancements to spelling correction done as a GSoC project, but they're all on a branch currently, and haven't yet been merged.> My basic problem is that given any feature I want to fiddle > with/improve/change,I don't know which part of the code base to accessYeah, finding your way around a sizable source tree can take some getting used to. Have you seen this overview page? http://xapian.org/docs/code_structure.html> feel like I know a lot more about IR and Xapian than I used to :)Great. Cheers, Olly