Displaying 4 results from an estimated 4 matches for "snippetgenerator".

2013 Mar 13
1
patch - add SnippetGenerator class
G'day, This is some code which we have been using for some time now in our searching beta, forward-ported from 1.2.12 to today's git. It adds a class, Xapian::SnippetGenerator, which can be used to generate human-readable snippet strings. By default, the snippets are HTML-formatted, with each matched search term inside a <b> tag, 5 words of context around matched search terms, and an ellipsis "..." between non-adjacent contexts. There is a certain amount of c...
2020 Jan 16
2
[llvm-exegesis] [RFC] Renaming Uops- classes
Since the option of running -mode=inverse_throughput was added to llvm-exegesis, the names of classes like UopsSnippetGenerator and UopsBenchmarkRunner, which this mode shares with uops, have become less descriptive. Inverse_throughput doesn't use the uops counters, so, for example, the instruction layout shared between these two modes is really connected to parallelism, not uops. It's doubly confusing for architec...
2016 Jul 29
3
Pull requests: CJK words and Snippet generator
...n't > try to comment on that at all :) I've just opened a pull request for the CJK tokenizer: https://github.com/xapian/xapian/pull/114 > I wonder if we can arrange suitable defaults to use your > implementation with the older API, and come up with a newer API that > allows a SnippetGenerator class to be used from the MSet. The FastMail snippet generator was written when MSet didn't create snippets. I'll first compare both implementations to see if there is a good reason for them to coexist, or whether we might just as well merge any additional features into MSet. > A good start...
2016 Jul 26
2
Pull requests: CJK words and Snippet generator
Hi, The Cyrus IMAP mail server uses Xapian as its search engine. Recently, FastMail has sponsored the implementation of two Xapian features: CJK word splitting and a generator for search snippets. I've been working on both features and we would be happy to get them merged into Xapian master. The CJK word tokenizer uses the word segmentation algorithms of the International Components for Unicode