We use Nutch and Lucene for our heavy-duty text analysis jobs, but I'm trying to use Ferret to do some experiments. I understood that Ferret used the same index format as Lucene, but I cannot look into a Lucene index with Ferret and cannot read a Ferret index with Luke (the Lucene index browser). Am I doing something wrong, or have the formats diverged?
On Jan 24, 2008, at 9:25 PM, Pat Ferrel wrote:
> We use Nutch and Lucene for our heavy-duty text analysis jobs, but
> I'm trying to use Ferret to do some experiments. I understood that
> Ferret used the same index format as Lucene, but I cannot look into a
> Lucene index with Ferret and cannot read a Ferret index with Luke
> (the Lucene index browser). Am I doing something wrong, or have the
> formats diverged?

The formats have diverged.

-ryan
Can I use an earlier version of Ferret, maybe? Does anyone know when the formats diverged?

On 1/24/08 10:41 PM, "Ryan King" <ryan at theryanking.com> wrote:
> The formats have diverged.
>
> -ryan
On Fri, Jan 25, 2008 at 08:12:24AM -0800, Pat Ferrel wrote:
> Can I use an earlier version of Ferret, maybe? Does anyone know when the
> formats diverged?

Yeah, versions 0.3.x should definitely work, and possibly also 0.9.x.

AFAIR there were always some substantial problems with UTF-8 character handling, so you might run into problems even with the older versions.

Cheers,
Jens

-- 
Jens Krämer
webit! Gesellschaft für neue Medien mbH
Schnorrstraße 76 | 01069 Dresden
Telefon +49 351 46766-0 | Telefax +49 351 46766-66
kraemer at webit.de | www.webit.de
Amtsgericht Dresden | HRB 15422
GF Sven Haubold
I am a little new to Ruby, so forgive me if there is an obvious answer. One way to solve my problem might be to use JRuby and manipulate the Lucene index through Java.

I have never done the Ruby->Java->Ruby thing, but it seems it would be nicer if there were a Ruby class interface that hid some of the complexity of the raw Java one. Has anyone done this, or does anyone know of such a Ruby interface or example code?

On 1/25/08 8:33 AM, "Jens Kraemer" <kraemer at webit.de> wrote:
> Yeah, versions 0.3.x should definitely work, and possibly also 0.9.x.
>
> AFAIR there were always some substantial problems with UTF-8
> character handling, so you might run into problems even with the older
> versions.
>
> Cheers,
> Jens
On Sun, Feb 03, 2008 at 01:01:10PM -0800, Pat Ferrel wrote:
> I am a little new to Ruby, so forgive me if there is an obvious answer.
> One way to solve my problem might be to use JRuby and manipulate the
> Lucene index through Java.
>
> I have never done the Ruby->Java->Ruby thing, but it seems it would be
> nicer if there were a Ruby class interface that hid some of the
> complexity of the raw Java one. Has anyone done this, or does anyone
> know of such a Ruby interface or example code?

Yeah, a Ruby-friendly wrapper around Lucene, maybe even compatible with Ferret's API, would for sure be a nice thing. But AFAIK there's no such thing yet.

Cheers,
Jens

-- 
Jens Krämer
http://www.jkraemer.net/ - Blog
http://www.omdb.org/ - The new free film database