Displaying 17 results from an estimated 17 matches for "set_docu".
Did you mean:
  set_doc
  
2010 Oct 24
1
Cannot index with dynamic spelling data (Perl/Search::Xapian)
...an qw(:all);
use strict;
my $xa = new Search::Xapian::WritableDatabase ("/tmp/xapian",
            DB_CREATE_OR_OVERWRITE);
my $indexer = Search::Xapian::TermGenerator->new();
$indexer->set_flags(Search::Xapian::FLAG_SPELLING);
my $doc = new Search::Xapian::Document;
$indexer->set_document($doc);
$indexer->index_text("hello 123 blah blah");
$xa->add_document($doc);
--- >8 ---
Output:
terminate called after throwing an instance of 'Xapian::InvalidOperationError'
Aborted
It works fine without "$indexer->set_flags(Search::Xapian::FLAG_SPELLING)...
2018 Nov 30
1
Xapian Benchmark results
....getline(line, sizeof(line));
                       string str(line);
                       if(!str.empty())
                       {
                        line_string += string(line);
                       }
       }
       
       Xapian::Document doc;
		doc.set_data(line_string);
		indexer.set_document(doc);
		indexer.index_text(line_string);
       database.add_document(doc); 
       file.close();
   }
   database.commit();
   cout <<"count::" <<database.get_doccount() <<endl;
   }
   catch (const Xapian::Error &e) {
    cout << e.get_description() <...
2010 Jun 09
1
TermGenerator incorrectly tokenizes German text which contains special characters
...the MySql database is also in
UTF-8 encoding.
*
#1 $lIndexer = new XapianTermGenerator();
#2 $lStemmer = new XapianStem(XapianHelper::GetStemmer($pLanguage)); //
?german?
#3 $lIndexer->set_stemmer($lStemmer);
#4 $lDoc = new XapianDocument();
#5 $lDoc->add_term($lObj->Id);
#6 $lIndexer->set_document($lDoc);
#7 $lIndexer->index_text("Nahrungserg?nzungsmittel Ausrei?er");
#8 $lIndexer->index_text($lSomeStringFromDb);*
In the code example just above here the problem only occurs when I try to
index text on line #8. The string which get indexed on line #7 is indexed
correctly (...
2014 Jan 27
4
Perl Search::Xapian
...1);
while (my $csvline = <$fh>) {
    my $description = $csvline->{DESCRIPTION};
    my $title = $csvline->{TITLE};
    my $identifier = $csvline->{id_NUMBER};
    # We make a doc and tell the term generator to use this.
    my $doc = Search::Xapian::Document->new();
    $tg->set_document($doc);
    $tg->index_text($title, 1, 'S');
    $tg->index_text($description, 1, 'XD');
    # index fields without prefixes for general search.
    $tg->index_text($title);
    $tg->increase_termpos();
    $tg->index_text($description);
    # Store all the feil...
2007 Dec 17
1
Crashes with spelling enabled and perl.
...t;,  
Search::Xapian::DB_CREATE_OR_OPEN);
if (!defined($db)) {
     die("Failed to open xapian_database: $!");
   }
my $indexer = Search::Xapian::TermGenerator->new();
$indexer->set_flags(Search::Xapian::FLAG_SPELLING);
my $document = Search::Xapian::Document->new();
$indexer->set_document($document);
$indexer->index_text(lc('test'), 1);
$db->add_document($document);
undef $db;
Here's the patch to enable spelling against Search-Xapian-1.0.4.0:
http://rusty.devel.infogears.com/xap-perl-spelling.diff
Here's the backtrace against 1.0.4:
Program received sig...
2007 Dec 29
3
Term-Flags
Hi,
Is it necessary to set the down below flag to the TermGenerator,
if I want the "Did you mean ..." spelling corrections?
Xapian::TermGenerator::flags::FLAG_SPELLING
Thank you very much
Markus
2012 Jun 04
1
Search not finding queries with stop words.
...my $indexer = Search::Xapian::TermGenerator->new();
             my $stemmer = Search::Xapian::Stem->new('english');
             $doc->set_data($jsonText);
             $indexer->set_stemmer($stemmer);
             $indexer->set_stopper($stopper);
             $indexer->set_document($doc);
             $indexer->index_text($docBody);
             $indexer->increase_termpos();
             $indexer->index_text($subject);
... (other index_text and add_value calls)
             $xdb->add_document($doc);
If I look for something like index of elements, I get no re...
2012 Nov 03
1
get the title from the document
...? content = open(filePath).read()
??????? # Prepare document
??????? document = xapian.Document()
??????? document.set_data(content)
??????? # Store fileName
??????? fileName = os.path.basename(filePath)
??????? document.add_value(xapian_file_name, fileName)
??????? # Index document
??????? indexer.set_document(document)
??????? indexer.index_text(content)?????????????????? 
??????? 
??????? #Store indexed content in database
??????? database.add_document(document)
///////////// code? for searching?
??? query = queryParser.parse_query(queryString)??? 
??? enquire.set_query(query)
??? matches = enqu...
2018 Jun 21
0
Welcome to the "Xapian-discuss" mailing list
...play with this is using the NEAR syntax in the query parser. So if you had a plain text document:
I am walking, always walking.
And index it in a very simple fashion (in python):
import xapian
db = xapian.WritableDatabase("testdb")
doc = xapian.Document()
tg = xapian.TermGenerator()
tg.set_document(doc)
tg.index_text("I am walking, always walking.")
db.add_document(doc)
Then you can run NEAR queries:
import xapian
db = xapian.Database("testdb")
qp = xapian.QueryParser()
qp.set_database(db)
def query(query):
    enq = xapian.Enquire(db)
    q = qp.parse_query(query)...
2008 Jan 15
7
PHP indexing, what's the PHP method for indexscript
...$doc->add_value(1,date('Ymd',$postrow['postdate']));
$doc->add_value(2,$postrow['author_id']);
      
$doc->add_term("XAUTHORID".$postrow["author_id"]);
$doc->add_term("XAUTHORNAME".$postrow["forum_id"]);
$indexer->set_document($doc);
$indexer->index_text($postrow['post']);          //post == sample  
        
// Add the document to the database.
$database->add_document($doc);
      ____________________________________________________________________________________
Be a better friend, newshound, and...
2013 Sep 22
2
How to filter search result with query with has white space.
...ian::WritableDatabase db("/Users/ramesh/Desktop/xapian",
Xapian::DB_CREATE_OR_OPEN);
        Xapian::TermGenerator indexer;
        Xapian::Stem stemmer("english");
        indexer.set_stemmer(stemmer);
        Xapian::Document doc;
        doc.set_data(d.title);
        indexer.set_document(doc);
        indexer.index_text(d.title,1,"title");
        indexer.index_text(d.content,1,"content");
        indexer.index_text(d.url,1,"url");
        doc.add_boolean_term("title"+d.title);
        db.replace_document(d.url,doc);
        db.commit();...
2013 Sep 22
2
How to filter search result with query with has white space.
...ian::WritableDatabase db("/Users/ramesh/Desktop/xapian",
Xapian::DB_CREATE_OR_OPEN);
        Xapian::TermGenerator indexer;
        Xapian::Stem stemmer("english");
        indexer.set_stemmer(stemmer);
        Xapian::Document doc;
        doc.set_data(d.title);
        indexer.set_document(doc);
        indexer.index_text(d.title,1,"title");
        indexer.index_text(d.content,1,"content");
        indexer.index_text(d.url,1,"url");
        doc.add_boolean_term("title"+d.title);
        db.replace_document(d.url,doc);
        db.commit();...
2015 Jul 26
1
Get term from document by position
...dexing
    Xapian::WritableDatabase db_w(database_dir, Xapian::DB_CREATE_OR_OVERWRITE);
    MyText text_to_index;
    text_to_index.set_string();
    Xapian::TermGenerator indexer;
    Xapian::Stem stemmer("english");
    indexer.set_stemmer(stemmer);
    Xapian::Document doc;
    indexer.set_document(doc);
    indexer.set_stemming_strategy(Xapian::TermGenerator::STEM_ALL_Z);
    indexer.index_text(text_to_index.text_str);
    db_w.add_document(doc);
    db_w.commit();
    db_w.close();
    
    
    
//searching
Xapian::Database db(database_dir);
Xapian::Enquire enquire(db);
    Xapian::Qu...
2007 Feb 12
0
[859] trunk/wxruby2/doc/textile/docchildframe.txtl: Added ''methods'' section; removed C++ members
...e_new
+* "DocChildFrame#get_document":#DocChildFrame_getdocument
+* "DocChildFrame#get_view":#DocChildFrame_getview
+* "DocChildFrame#on_activate":#DocChildFrame_onactivate
+* "DocChildFrame#on_close_window":#DocChildFrame_onclosewindow
+* "DocChildFrame#set_document":#DocChildFrame_setdocument
+* "DocChildFrame#set_view":#DocChildFrame_setview
</ins><span class="cx"> 
</span><del>-h3(#DocChildFrame_mchilddocument). DocChildFrame#m__child_document
</del><ins>+</div>
</ins><span clas...
2007 Feb 12
0
[858] trunk/wxruby2/doc/textile/docmdichildframe.txtl: Added ''methods'' section; removed C++ members
...rame#get_document":#DocMDIChildFrame_getdocument
+* "DocMDIChildFrame#get_view":#DocMDIChildFrame_getview
+* "DocMDIChildFrame#on_activate":#DocMDIChildFrame_onactivate
+* "DocMDIChildFrame#on_close_window":#DocMDIChildFrame_onclosewindow
+* "DocMDIChildFrame#set_document":#DocMDIChildFrame_setdocument
+* "DocMDIChildFrame#set_view":#DocMDIChildFrame_setview
</ins><span class="cx"> 
</span><del>-Document*m_childDocument
</del><ins>+</div>
</ins><span class="cx"> 
</span&g...
2018 Jun 20
2
Welcome to the "Xapian-discuss" mailing list
Hi,
I'm new to Xapian and wanted to know if it has a specific feature. I want
to be able to check the relation between two terms on a page based on how
close they are together on the page. I want to use a combination of n-gram
based labeling and the "slop" feature found in Elasticsearch. Does Xapian
have this/a similar feature? I haven't been able to find any programs that
have
2008 Mar 27
2
Proper noun stemming
Hi All
I was wondering if anyone had a solution for the following problem.
I user QueryParser to stem my documents before adding them to a  
database. During the stemming process I would like to find a way of  
keeping proper nouns that span two or more words together as a phrase.  
For example "New York" or "Gordon Brown" or "Prime Minister" get spilt  
up. I see