search for: catdoc

Displaying 7 results from an estimated 7 matches for "catdoc".

2007 Apr 01
10
indexing mostly-binary documents (.ppt)
Here's an interesting problem: In my app, we are indexing various types of documents, including Microsoft PowerPoint. PowerPoint documents are mostly binary, but have a bunch of text (all of the text in the document?) as well. My thinking is that the binary will never get searched for, and the proper text will be indexed and queried as expected, so the indexed binary will never
2001 Aug 17
1
Excel -> *.CSV in Unix (Linux) command line?
A colleague has a dozen Excel sheets and also expects to get updates regularly. He could open these in M$-Excel and export as *.csv manually, "bring back to Unix" and then read into R. Of course there must be options to start programming this in something like Visual Basic, but we wouldn't really want to... We also know that probably Gnumeric could do the job (since the xls
2011 Nov 20
9
scanning word document in ruby
I am new to RoR and just got stuck on something: I have to scan a Word document through Ruby. Please suggest how to do this.
2004 Mar 03
4
How to read Excel file and access the data item?
In R, how do I read an Excel file and access the data items? Thank you.
2020 Sep 07
2
Indexer error after upgrade to 2.3.11.3
...sr/bin/pdftotext $path - 2>/dev/null&
+  if [ -x /usr/lib/xpdf/pdftotext ]; then
+    /usr/lib/xpdf/pdftotext $path - 2>/dev/null&
+  else
+    /usr/local/bin/pdftotext $path - 2>/dev/null&
+  fi
   wait_timeout 2>/dev/null
 elif [ $fmt = "doc" ]; then
-  (/usr/bin/catdoc $path; true) 2>/dev/null&
+  (/usr/local/bin/catdoc $path; true) 2>/dev/null&
   wait_timeout 2>/dev/null
 elif [ $fmt = "ppt" ]; then
-  (/usr/bin/catppt $path; true) 2>/dev/null&
+  (/usr/local/bin/catppt $path; true) 2>/dev/null&
   wait_timeout 2>/dev...
2020 Aug 19
7
Indexer error after upgrade to 2.3.11.3
Hi, after the upgrade to Dovecot 2.3.11.3 from 2.3.10.1, I frequently see these errors from different users: Aug 18 11:02:35 Panic: indexer-worker(info at domain.com) session=<g71KISOttvS5LNVj:O3ahCyuZO18cYAAAEPCW+w>: file http-client-request.c: line 1232 (http_client_request_send_more): assertion failed: (req->payload_input != NULL) Aug 18 11:02:35 Error: indexer-worker(info at
2011 Mar 28
0
Draft Application for GSoC 11 - Text extraction libraries - please review
...w file-format support. 3: Add thumbnail generation feature. 4: Add a testing framework. 5: Minimize 'ignore' file list. B: What is new or different about your approach which hasn't been done or wasn't possible before? Currently we require external programs like xpdf, unzip, xls2csv, catdoc, etc., which the user would need to have installed on their computer to make use of file indexing for the corresponding formats. But this has a problem: a new process needs to be started every time we come across such a file format, and the external program is run, which would then ex...
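
The external-program approach that several of the threads above mention (catdoc, catppt, pdftotext, xls2csv) can be sketched roughly as follows. This is a minimal illustration, not code from any of the listed messages: the `EXTRACTORS` table and the `build_command` and `extract_text` helpers are hypothetical names, and the sketch assumes the tools are on `PATH` and print the extracted text to standard output. It also shows the cost the last snippet points out, since every file starts a new process.

```python
import shutil
import subprocess
from pathlib import Path

# Hypothetical mapping from file extension to an external extractor.
# The tool names come from the threads above; the table itself is an assumption.
EXTRACTORS = {
    ".doc": ["catdoc"],
    ".ppt": ["catppt"],
    ".xls": ["xls2csv"],
    ".pdf": ["pdftotext"],
}


def build_command(path):
    """Return the argv list used to extract text from path, or None if unsupported."""
    suffix = Path(path).suffix.lower()
    tool = EXTRACTORS.get(suffix)
    if tool is None:
        return None
    argv = tool + [str(path)]
    if suffix == ".pdf":
        # pdftotext writes to stdout when the output file is given as "-".
        argv.append("-")
    return argv


def extract_text(path, timeout=30):
    """Run the external extractor for path; return "" if it is unavailable or times out."""
    argv = build_command(path)
    if argv is None or shutil.which(argv[0]) is None:
        return ""
    try:
        # One new process per file, discarding the tool's stderr.
        result = subprocess.run(
            argv,
            stdout=subprocess.PIPE,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        return ""
    return result.stdout.decode("utf-8", errors="replace")
```

Calling `extract_text("slides.ppt")` would return whatever catppt prints for that file, or an empty string when the tool is not installed, so an indexer built on it can skip unsupported formats instead of indexing raw binary.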