similar to: indexing mostly-binary documents (.ppt)

Displaying 20 results from an estimated 1000 matches similar to: "indexing mostly-binary documents (.ppt)"

2013 Dec 12
2
PPT text extractor
Hi, I've had a heads up from a user that catppt did not work at all on semi-recent PowerPoint files (ppt, not pptx). I checked, and indeed it misses most of the content on many files. After looking around, I found Python code from the libreoffice project which makes a nice ppt text extractor after adding a very thin command line wrapper:
2008 Oct 15
3
Extract text from Microsoft PowerPoint files
Hello CentOS people, I'm wondering if there are command tools like antiword and docx2txt for Microsoft PowerPoint files (.ppt and .pptx). The idea is to extract text from PowerPoint files. Sorry this isn't exactly about CentOS, but I'd really like it if Yum has something. I tried xlhtml, but it hasn't been updated in a while and isn't exactly wanting to work on CentOS
2011 Nov 20
9
scanning word document in ruby
I am new to RoR and have gotten stuck on something: I have to scan a Word document through Ruby. Please suggest how to do this.
2001 Aug 17
1
Excel -> *.CSV in Unix (Linux) command line?
A colleague has a dozen Excel sheets and also expects to get updates regularly. He could open these in M$-Excel and export as *.csv manually, "bring back to Unix" and then read into R. Of course there must be options to start programming this in something like Visual Basic, but we wouldn't really want to... We also know that probably Gnumeric could do the job (since the xls
2004 Mar 03
4
How to read Excel file and access the data item?
In R, how do I read an Excel file and access the data items? Thank you.
2020 Aug 19
7
Indexer error after upgrade to 2.3.11.3
Hi, after the upgrade to Dovecot 2.3.11.3, from 2.3.10.1, I frequently see these errors from different users: Aug 18 11:02:35 Panic: indexer-worker(info at domain.com) session=<g71KISOttvS5LNVj:O3ahCyuZO18cYAAAEPCW+w>: file http-client-request.c: line 1232 (http_client_request_send_more): assertion failed: (req->payload_input != NULL) Aug 18 11:02:35 Error: indexer-worker(info at
2012 Mar 31
2
Is it there any std pattern in R which show same representation style of ppt found on net
I am searching the net for material related to R. Whenever I find a ppt presentation on some topic, it is in a special format: white background and blue bars, with links to further topics. See 1) https://www.rmetrics.org/files/Meielisalp2007/Presentations/Pfaff.pdf 2) http://grapevine.com.au/~yanchang/docs/Time-Series-Mining-slides.pdf If this is a format, how can I make it, if I want to give my R
2009 Aug 28
2
OT: .doc,.xls,.pdf,.ppt (etc.) string parser/indexers
Does anyone have experience with linux tools to parse the text from common non-text file formats for searching? I'm trying to use the kinosearch add-on for twiki which is fine as far as the search goes, but it takes forever to generate the index. It uses xpdf to extract strings from pdf's, antiword for .doc, and since it is perl, the Spreadsheet::ParseExcel module for .xls. Some
2009 Jan 16
1
PDF slides (beamer or prosper) to an editable PPT
Hello, I am getting requests to place our PDF slides (output from beamer) into Microsoft Powerpoint formats (.ppt). What's the best practice or any recommended software packages (any success with open or commercial) that we can use to convert PDF slides into an EDITABLE powerpoint deck? Thanks in advance.
2010 Aug 21
7
Error when opening MS Office files (*.doc, *.xls, *.ppt)
Error when opening MS Office files (*.doc, *.xls, *.ppt) (Wine 1.0.1 and Wine 1.1.42, Ubuntu 10.04) -- Hello, I get this error when I try to open MSOffice 2003 files (like *.doc or *.xls or *.ppt): "The file '/home/blabla/Doc1.doc' is not marked as executable. If this was downloaded or copied form an untrusted source, it may be dangerous to run. For more details, read about
2007 Nov 26
8
search not working after upgrade
Izit Izit wrote: > Correction on my previous post. > > The correct way to do it is: > > Product.find_by_contents("*",{},:conditions =>search_conditions,:include > => [:supplier],:order =>"products.id" ) > > Leave out the :limit=>:all that is put in by default. Exactly - I tried to make aaf a bit more clever by letting it assume :limit
2009 Apr 29
1
antiword
Hi guys, I've been noticing more and more that antiword has trouble with many word documents. It may look like it's converted a document but leaves out headings and bits of text. I've been looking into getting openoffice to do it in headless mode but still have a way to go before it's stable. I was wondering if anyone else had any luck on this front? One quick fix I have found
2009 Apr 06
2
omindex => Unknown extension
Hi all, I'm having a recurrent problem with Omega's indexing. When I run omindex, it sometimes fails to recognize the extension of some files (.doc, .pdf) and skips them. In the same run, omindex is otherwise perfectly able to index other files with the same extensions. The reason is not clear, but it should occur before it selects a content converter since for example, if I manually run
2009 Aug 17
2
reading in MS Word files
I am familiar with packages that read and write Excel files on both Windows and Linux platforms. Do any packages provide similar functionality for MS Word files? I have a lot of text processing to do and the text is embedded in ~200 different Word files (.doc format Office 2003). All I need to do is read, not write. Thanks, Mark
2020 Sep 07
2
Indexer error after upgrade to 2.3.11.3
Not sure if I mentioned it but I'm on FreeBSD too. I wonder if any of the patches FreeBSD applies automatically is causing this. I looked through them but couldn't find anything obvious that might cause this --- configure.orig 2020-08-12 12:20:51 UTC +++ configure @@ -28901,13 +28901,13 @@ fi if test $want_stemmer != no; then - { $as_echo "$as_me:${as_lineno-$LINENO}:
2007 Apr 09
5
IndexReader#terms for all fields?
Is it possible to query the index for a TermEnum for all fields in the index instead of just ? Thanks, John
2006 Oct 12
3
How to get a decent stack trace?
Hi, I just gave Mongrel a try on my Rails app. After running 'mongrel_rails start' everything seems fine. Yet, when I start a request I get the following error message on the console: ERROR: You have a nil object when you didn't expect it! The error occured while evaluating nil.readpartial That's it. The application works fine with Webrick and
2007 Nov 14
4
scripts installed
On a Debian production server and my OS X development machine, there is a script at /usr/local/bin/mongrel_rails On an Ubuntu 7.1 machine, this script appears at /var/lib/gems/1.8/bin/mongrel_rails mongrel was installed using gem in all cases. What is the cause of the discrepancy? Thanks, John -- John Joseph Bachir http://lyceum.ibiblio.org http://blog.johnjosephbachir.org
1999 Sep 29
0
Problems saving large ppt-files to samba server
Hello, We have problems with saving large ppt-files (>20MB) from a NTClient Sp5 to the Samba server (2.0.4b on Solaris 7). During the save procedure the client hangs, some time later the client displays an error message like "Error during saving into buffer" and the application is killed. In the log file of the client on the samba server you see the error message: [1999/09/29 12:25:50,
2007 Oct 25
0
Open ppt in Browser
Hi, in my application I want to open a ppt file in a web browser [IE, FF]. Can anyone please help me with this? -- Posted via http://www.ruby-forum.com/.