search for: pdfinfo

Displaying 6 results from an estimated 6 matches for "pdfinfo".

2008 Nov 13
1
readPDF() -- unsure how to install xpdf to make this work?
...letter (http://cran.r-project.org/doc/Rnews/ Rnews_2008-2.pdf), the package 'tm' for text mining is mentioned. In that lovely package, there is a function called 'readPDF()'. In order to use this, ?readPDF says "Note that this PDF reader needs both the tools pdftotext and pdfinfo installed and accessable on your system." These tools are available from http://www.foolabs.com/xpdf/download.html I am able to download this and use it easily from a dos window to convert a pdf file into a txt file. Question: how do i make these tools available to R, so that i can use the...
2018 Mar 02
5
evince
We have some small networks with connectivity to the Internet through firewall routers.? The smallest has one Windows 7 system and three Linux systems including both CentOS 6 and CentOS 7 machines.? The Windows 7 systems have full Adobe packages that are updated regularly and are trouble free. On the Linux systems, evince has been our go to product for viewing and printing .pdf documents.? This
2019 Mar 21
2
[GSoC] Questions about project Text-Extraction Libraries
Hello! I have a few question related to the project Text-Extraction Libraries. Firstly, I think that trying to isolate library bugs in subprocesses could get to work, but I am not sure about how to handle deadlocks or infinite loops. I feel that using a timer is the only way to deal with it but I would like to know what you think about it. Secondly, I have been reading the source code of
2006 Aug 11
3
Proposed changes to omindex
...ormation that may not be contained in the actual document. Future Items ============ 6) Stream indexer. Instead of reading the entire file into memory, process it line by line. This should make indexing large files more efficient. 7) Clean up the fixme?s in mime type handlers i.e. // FIXME: run pdfinfo once and parse the output ourselves. I woudl use pcre to extract the desired text. 8) Change the way stemmed terms are added to the database. Remove the R prefix from raw terms and only write stemmed terms to the DB if they differ from the original term, prefixing them with Z?. If stemming was...
2019 Mar 23
2
[GSoC] Questions about project Text-Extraction Libraries
...formats under the same > > interface. When indexing files, are all file formats treated in a similar > > way, or are there special formats that require a different work (beyond > the > > use of external filters)? > > A few do - e.g. for PDF files we currently need to run pdfinfo and > pdftotext on the file, PostScript files are first converted to a > temporary PDF (because there doesn't seem to be a Unicode-aware > filter which converts PostScript to text), etc. > > It may be possible to come up with a common interface still though. > > > To sum...
2006 Jun 24
8
How to install programs in wine?
I am a rank newbie to Linux and wine. I am running Ubuntu Dapper on an AMD 1800 mhz machine, wine 0.9.15 Everything I have read says use the installer to load windows programs. Where is the installer? Thanks, -- Ron Thompson On the Beautiful Florida Space Coast, right beside the Kennedy Space Center, USA http://www.plansandprojects.com My hobby pages are here: