thr3ads.net - search: "pdftools"

Displaying 20 results from an estimated 25 matches for "pdftools".

2016 Dec 02

pdftools on Ubuntu

...es usages > UMR 9190 - University of Montpellier > Montpellier, France > > ???.???`?..><((((?>.???`?.???.???`?...?><((((?> > > 2016-12-02 12:21 GMT+01:00 Eduardo Morelli <emorelli1966 at gmail.com>: > >> Hi, >> >> I am trying to install pdftools package on R 3.3.1 (Ubuntu 16.04), but the >> following issue occurs: >> >> ------------------------- ANTICONF ERROR --------------------------- >> Configuration failed because poppler-cpp was not found. Try installing: >> * deb: libpoppler-cpp-dev (Debian, Ubuntu, et...

pdftools on Ubuntu

2016 Dec 02

pdftools on Ubuntu

Hi, I am trying to install pdftools package on R 3.3.1 (Ubuntu 16.04), but the following issue occurs: ------------------------- ANTICONF ERROR --------------------------- Configuration failed because poppler-cpp was not found. Try installing: * deb: libpoppler-cpp-dev (Debian, Ubuntu, etc) * rpm: poppler-cpp-devel (Fedora, CentOS...

pdftools on Ubuntu

2016 Dec 02

pdftools on Ubuntu

...umr-marbec.fr> - Biodiversit? Marine et ses usages UMR 9190 - University of Montpellier Montpellier, France ???.???`?..><((((?>.???`?.???.???`?...?><((((?> 2016-12-02 12:21 GMT+01:00 Eduardo Morelli <emorelli1966 at gmail.com>: > Hi, > > I am trying to install pdftools package on R 3.3.1 (Ubuntu 16.04), but the > following issue occurs: > > ------------------------- ANTICONF ERROR --------------------------- > Configuration failed because poppler-cpp was not found. Try installing: > * deb: libpoppler-cpp-dev (Debian, Ubuntu, etc) > * rpm: popp...

Consulta

2019 Sep 23

Consulta

Buenas tarde a todo en s: Tenia la versión de R 3.6 y utilizaba la paquetería de pdftools para extraer información de archivos en pdf actualice la versión 3.6.1 y ya no reconoce la paquetería alguien que me pueda ayudar. Prácticamente no reconoce las funciones de pdftools library(pdftools) library(stringr)? library(NLP)? library(tm)? library(tesseract)? library(magick)? install.package...

Consulta

2019 Sep 24

Consulta

Emilio Ahora cuando quiero instalar los paquetes pdftools, magick y otros más me salen el siguiente error WARNING: Rtools is required to build R packages but is not currently installed. Please download and install the appropriate version of Rtools before proceeding: https://cran.rstudio.com/bin/windows/Rtools/ Installing package into ?C:/Users/bdomingu...

textual analysis - transforming several pdf to txt - naming the files

2023 Jul 05

textual analysis - transforming several pdf to txt - naming the files

...running the following scrip, but the names of txt files that I obtain are: pdftext1, pdftext2, pdftext3... What do I need to change? Thank you very much, Cec?lia Carmo Universidade de Aveiro - Portugal dirpath <- ("/Users/ceciliacarmo/documents/RTextualAnalysis/data/pdfs") library(pdftools) library(dplyr) convertpdf2txt <- function(dirpath){ files <- list.files(dirpath, full.names = T) x <- sapply(files, function(x){ x <- pdftools::pdf_text(x) %>% paste0(collapse = " ") %>% stringr::str_squish() return(x) }) } # apply function...

help

2011 Apr 02

help

...ng something critical.can someone provide help? My intention is to create a pdf file of a plot in R and then attach existing files from my system as attachment into the newly created pdf file. Any help would be greatly appreciated.. Here is the command line script i want to execute from within R. pdftools -S "attachfiles=C:\test1.pdf" -i C:\test2.pdf -o C:\test4.pdf Regards Vijayan Padmanabhan [[alternative HTML version deleted]]

Minería de testo en R

2017 Oct 02

Minería de testo en R

Hola, Hay una adaptación específica a R de una solución comercial, pero que se puede usar hasta cierto número de llamdas: pdftools https://cloud.r-project.org/web/packages/pdftools/index.html Saludos, Carlos Ortega www.qualityexcellence.es El 2 de octubre de 2017, 9:22, Isidro Hidalgo Arellano <ihidalgo en jccm.es> escribió: > Yo he utilizado "tm" para tratar PDF de forma masiva, pero hay que tener >...

Problemas usando paquete textreuse

2024 Nov 25

Problemas usando paquete textreuse

...ivos para utilizar las funciones TextReuseCorpus() o TextReuseTextDocument(). En la documentación del paquete los archivos los cargan desde ¿Alguien sabe cómo se hace? He conseguido calcular la similitud de jaccard utilizando este paquete, pero para ello he empleado el siguiente código. library(pdftools) library(textreuse) text1 <- pdf_text("uno.pdf") text2 <- pdf_text("dos.pdf") full_text1 <- paste(text1, collapse = " ") full_text2 <- paste(text2, collapse = " ") a <- tokenize_words(full_text1) b <- tokenize_words(full_text2) jacc...

adding "Page X of XX" to PDFs

2023 Dec 02

adding "Page X of XX" to PDFs

You could easily omit the Page X of xX, but leave the timestamp Then add Page X of XX programmatically using pdftools or some similar pdf command line tools. On Sat, 2 Dec 2023, 22:35 , <avi.e.gross at gmail.com> wrote: > Having read all of the replies, it seems there are solutions for the > question and the OP points out that some solutions such as making the > document twice will affect the crea...

Minería de testo en R

2017 Oct 02

Minería de testo en R

...acias Carlos. > > Manuel > > El 2 de octubre de 2017, 1:42, Carlos Ortega <cof en qualityexcellence.es> > escribió: > >> Hola, >> >> Hay una adaptación específica a R de una solución comercial, pero que se >> puede usar hasta cierto número de llamdas: pdftools >> >> https://cloud.r-project.org/web/packages/pdftools/index.html >> >> Saludos, >> Carlos Ortega >> www.qualityexcellence.es >> >> El 2 de octubre de 2017, 9:22, Isidro Hidalgo Arellano <ihidalgo en jccm.es> >> escribió: >> >&gt...

adding "Page X of XX" to PDFs

2023 Dec 02

adding "Page X of XX" to PDFs

...But what about the idea of creating a PDF with a placeholder like "Page N of XXX" and after the file has been created, dates and all, perhaps edit it programmatically and replace all instances of XXX with something of the same length like " 23" as there seem to be tools like the pdftools package that let you get the number of pages. I have no idea if some program, perhaps external, can do that and retain the date you want. -----Original Message----- From: R-help <r-help-bounces at r-project.org> On Behalf Of Dennis Fisher Sent: Friday, December 1, 2023 3:53 PM To: r-help at...

textual analysis - transforming several pdf to txt - naming the files

2023 Jul 05

textual analysis - transforming several pdf to txt - naming the files

convertpdf2txt <- function(dirpath){ files <- list.files(dirpath, pattern = "Consoli.*\\.pdf$", full.names = TRUE) files <- chartr("\\", "/", files) x <- lapply(files, function(x){ pdftools::pdf_text(x) %>% paste0(collapse = " ") %>% stringr::str_squish() }) new_names <- tools::file_path_sans_ext(files) new_names <- paste(new_names, "txt", sep = ".") setNames(x, new_names) } # apply function # note that my test files...

Adding text to existing PDF's created with R

2020 Oct 07

Adding text to existing PDF's created with R

...ld be unchanged (except for a new filename). The intent is as follows: I have multiple PDFs that I eventually merge into a single PDF, separating each one with a separator page. The content of the separator pages comes from a Word document. The task is performed with textreadr, officer, and pdftools. I can insert page numbers into the separator pages (created as PDF documents). I join the separator pages and the original PDFs using python's join command. But I have not been able to figure out how to add page numbers to the existing PDF's. Any help would be appreciated. Dennis Denni...

de pdf a csv

2016 Sep 10

de pdf a csv

Estimados En ocasionas hay informaciones epidemiológicas en reportes pdf semanales como el que adjunto que quisiéramos llevar a csv o txt USANDO R para poder analizarlas estadísticamente. Apreciaríamos su ayuda si nos diesen un script, el paquete pdftable no me resultó. Saludos José -- Este mensaje le ha llegado mediante el servicio de correo electronico que ofrece Infomed para respaldar

Minería de testo en R

2017 Sep 29

Minería de testo en R

Estimados miembros del grupo, Estoy buscando paquetes de R que permitan hacer minería de textos de archivos PDF o Word que tengan una estructura tabular (cuadros) de resultado de talleres de trabajo donde se tratan diferentes ejes temáticos. Especifico esto porque he visto que algunos paquetes analizan directamente texto de libros, tweets u otras fuentes donde no hay una estructura como cuadros

Extracting a a chunk of text from a pdf file

2011 Sep 17

Extracting a a chunk of text from a pdf file

In an R script I need to extract some figures from many web pages in pdf format. As an example see http://www.terna.it/LinkClick.aspx?fileticket=TTQuOPUf%2fs0%3d&tabid=435&mid=3072 from which I would like to extract the "Totale: 1,025,823"). Is there any solution? Ciao Vittorio

Warning object has offset 0. PDF file

2024 Oct 13

Warning object has offset 0. PDF file

Good day to all Using > library(pdftools) Using poppler version 23.04.0 I get a number of warnings such as: > pdf_subset(infile, pages = 156:157, output = outfile) WARNING: /Users/jla/Library/CloudStorage/Dropbox/7cordas/Caio/record/90 NEW RODA SONG BOOK.pdf (object 7 0): object has offset 0 WARNING: /Users/jla/Library/CloudStorage/D...

Resumen de R-help-es, Vol 187, Envío 10

2024 Nov 26

Resumen de R-help-es, Vol 187, Envío 10

...eCorpus() o TextReuseTextDocument(). > > En la documentación del paquete los archivos los cargan desde > > ¿Alguien sabe cómo se hace? > > He conseguido calcular la similitud de jaccard utilizando este paquete, > pero para ello he empleado el siguiente código. > > library(pdftools) > > library(textreuse) > > text1 <- pdf_text("uno.pdf") > > text2 <- pdf_text("dos.pdf") > > full_text1 <- paste(text1, collapse = " ") > > full_text2 <- paste(text2, collapse = " ") > > a <- tokenize_word...

R 4.0.0 rebuild status

2020 Jun 25

R 4.0.0 rebuild status

On Thu, 25 Jun 2020 at 19:01, Jos? Ab?lio Matos <jamatos at fc.up.pt> wrote: > > On Wednesday, 24 June 2020 10.42.10 WEST I?aki Ucar wrote: > > Thanks, Jos? and Elliott. I can help with reviews. > > > > I attach here a list of batches of CRAN packages to be rebuilt in > > order (batches separated by a blank line), and the script that > > generates it. Hope

search for: pdftools