search for: pdftools

Displaying 20 results from an estimated 25 matches for "pdftools".

2016 Dec 02
1
pdftools on Ubuntu
...es usages > UMR 9190 - University of Montpellier > Montpellier, France > > ???.???`?..><((((?>.???`?.???.???`?...?><((((?> > > 2016-12-02 12:21 GMT+01:00 Eduardo Morelli <emorelli1966 at gmail.com>: > >> Hi, >> >> I am trying to install pdftools package on R 3.3.1 (Ubuntu 16.04), but the >> following issue occurs: >> >> ------------------------- ANTICONF ERROR --------------------------- >> Configuration failed because poppler-cpp was not found. Try installing: >> * deb: libpoppler-cpp-dev (Debian, Ubuntu, et...
2016 Dec 02
2
pdftools on Ubuntu
Hi, I am trying to install pdftools package on R 3.3.1 (Ubuntu 16.04), but the following issue occurs: ------------------------- ANTICONF ERROR --------------------------- Configuration failed because poppler-cpp was not found. Try installing: * deb: libpoppler-cpp-dev (Debian, Ubuntu, etc) * rpm: poppler-cpp-devel (Fedora, CentOS...
2016 Dec 02
0
pdftools on Ubuntu
...umr-marbec.fr> - Biodiversit? Marine et ses usages UMR 9190 - University of Montpellier Montpellier, France ???.???`?..><((((?>.???`?.???.???`?...?><((((?> 2016-12-02 12:21 GMT+01:00 Eduardo Morelli <emorelli1966 at gmail.com>: > Hi, > > I am trying to install pdftools package on R 3.3.1 (Ubuntu 16.04), but the > following issue occurs: > > ------------------------- ANTICONF ERROR --------------------------- > Configuration failed because poppler-cpp was not found. Try installing: > * deb: libpoppler-cpp-dev (Debian, Ubuntu, etc) > * rpm: popp...
2019 Sep 23
5
Consulta
Buenas tarde a todo en s: Tenia la versión de R 3.6 y utilizaba la paquetería de pdftools para extraer información de archivos en pdf actualice la versión 3.6.1 y ya no reconoce la paquetería alguien que me pueda ayudar. Prácticamente no reconoce las funciones de pdftools library(pdftools) library(stringr)? library(NLP)? library(tm)? library(tesseract)? library(magick)? install.package...
2019 Sep 24
2
Consulta
Emilio Ahora cuando quiero instalar los paquetes pdftools, magick y otros más me salen el siguiente error WARNING: Rtools is required to build R packages but is not currently installed. Please download and install the appropriate version of Rtools before proceeding: https://cran.rstudio.com/bin/windows/Rtools/ Installing package into ?C:/Users/bdomingu...
2023 Jul 05
1
textual analysis - transforming several pdf to txt - naming the files
...running the following scrip, but the names of txt files that I obtain are: pdftext1, pdftext2, pdftext3... What do I need to change? Thank you very much, Cec?lia Carmo Universidade de Aveiro - Portugal dirpath <- ("/Users/ceciliacarmo/documents/RTextualAnalysis/data/pdfs") library(pdftools) library(dplyr) convertpdf2txt <- function(dirpath){ files <- list.files(dirpath, full.names = T) x <- sapply(files, function(x){ x <- pdftools::pdf_text(x) %>% paste0(collapse = " ") %>% stringr::str_squish() return(x) }) } # apply function...
2011 Apr 02
4
help
...ng something critical.can someone provide help? My intention is to create a pdf file of a plot in R and then attach existing files from my system as attachment into the newly created pdf file. Any help would be greatly appreciated.. Here is the command line script i want to execute from within R. pdftools -S "attachfiles=C:\test1.pdf" -i C:\test2.pdf -o C:\test4.pdf Regards Vijayan Padmanabhan [[alternative HTML version deleted]]
2017 Oct 02
2
Minería de testo en R
Hola, Hay una adaptación específica a R de una solución comercial, pero que se puede usar hasta cierto número de llamdas: pdftools https://cloud.r-project.org/web/packages/pdftools/index.html Saludos, Carlos Ortega www.qualityexcellence.es El 2 de octubre de 2017, 9:22, Isidro Hidalgo Arellano <ihidalgo en jccm.es> escribió: > Yo he utilizado "tm" para tratar PDF de forma masiva, pero hay que tener >...
2024 Nov 25
1
Problemas usando paquete textreuse
...ivos para utilizar las funciones TextReuseCorpus() o TextReuseTextDocument(). En la documentación del paquete los archivos los cargan desde ¿Alguien sabe cómo se hace? He conseguido calcular la similitud de jaccard utilizando este paquete, pero para ello he empleado el siguiente código. library(pdftools) library(textreuse) text1 <- pdf_text("uno.pdf") text2 <- pdf_text("dos.pdf") full_text1 <- paste(text1, collapse = " ") full_text2 <- paste(text2, collapse = " ") a <- tokenize_words(full_text1) b <- tokenize_words(full_text2) jacc...
2023 Dec 02
1
adding "Page X of XX" to PDFs
You could easily omit the Page X of xX, but leave the timestamp Then add Page X of XX programmatically using pdftools or some similar pdf command line tools. On Sat, 2 Dec 2023, 22:35 , <avi.e.gross at gmail.com> wrote: > Having read all of the replies, it seems there are solutions for the > question and the OP points out that some solutions such as making the > document twice will affect the crea...
2017 Oct 02
2
Minería de testo en R
...acias Carlos. > > Manuel > > El 2 de octubre de 2017, 1:42, Carlos Ortega <cof en qualityexcellence.es> > escribió: > >> Hola, >> >> Hay una adaptación específica a R de una solución comercial, pero que se >> puede usar hasta cierto número de llamdas: pdftools >> >> https://cloud.r-project.org/web/packages/pdftools/index.html >> >> Saludos, >> Carlos Ortega >> www.qualityexcellence.es >> >> El 2 de octubre de 2017, 9:22, Isidro Hidalgo Arellano <ihidalgo en jccm.es> >> escribió: >> >&gt...
2023 Dec 02
1
adding "Page X of XX" to PDFs
...But what about the idea of creating a PDF with a placeholder like "Page N of XXX" and after the file has been created, dates and all, perhaps edit it programmatically and replace all instances of XXX with something of the same length like " 23" as there seem to be tools like the pdftools package that let you get the number of pages. I have no idea if some program, perhaps external, can do that and retain the date you want. -----Original Message----- From: R-help <r-help-bounces at r-project.org> On Behalf Of Dennis Fisher Sent: Friday, December 1, 2023 3:53 PM To: r-help at...
2023 Jul 05
1
textual analysis - transforming several pdf to txt - naming the files
convertpdf2txt <- function(dirpath){ files <- list.files(dirpath, pattern = "Consoli.*\\.pdf$", full.names = TRUE) files <- chartr("\\", "/", files) x <- lapply(files, function(x){ pdftools::pdf_text(x) %>% paste0(collapse = " ") %>% stringr::str_squish() }) new_names <- tools::file_path_sans_ext(files) new_names <- paste(new_names, "txt", sep = ".") setNames(x, new_names) } # apply function # note that my test files...
2020 Oct 07
1
Adding text to existing PDF's created with R
...ld be unchanged (except for a new filename). The intent is as follows: I have multiple PDFs that I eventually merge into a single PDF, separating each one with a separator page. The content of the separator pages comes from a Word document. The task is performed with textreadr, officer, and pdftools. I can insert page numbers into the separator pages (created as PDF documents). I join the separator pages and the original PDFs using python's join command. But I have not been able to figure out how to add page numbers to the existing PDF's. Any help would be appreciated. Dennis Denni...
2016 Sep 10
6
de pdf a csv
Estimados En ocasionas hay informaciones epidemiológicas en reportes pdf semanales como el que adjunto que quisiéramos llevar a csv o txt USANDO R para poder analizarlas estadísticamente. Apreciaríamos su ayuda si nos diesen un script, el paquete pdftable no me resultó. Saludos José -- Este mensaje le ha llegado mediante el servicio de correo electronico que ofrece Infomed para respaldar
2017 Sep 29
3
Minería de testo en R
Estimados miembros del grupo, Estoy buscando paquetes de R que permitan hacer minería de textos de archivos PDF o Word que tengan una estructura tabular (cuadros) de resultado de talleres de trabajo donde se tratan diferentes ejes temáticos. Especifico esto porque he visto que algunos paquetes analizan directamente texto de libros, tweets u otras fuentes donde no hay una estructura como cuadros
2011 Sep 17
1
Extracting a a chunk of text from a pdf file
In an R script I need to extract some figures from many web pages in pdf format. As an example see http://www.terna.it/LinkClick.aspx?fileticket=TTQuOPUf%2fs0%3d&amp;tabid=435&amp;mid=3072 from which I would like to extract the "Totale: 1,025,823"). Is there any solution? Ciao Vittorio
2024 Oct 13
1
Warning object has offset 0. PDF file
Good day to all Using > library(pdftools) Using poppler version 23.04.0 I get a number of warnings such as: > pdf_subset(infile, pages = 156:157, output = outfile) WARNING: /Users/jla/Library/CloudStorage/Dropbox/7cordas/Caio/record/90 NEW RODA SONG BOOK.pdf (object 7 0): object has offset 0 WARNING: /Users/jla/Library/CloudStorage/D...
2024 Nov 26
0
Resumen de R-help-es, Vol 187, Envío 10
...eCorpus() o TextReuseTextDocument(). > > En la documentación del paquete los archivos los cargan desde > > ¿Alguien sabe cómo se hace? > > He conseguido calcular la similitud de jaccard utilizando este paquete, > pero para ello he empleado el siguiente código. > > library(pdftools) > > library(textreuse) > > text1 <- pdf_text("uno.pdf") > > text2 <- pdf_text("dos.pdf") > > full_text1 <- paste(text1, collapse = " ") > > full_text2 <- paste(text2, collapse = " ") > > a <- tokenize_word...
2020 Jun 25
2
R 4.0.0 rebuild status
On Thu, 25 Jun 2020 at 19:01, Jos? Ab?lio Matos <jamatos at fc.up.pt> wrote: > > On Wednesday, 24 June 2020 10.42.10 WEST I?aki Ucar wrote: > > Thanks, Jos? and Elliott. I can help with reviews. > > > > I attach here a list of batches of CRAN packages to be rebuilt in > > order (batches separated by a blank line), and the script that > > generates it. Hope