Displaying 20 results from an estimated 8000 matches similar to: "PDF text strangeness (PR#7043)"
2023 Jul 05
1
textual analysis - transforming several pdf to txt - naming the files
convertpdf2txt <- function(dirpath){
  files <- list.files(dirpath, pattern = "Consoli.*\\.pdf$", full.names = TRUE)
  files <- chartr("\\", "/", files)
  x <- lapply(files, function(x){
    pdftools::pdf_text(x) %>%
      paste0(collapse = " ") %>%
      stringr::str_squish()
  })
new_names <-
2023 Jul 05
1
textual analysis - transforming several pdf to txt - naming the files
I am taking my first steps in textual analysis with R.
I have pdf files consisting of company reports for several years (1 file corresponds to 1 company and 1 year).
My idea is to start by transforming all my pdf files into txt files for further treatment and analysis (this will allow me to group the files by company or by year, depending on the future analysis to be performed).
I do not have
2004 Oct 07
1
Mac: importing saved PDF figures into Illustrator CS
Hello,
This is a usage question for others with experience of R under the Aqua Mac OS X interface.
Basically, I don't succeed in importing PDF files (created with the 'Save As' menu to Quartz device windows) into Illustrator. Versions up to (and including?) 10 lose paths (lines/polygons) and/or fill them in black. Version CS (11) on the Mac imports almost correctly, but, "to
2024 Nov 25
1
Problems using the textreuse package
Hello all:
I have found the textreuse package and want to use it to compare two
pdf files.
I have been unable to load the files in order to use the functions
TextReuseCorpus() or TextReuseTextDocument().
In the package documentation the files are loaded from
Does anyone know how this is done?
I have managed to compute the Jaccard similarity using this package,
but to do so I have
2004 May 10
5
Gmane for list?
Hi all,
I was wondering if I might subscribe this list to gmane.org, so I (and
everyone else) might view and search it with a newsreader?
Also--is there any way to get it indexed by google, maybe along with the
r-help list (or is it already but I haven't found it)?
Thanks for your consideration
W
2004 Jun 03
3
printing tabular data nicely
Hi R-helpers,
I would like to print various matrices, dataframes, tables, etc to
files, preferably nicely formatted postscript for import into papers.
Is there a way to do this?
I know ?cat, ?writeLines, ?format, ?paste. But I am not sure of a good
combination of these in order to get a nice looking table of information.
Any ideas? I guess I want (almost) publication ready output, just
2002 May 07
3
good procedure for creating plots for PowerPoint
I am generating some graphs (on GNU/Linux using R 1.5.x) for a
PowerPoint presentation my boss is giving next week. He just tried
cutting and pasting a plot from a PDF file I had created, but he complained
that the fonts were too small and fuzzy, and that he couldn't change
the background. I have been playing with png(bg="transparent"), but I
am not sure what appropriate height,width parameters and
2013 Apr 08
1
problems with text in plots when using illustrator
Hi folks,
I have some problems with plots (any) saved from R (saved from the menu). It
seems that text (either plot titles or axes) is sometimes not "concatenated"
in a full "vector" (Illustrator-speaking). I mean, sometimes, a given title
is broken in several different chunks even though in R, it's in one "block".
Moreover, axis values (especially those < 10)
2004 May 19
4
GUI data browsers
Hi R-helpers,
Does anyone know of a good Tk (or other) script that allows somewhat
intuitive viewing of all of the variables (especially complex lists) in
a session? I would also like to delete, save to file, and rename them
graphically. I would also like a lot of supplemental information like
class, attributes, names, modes, whatever else...
Context: I am running simulations
2012 Jun 03
3
Modify pdf file with Illustrator.
Dear list,
Sometimes I want to modify a PDF file produced by R with
Illustrator. But when I open the PDF file in Illustrator, it often
changes the PDF slightly. Does anyone have suggestions? Is it
better to use another file type instead of PDF? Or should I set
some other parameters when I produce the PDF?
Thank you
2017 Oct 10
2
[RFC] Add SeaHorn and Crab-llvm to Users.html
Hi folks,
We would like to add SeaHorn <http://seahorn.github.io/> and Crab-llvm
<https://github.com/seahorn/crab-llvm> to the list of LLVM's users on
http://llvm.org/Users.html under Open Source Projects. The descriptions
would be:
1. SeaHorn | An Algorithmic Logic-Based Reasoning Framework.
2. Crab-llvm | A static analyzer based on abstract interpretation for LLVM.
I attach a
2020 Oct 07
1
Adding text to existing PDF's created with R
R 4.0.2
OS X
Colleagues
Does R have the capability of adding text (e.g., page numbers) to an existing PDF (previously created with R) -- other than adding this text, the PDF should be unchanged (except for a new filename).
The intent is as follows:
I have multiple PDFs that I eventually merge into a single PDF, separating each one with a separator page.
The content of the separator pages
2008 Jun 11
3
Finding Coordinate of Max/Min Value in a Data Frame
Hi,
Suppose I have the following data frame.
__BEGIN__
> library(MASS)
> data(crabs)
> crab.pca <- prcomp(crabs[,4:8],retx=TRUE)
> crab.pca$rotation
         PC1        PC2        PC3        PC4        PC5
FL 0.2889810  0.3232500 -0.5071698  0.7342907  0.1248816
RW 0.1972824  0.8647159  0.4141356 -0.1483092 -0.1408623
CL 0.5993986 -0.1982263 -0.1753299 -0.1435941 -0.7416656
CW
2007 Dec 06
2
End of whiskers of boxplots are repeated on PDF device (PR#10499)
Full_Name: Michael Toews
Version: 2.61
OS: WinXP SP2
Submission from: (NULL) (142.58.206.114)
Using boxplot on a PDF device with more than one group (or boxes) produces
multiple (and overlain) 1st and 3rd quartile ticks. There are exactly the
multiple of boxplot groups as there are of each 1st and 3rd quartile ticks for
each boxplot (drawn as a horizontal line at the end of each boxplot), which
2004 May 05
5
cgi/servlets/httpd in R
Hi R-helpers
Has anyone had any experience doing CGI or Servlets or using an httpd
server in R?
Context: we are interested in running dynamic demographic calculations
on the internet (life tables, and such) in response to form submissions.
I might do simple cgi to get started, but loading in data each time
seems like it would be quite, quite slow.
I have looked at Rserv, but I am most
2005 Nov 25
0
Questions before starting an interesting Xen project
Hello,
Four other students and I at the Royal Institute of Technology (KTH) in
Sweden are in the startup phase of an interesting project utilizing
Xen. We have been asked to modernise and improve the performance of a
router simulation package called Crab[1]. Currently Crab runs a few
instances of UML and inside each guest OS Quagga[2] is run. Quagga is a
routing daemon with interface similar
2001 Dec 20
0
nmbd forkbombed my machine :-\
Hello everyone,
Assuming this is the right place to post potential bug reports, try this :
(Apologies if it's been posted/covered somewhere else, but your lists
don't seem to have a search facility...)
nmbd had a good go at fork bombing my "server" just now, with a setup as
follows :
Samba 2.2.0a on Slackware Linux v8.0, linux kernel 2.4.16. i586 with
500ish mb of ram.
2004 May 28
5
vector normal to a plane
Hi All,
(I have a degree in math, but I am too embarrassed to ask my colleagues,
so here goes:)
I would like to get a vector normal (orthogonal) to a plane formed by
two other vectors. In matlab I do this:
v1 = [.4, .6, .8]; v2 = [.9, .7, .2]; nn = cross(v1,v2) (gives ~[-.48,
.65, -.24])
If I do R> cross(v1, v2), I get .94. Huh?
Thanks for all your help, again.
W
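A minimal sketch of the cross product asked about above, written out from its definition. Base R has no built-in cross product for two 3-vectors, so the helper name cross3 here is hypothetical and not taken from the thread. The .94 reported in the post equals the dot product of v1 and v2 (0.36 + 0.42 + 0.16), which is what base R's crossprod(v1, v2) computes, so presumably something of that kind was being called.

```r
# Hypothetical helper (not part of base R): cross product of two
# length-3 numeric vectors, written out from the definition.
cross3 <- function(a, b) {
  stopifnot(length(a) == 3, length(b) == 3)
  c(a[2] * b[3] - a[3] * b[2],
    a[3] * b[1] - a[1] * b[3],
    a[1] * b[2] - a[2] * b[1])
}

v1 <- c(0.4, 0.6, 0.8)
v2 <- c(0.9, 0.7, 0.2)

# Vector normal to the plane spanned by v1 and v2
nn <- cross3(v1, v2)

# Sanity check: a normal vector has zero dot product with both inputs
# (up to floating-point rounding)
sum(nn * v1)
sum(nn * v2)

# For comparison: the dot (inner) product of v1 and v2, returned by
# crossprod() as a 1 x 1 matrix
crossprod(v1, v2)
```

Dividing nn by sqrt(sum(nn^2)) would give a unit-length normal if one is needed.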
2011 Mar 23
0
removing data randomly based on previous observations
Hello,
I have a dataset which contains multiple trap pulls over the course of a
fishing season. Each trap was baited and then returned to the water to be
pulled again. I am trying to remove that bait from the catch observed on
the next pull (the bait is the same as the catch, lobster). The traps have
been baited differently throughout the season.
I would like to randomly remove two lobsters
2006 Aug 31
2
R-Project logo in SVG format
Hi,
I'm looking for a version of the R-Project logo in SVG format. I've found
the bitmapped versions,
http://developer.r-project.org/Logo/
but would prefer a scalable version as it usually looks better when
printed.
Where may I find one?
-Lars
Lars Nooden (lars at umich.edu)
On the Internet, nobody knows you're a dog ...
... until you start barking.