similar to: textual analysis - transforming several pdf to txt - naming the files

Displaying 20 results from an estimated 300 matches similar to: "textual analysis - transforming several pdf to txt - naming the files"

2023 Jul 05
1
textual analysis - transforming several pdf to txt - naming the files
I am taking my first steps in textual analysis with R. I have pdf files consisting of company reports for several years (1 file corresponds to 1 company and 1 year). My idea is to start by transforming all my pdf files into txt files for further treatment and analysis (this will allow me to group the files by company or by year, depending on the future analysis to be performed). I do not have
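A minimal sketch of the batch conversion this post describes, assuming the pdftools package is installed and all PDFs sit in one directory. The helper names `txt_name` and `convert_all` and the directory in the usage note are hypothetical; the post itself shows no code.

```r
# Derive the .txt name from the .pdf name, so that any company/year
# information encoded in the file name carries over unchanged.
txt_name <- function(pdf_path) {
  sub("\\.pdf$", ".txt", pdf_path, ignore.case = TRUE)
}

# Convert every PDF in `dir` to a text file written next to it.
# Assumption: pdftools is available; pdf_text() returns one string per page.
convert_all <- function(dir) {
  pdf_files <- list.files(dir, pattern = "\\.pdf$",
                          full.names = TRUE, ignore.case = TRUE)
  for (f in pdf_files) {
    pages <- pdftools::pdf_text(f)
    writeLines(pages, txt_name(f))
  }
  invisible(txt_name(pdf_files))
}
```

Under these assumptions, `convert_all("reports")` would write one .txt file per PDF, and the resulting names could later be split on a separator to group by company or year.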
2008 Mar 06
1
dictionary lookup
Hi, I have a character-valued vector (old_names) and want to translate its entries whenever possible, using a dictionary (dict=data.frame). The translation direction is dict$V3 --> dict$V2, but some values may be undefined (NA). I suppose this is a very basic task, but I tried in vain to make it more efficient than below. In particular I would like to avoid the explicit (and slow) loop Any
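One loop-free way to do the lookup described in this post is `match()`, sketched below. The example data are invented; only the names `old_names`, `dict$V2` and `dict$V3` come from the post. Keeping the original entry where no translation exists is an assumption about what "whenever possible" means.

```r
# Hypothetical example data.
old_names <- c("a", "b", "x")
dict <- data.frame(V2 = c("alpha", NA, "gamma"),
                   V3 = c("a", "b", "c"),
                   stringsAsFactors = FALSE)

# For each old name, find the row of dict whose V3 equals it (NA if none).
idx <- match(old_names, dict$V3)
translated <- dict$V2[idx]

# Assumption: fall back to the original name when there is no match
# or when the dictionary value itself is NA.
new_names <- ifelse(is.na(translated), old_names, translated)
```

Here "a" is translated, while "b" (dictionary value is NA) and "x" (not in the dictionary) keep their original names.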
2024 Apr 11
1
[External] Re: Repeated library() of one package with different include.only= entries
> I would assume that > library(Matrix, include.only="isDiagonal") > implies that only `isDiagonal` ends up on the search path This could also be a reasonable behavior, but neither does that happen today. > I think a far better approach to solve Michael's problem is simply to use > fac2sparse <- Matrix::fac2sparse This does not fully simulate attachment, e.g.
2024 Apr 15
1
[External] Re: Repeated library() of one package with different include.only= entries
I think we should try to advance and hopefully finalize this thread before we forget about it .. >>>>> Michael Chirico on Thu, 11 Apr 2024 09:10:11 -0700 writes: >> I would assume that >> library(Matrix, include.only="isDiagonal") >> implies that only `isDiagonal` ends up on the search path > This could also be a reasonable
2012 May 15
2
Renaming names in R matrix
I have the following matrix: > dat [,1] [,2] [,3] [,4] foo 0.7574657 0.2104075 0.02922241 0.002705617 foo 0.0000000 0.0000000 0.00000000 0.000000000 foo 0.0000000 0.0000000 0.00000000 0.000000000 foo 0.0000000 0.0000000 0.00000000 0.000000000 foo 0.0000000 0.0000000 0.00000000 0.000000000 foo 0.0000000 0.0000000 0.00000000 0.000000000 and given this:
2023 Jul 18
1
change language at console
On Wed, 1 Apr 2015, Prof Brian Ripley wrote: > I would start by trying LANGUAGE=en , e.g. More specifically, you can use en_US or en_GB. Rich [...]
2019 Feb 14
1
Proposed function file.backup
Dear R Core: In the kutils package, I wrote a function that is so handy that I would like to ask you put it in R itself. file.backup() will look at a file, find its last modification time, and create a new backup with a name that appends YYYYMMDD-HHMM to the file name. So now, whenever I worry that running write.csv or saving a graph might destroy something valuable, I use an idiom like fn
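The behaviour described in this post can be sketched roughly as follows. This is not the kutils implementation: the function name `backup_sketch`, the exact position of the timestamp in the new name, and the return value are assumptions made for illustration.

```r
# Hypothetical sketch of a timestamped backup, NOT the kutils code.
backup_sketch <- function(path) {
  if (!file.exists(path)) return(NULL)

  # Last modification time of the existing file.
  mtime <- file.info(path)$mtime
  stamp <- format(mtime, "%Y%m%d-%H%M")

  # Assumption: the stamp is appended to the whole file name.
  dest <- paste0(path, "-", stamp)

  # Never overwrite an existing backup; return the new name on success.
  ok <- file.copy(path, dest, overwrite = FALSE)
  if (ok) dest else NULL
}
```

Called just before `write.csv()` on the same path, a helper like this would leave a copy of the previous version on disk.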
2010 Apr 20
1
May need ruby developer
Hi, I'm looking for a developer to help build an SMS messaging site. Basically, users will have the ability to sign up for subscriptions to receive daily SMS text messages of a particular topic. I know there are a lot of sites out there doing this already from horoscopes to diet tips, so there is likely someone on this board with experience and it doesn't take much work to get
2024 Nov 25
1
Problems using the textreuse package
Hello: I found the textreuse package and want to use it to compare two PDF files. I have been unable to load the files in order to use the functions TextReuseCorpus() or TextReuseTextDocument(). In the package documentation the files are loaded from Does anyone know how this is done? I managed to compute the Jaccard similarity using this package, but to do so I have
2003 Feb 20
0
RE: pam_unix.so likeauth? (Was: Help with Winbind)
It makes it work! I was reading through the other messages on the list with similar subjects, and found this one: -----Original Message----- From: Matthias Rutzki [mailto:mrutzki@gmx.de] Sent: Thursday, February 20, 2003 4:51 AM To: david.boynton2@asu.edu; bgmilne@cae.co.za; cdevidal@yahoo.com Cc: samba@lists.samba.org Subject: [Samba] domain users in local groups with Winbind/Samba/Redhat It
2012 Jun 15
0
MCMCfactanal inquiries
Dear list, Hi, I am emailing to see if it would be possible to get some help on running MCMCfactanal on R. I am trying to derive a single score measuring a likelihood to receive IMF loans from 8 different variables related to IMF representation, but for some reason my code does not seem to take me to where I want to be. For some reason my code drops certain rows (countries) automatically, making it
2008 Oct 24
1
Emerging dilemma? DID forwarding meets SMS
We have a number of DIDs that do the standard VoIP tricks: ringing multiple locations, findme-followme etc. What is happening more and more is that customers call those DID numbers, and draw the reasonable conclusion that they are calling mobile numbers because they literally can HEAR that the called party is on a mobile. Consequently many of those customers draw the conclusion that they
2023 Mar 20
1
multiple-response questions and creating variables
Hi, how is everyone! I have the following problem. I have a question with multiple answers, but in the dataset all the answers are in a single variable, with each answer separated by ",". It looks like this [image: image.png] I do this to split the different answers into columns: *mult_bas_dat<-mult_bas_dat %>% separate_rows(bases, sep = ",")
2023 Mar 21
1
multiple-response questions and creating variables
Hi, the images do not show up, so I cannot reproduce your example, but perhaps the following will help: > library('stringr') > d <- data.frame(respuestas = c('manzana, naranja', 'manzana, melocotón', 'naranja, melocotón')) > d respuestas 1 manzana, naranja 2 manzana, melocotón 3 naranja, melocotón > d$manzana <-
2015 Oct 20
1
[PATCH v3 01/13] v2v: factor out opening input VM
Opening the source VM and amending the properties in its internal representation in accordance with command-line options fit nicely into two isolated functions. Signed-off-by: Roman Kagan <rkagan@virtuozzo.com> --- v2v/v2v.ml | 124 ++++++++++++++++++++++++++++++++----------------------------- 1 file changed, 65 insertions(+), 59 deletions(-) diff --git a/v2v/v2v.ml b/v2v/v2v.ml index
2011 Nov 17
2
[LLVMdev] Fwd: Problem getting LoopInfo inside non-LoopPass
Nick, Thanks for this info, though this didn't help my problem at all. On Wed, Nov 16, 2011 at 7:21 PM, Nick Lewycky <nicholas at mxc.ca> wrote: > Never create a Twine as a local variable. > > V->setName(Twine("new_name")); > > should work fine, however. Note that Twine itself has an implicit > constructor from const char *, so this code: > >
2008 Mar 26
0
swat is now broken
seems to mess up the pam for swat. [root at host67 security]# rpm -qf /etc/pam.d/samba samba-3.0.25b-1.el4_6.4 [root at host67 security]# cat /etc/pam.d/samba #auth required /lib/security/pam_stack.so service=system-auth #account required /lib/security/pam_stack.so service=system-auth auth required pam_stack.so service=system-auth account required
2011 Nov 17
0
[LLVMdev] Fwd: Problem getting LoopInfo inside non-LoopPass
Never create a Twine as a local variable. V->setName(Twine("new_name")); should work fine, however. Note that Twine itself has an implicit constructor from const char *, so this code: V->setName("new_name"); should also work fine. Nick Ryan Taylor wrote: > Basically I have two separate passes (first is a loop pass) which are > two different files and
2011 Nov 17
0
[LLVMdev] Fwd: Problem getting LoopInfo inside non-LoopPass
So is this simply not possible? On Thu, Nov 17, 2011 at 10:31 AM, Ryan Taylor <ryta1203 at gmail.com> wrote: > Nick, > > Thanks for this info, though this didn't help my problem at all. > > > On Wed, Nov 16, 2011 at 7:21 PM, Nick Lewycky <nicholas at mxc.ca> wrote: > >> Never create a Twine as a local variable. >> >>
2020 Aug 25
2
Re: [RFC nbdkit PATCH] protocol: Alter .list_exports, add .default_export
On Tue, Aug 25, 2020 at 06:16:17AM -0500, Eric Blake wrote: > On 8/25/20 5:00 AM, Richard W.M. Jones wrote: > > >>>+=head2 C<.default_export> > >>>+ > >>>+ const char *default_export (int readonly, int is_tls); > >> > >>Oh fun. For some plugins (like ondemand), this is trivial: return a > >>compile-time constant string.