similar to: Parsers for input to index?

Displaying 20 results from an estimated 300 matches similar to: "Parsers for input to index?"

2009 Dec 22
2
Reading PDF files
Hi: I need to do text mining on PDF files. I understand there is a readPDF command in tm that can be used. Have read the 2008 posts on converting PDF files to text by Tony Breyal and others. Wondering if the procedure has been standardized in any tutorial or otherwise? Being new to R, I was able to follow only part of the discussion. Any way to get a set of step by step instructions
2009 Jun 07
1
libwpd/libwpd-devel deps screwed?
# yum install libwpd-devel Loaded plugins: fastestmirror Loading mirror speeds from cached hostfile Setting up Install Process Parsing package install arguments Resolving Dependencies --> Running transaction check ---> Package libwpd-devel.i386 0:0.8.7-3.el5 set to be updated --> Processing Dependency: libwpd = 0.8.7-3.el5 for package: libwpd-devel --> Finished Dependency Resolution
2008 Aug 22
2
smbclient 3.2: smbc errno NT_STATUS_OK -> 22
As I was upgrading other things, the Debian package management system required me to upgrade Samba, from 3.0.24-6 to 3.2.0-4. All the sudden it quit working, which means I can't run an smbget or smbclient will allow DIR and CD but not GET. Either way with the new version I get (debugging on at level 10) smbc errno NT_STATUS_OK -> 22 followed by Can't read 64000 bytes at offset 0, file
2010 Sep 23
1
eps file
Dear All,   I need to create eps file which is the required figure format  of the journal that I want to submit a paper. I am able to create files in pdf or wmf format but not in eps format. Is there a way to convert pdf or wmf to eps? or alternatively, how can I create an eps file in R?   Any help is deeply appreciated.   Kind Regards   Seyit Ali
2008 Jan 04
1
Evaluating R expressions
All, Thank you for the prompt and useful answers to my questions. I had missed references in 5.7.6 which would have answered some of the points. As Bill pointed out a newer version of acrobat would help, but the Sun system here is still running 5.0. (An oversubscribed sysadmin). Then I could have searched and at at least avoided the most trivial. All three comments were different,
2008 Nov 13
1
readPDF() -- unsure how to install xpdf to make this work?
Dear R-Help, I need to convert a set of '.pdf' files into an equivalent set of '.txt' files. This is so that i can do some text mining on the content. In the latest R-News letter (http://cran.r-project.org/doc/Rnews/ Rnews_2008-2.pdf), the package 'tm' for text mining is mentioned. In that lovely package, there is a function called 'readPDF()'. In order to use
2023 Dec 11
1
Base R wilcox.test gives incorrect answers, has been fixed in DescTools, solution can likely be ported to Base R
While using the Hodges Lehmann Mean in DescTools (DescTools::HodgesLehmann), I found that it generated incorrect answers (see <https://github.com/AndriSignorell/DescTools/issues/97> https://github.com/AndriSignorell/DescTools/issues/97). The error is driven by the existence of tied values forcing wilcox.test in Base R to switch to an approximate algorithm that returns incorrect results - see
2006 May 29
1
OT: Monograph on Statistical Programming
Dear all, my question might be a bit off-topic. Is there anything like a standard textbook on statistical programming? With that I don't mean anything like MASS, S Programming, Programming with Data, ... (no offense meant, they are fantastic books and each of those three helped me a great deal). Rather in the direction of the "Numerical Recipes" Series addressing how to implement
2010 Nov 08
1
"yum -yq --security check-update" spouting lots of text?
uname -a Linux 2.6.18-194.17.4.el5.centos.plus #1 SMP Tue Oct 26 04:07:11 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux I thought -q meant 'no output'. Recently yum -q has started spewing the following garbage, can anyone tell me why? thanks, Dave # yum -yq --security check-update 2>/dev/null Loaded plugins: fastestmirror, priorities, security Loading mirror speeds from cached hostfile *
2009 Aug 06
3
rpmforge.repo
How come rpmforge.repo wants to install a whole slew of perl modules, yet if I leave rpmforge.repo out of the update process, none of the perl modules shows up? Here's the rpmforge.repo file.. do any of yall see anything wrong with anything in there? Sam # Name: RPMforge RPM Repository for Red Hat Enterprise 5 - dag # URL: http://rpmforge.net/ [rpmforge] name = Red Hat Enterprise
2009 Nov 04
6
xVM and zones not co-existing happily
Hi all, I installed 2009.06, xVM and happily installed a domU with no issues. I just did this through the virt-manager GUI, and kept it as simple as possible. Later I installed and booted a zone with very simple and standard configuration. My existing xVM virtual machine did not immediately stop working, or have any interruption at all. But, I can no longer create new virtual machines, or even
2010 Jan 09
4
parsing pdf files
I have a pdf file that I would like to parse into R: http://www.williams.edu/Registrar/geninfo/faculty.pdf For now, I open the file in Acrobat by hand, then save it "as text" and then use readLines(). That works fine but a) I am concerned that some information may be lost and b) I may be doing this a lot, so I would rather have R grab the information from the pdf file directly. So: is
2020 Nov 03
1
LibreOffice locking up
Mike McCarthy, W1NR <sysop at w1nr.net> ? 2020?11?3? ?? ??7:56??? > > > Can you reproduce the problem with this document? > > > > And, when the crash happened, could you still ping the computer from > > another device in the network? > > > > Crashing hard so that only a reset helps is usually only possible with a > > kernel bug or hardware issue,
2012 Jun 26
0
CESA-2012:1043 Important CentOS 5 libwpd Update
CentOS Errata and Security Advisory 2012:1043 Important Upstream details at : https://rhn.redhat.com/errata/RHSA-2012-1043.html The following updated files have been uploaded and are currently syncing to the mirrors: ( sha256sum Filename ) i386: 1f840991f4ec80bfcbd90c844d598645eb115624d383e7eb1cc76b1b8ecc396a libwpd-0.8.7-3.1.el5_8.i386.rpm
2012 Jun 27
0
CentOS-announce Digest, Vol 88, Issue 16
Send CentOS-announce mailing list submissions to centos-announce at centos.org To subscribe or unsubscribe via the World Wide Web, visit http://lists.centos.org/mailman/listinfo/centos-announce or, via email, send a message with subject or body 'help' to centos-announce-request at centos.org You can reach the person managing the list at centos-announce-owner at centos.org When
2020 Aug 12
5
Using SSSD + AD with Samba seems to require Winbind be running
Hi all, Configuration information right off the bat: Debian Buster 10.5 and Samba 2:4.9.5+dfsg-5+deb10u1. Testparm is at the bottom I'm running into some interesting behavior on a server I've configured to use SSSD to bind to the AD domain. I've successfully bound using "net ads" and can get tickets and so on, and have samba configured to use kerberos through sssd.
2009 Dec 22
0
Reading PDF files (using xpdf)
Greetings Zaki, You should really post this question on the R-help forum so that others might benefit from any responses. It's been a while since I've done this, but if memory serves, the basic process was to download xpdf and add it to the windows path, thus making it accessable from within R. Two methods follow: Method One (easiest) - using the awesome ?system command: (1) Download
2007 Nov 20
5
Compound search / grouping
Hi, Following problem: We have a tree structure with children and a root element (recursivly) stored in one table (imagine a threaded forum). Each of the children has a title which should be indexed by ferret. Now we want to make a search that returns only the root and searches all items. So if one node has "expensive" and nother node has "car" I want to enter
2013 Mar 06
1
can you suggest on extending ogg as short-clip container and the make of its tool?
On Tue, Mar 5, 2013 at 7:43 PM, <gildororonar at mail-on.us> wrote: > very short context clips, most of them less than one > second, only chokes file systems. > I'm not sure how you reached this conclusion, but I think you should revisit it. I think everyone, including you, will be a lot happier if you store each sound effect clip in its own file. This is a common practice
2004 Aug 24
1
error when unsubscribe (PR#7198)
Full_Name: Huan Huang Version: 1.9.1 OS: Win NT Submission from: (NULL) (155.140.122.227) Traceback: Traceback (most recent call last): File "/scratch/local/app/mailman-sfs/2.1.5/scripts/driver", line 87, in run_main main() File "/scratch/local/app/mailman-sfs/2.1.5/Mailman/Cgi/options.py", line 91, in main language = cgidata.getvalue('language') File