search for: htdig

Displaying 20 results from an estimated 62 matches for "htdig".

2006 Mar 29
1
htdig with omega for multiple URLs (websites)
Olly, many thanks for suggesting htdig, you saved me a lot of time. Htdig looks better than my original idea - wget, you were right. Using htdig, I can crawl and search single website - but I need to integrate search of pages spread over 100+ sites. Learning, learning.... Htdig uses separate document database for every website (one d...
2001 Nov 08
0
[RHSA-2001:139-04] Updated htdig packages are available
--------------------------------------------------------------------- Red Hat, Inc. Red Hat Security Advisory Synopsis: Updated htdig packages are available Advisory ID: RHSA-2001:139-04 Issue date: 2001-10-24 Updated on: 2001-10-30 Product: Red Hat Linux Keywords: htdig CGI htsearch DOS configuration file -c switch security Cross references: http://www.securityspace.com/smysecure/catid.htm...
2001 Oct 09
0
Security Update: [CSSA-2001-035.0] Linux - Remote File View Problem in htdig
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 ______________________________________________________________________________ Caldera International, Inc. Security Advisory Subject: Linux - Remote File View Problem in htdig Advisory number: CSSA-2001-035.0 Issue date: 2001, October 09 Cross reference: ______________________________________________________________________________ 1. Problem Description A security problem in all versions of htdig has been reported to bugtraq by the htdig authors. This vulner...
2006 Mar 17
1
omega crawler: ht://dig or wget?
At wiki page: http://wiki.xapian.org/Omega I added a comment that ht://Dig looks like dead. Does anybody really use it? >From brief glance at docs I had a feeling it is not easy to configure. Maybe better crawler is GNU wget? Mature, stable, maintained? -- Peter Masiar
2004 Nov 30
5
RE: [Shorewall-devel] SFTP
...ried to search on generic words, nothing. I then tried some > really common words like ''help'', ''initiated'', ''masq'' - nothing. I think > the index might be corrupt because I get no results from anything when I > search the archives. The htdig packager for Fedora seems to think that it is more important that people have the lastest config files than that they have the right config files. When I installed an update via ''up2date'' my htdig.conf file was renamed htdig.conf.rpmsave and a new file was installed in its place....
2007 Dec 03
0
CESA-2007:1095 Moderate CentOS 4 s390(x) htdig - security update
CentOS Errata and Security Advisory 2007:1095 https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: s390: updates/s390/RPMS/htdig-3.2.0b6-4.c4.s390.rpm updates/s390/RPMS/htdig-web-3.2.0b6-4.c4.s390.rpm s390x: updates/s390x/RPMS/htdig-3.2.0b6-4.c4.s390x.rpm updates/s390x/RPMS/htdig-web-3.2.0b6-4.c4.s390x.rpm -- Pasi Pirhonen - upi at iki.fi - http://pasi.pirhonen.eu/ Top-postings silently ignored -------------- next part -...
2007 Dec 05
0
CESA-2007:1095 Moderate CentOS 5 x86_64 htdig Update
CentOS Errata and Security Advisory 2007:1095 Moderate Upstream details at : https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) x86_64: 9cb4b14b7e1a32596705f2ed6882f7ef htdig-3.2.0b6-9.0.1.el5_1.x86_64.rpm b96548484dfaf007eb3d4c362ed577f8 htdig-web-3.2.0b6-9.0.1.el5_1.x86_64.rpm Source: 00badca9e41aba302819de85f7935ce2 htdig-3.2.0b6-9.0.1.el5_1.src.rpm -- Karanbir Singh CentOS Project { http://www.centos.org/ } irc: z00dax, #centos at irc.freenode.net
2007 Dec 05
0
CESA-2007:1095 Moderate CentOS 5 i386 htdig Update
CentOS Errata and Security Advisory 2007:1095 Moderate Upstream details at : https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) i386: b4b53fd6444cd16ca1ba49ff3326f2ca htdig-3.2.0b6-9.0.1.el5_1.i386.rpm 70f178075fab7be728b9bcdfff7f25ca htdig-web-3.2.0b6-9.0.1.el5_1.i386.rpm Source: 00badca9e41aba302819de85f7935ce2 htdig-3.2.0b6-9.0.1.el5_1.src.rpm -- Karanbir Singh CentOS Project { http://www.centos.org/ } irc: z00dax, #centos at irc.freenode.net
2007 Dec 05
0
CentOS-announce Digest, Vol 34, Issue 5
...uest at centos.org You can reach the person managing the list at centos-announce-owner at centos.org When replying, please edit your Subject line so it is more specific than "Re: Contents of CentOS-announce digest..." Today's Topics: 1. CESA-2007:1095 Moderate CentOS 5 x86_64 htdig Update (Karanbir Singh) 2. CESA-2007:1095 Moderate CentOS 5 i386 htdig Update (Karanbir Singh) 3. CentOS-5.1 x86_64 cd isos refresh (Karanbir Singh) 4. CEBA-2007:1106 CentOS 5 i386 gfs-kmod Update (Karanbir Singh) 5. CEBA-2007:1106 CentOS 5 x86_64 gfs-kmod Update (Karanbi...
2005 Oct 09
1
Mailman + htDig
Hello, Does anyone know if there is an rpm for mailman with the htdig integration? If not, how hard is to patch mailman to use this feature? Anyone have this working? TIA
2007 Dec 03
0
CESA-2007:1095 Moderate CentOS 4 ia64 htdig - security update
CentOS Errata and Security Advisory 2007:1095 https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: ia64: updates/ia64/RPMS/htdig-3.2.0b6-4.c4.ia64.rpm updates/ia64/RPMS/htdig-web-3.2.0b6-4.c4.ia64.rpm -- Pasi Pirhonen - upi at iki.fi - http://pasi.pirhonen.eu/ Top-postings silently ignored -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size...
2007 Dec 04
0
CentOS-announce Digest, Vol 34, Issue 4
...equest at centos.org You can reach the person managing the list at centos-announce-owner at centos.org When replying, please edit your Subject line so it is more specific than "Re: Contents of CentOS-announce digest..." Today's Topics: 1. CESA-2007:1095 Moderate CentOS 4 ia64 htdig - security update (Pasi Pirhonen) 2. CESA-2007:1095 Moderate CentOS 4 s390(x) htdig - security update (Pasi Pirhonen) 3. CESA-2007:1049 Important CentOS 3 i386 kernel - security and bug fix update (Tru Huynh) 4. CESA-2007:1049 Important CentOS 3 x86_64 kernel - security...
2010 Sep 28
4
Mailman - searchable archive
Mailman works well for our mailing lists, but the archive is unacceptable - the worst thing is lack of search function. I got one tip for this: 1) emails converted to html format with mhonarc 2) search can be done with htdig Opinions? Maybe there are better software solutions for this - I hope. - Jussi -- Jussi Hirvi * Green Spot Topeliuksenkatu 15 C * 00250 Helsinki * Finland Tel. +358 9 493 981 * Mobile +358 40 771 2098 (only sms) jussi.hirvi at greenspot.fi * http://www.greenspot.fi
2002 May 05
1
possible changed organization of help files in 1.5.0?
I recently updated my search site at http://finzi.psych.upenn.edu It took about 7 hours with htDig, instead of the usual 3 hours. Now, when I search the "functions" index I get all sorts of non-html documents (indicated with brackets). These have never shown up before. Did htDig mogrify itself while it sat on my computer? (I didn't do anything to change it.) Or, more likely, I...
2004 Nov 22
3
Test builds for CYGWIN and IRIX?
I'm starting to prepare the next release. Since 0.8.3 I've made a number of changes to get working builds working on HPUX and OSF, and made some of the Windows specific bits more robust. I'd like to check that these haven't broken CYGWIN or IRIX builds, but I don't have access to these platforms. If you are able to test, it'd be most appreciated if you could. Download a
2003 Dec 01
3
search site for R (http://finzi.psych.upenn.edu)
My search site, http://finzi.psych.upenn.edu, has had several problems recently, all my fault, for which I apologize. But it now seems to be running reliably, on a new computer that is much faster than the old one. It uses htdig to permit search of the Rhelp mailing list, R documents, R functions, and various combinations of these. Search has several options, including Boolean search (with AND, etc.). Suggestions are welcome. -- Jonathan Baron, Professor of Psychology, University of Pennsylvania Home page: htt...
2004 Dec 23
1
searching Jonathan Baron's R Site
...rade at Penn. It will also be down at least one day before that, while I upgrade the operating system. (And another day some time in January because of a planned power outage.) Second, I have replaced the search engine in my R site: http://finzi.psych.upenn.edu/ I am now using Namazu instead of HtDig. The direct link to the search page is http://finzi.psych.upenn.edu/nmz.html Namazu has capabilities that HtDig does not have, such as wildcard searches. (On the down side, its phrase searching works fine for two word phrases, but a search for "A B C" will actually produce something li...
2006 May 26
1
Unicode troubles
...earch.xapian.general/1927 Now the QueryParser works as I wants it to do, and creates the terms correctly. But sadly I can't find any documents. If I do this; $ quest -d /var/lib/xapian r?serbil -> no results $ query -d /var/lib/xapian r*serbil -> result I'm indexing the pages from a htdig database using htdig2omega. I've tried to parse the db.docs-file as generated by htdump or after it's been converted to utf-8 by iconv. I've also tried to replace the p_* functions in scriptindex.cc to U_ ones -- just like the first patch does -- but I'm unable to get it to work. A...
2007 Feb 08
1
Getting custom field data from the page through crawling
...w.. My next quest is to implement a system of creating custom fields in the index. Our site is fully dynamic. That is, every page is generated in PHP and there are enough different kinds of pages that I wouldn't want to get into the business of indexing the DB directly, so I think that using htdig to crawl the site is the best way to go.. But, I would like to be able to search for things by field such as 'type', 'category', 'name', 'city', etc. I thought about it a lot and also did a lot of reading and research in the list archives but couldn't come up w...
2006 Apr 11
3
Robust Search Solution (with CentOS 4.3)
I've got about 10,000 docs I'd like to devise a search/index for. I found a perl script called Perlfect that can do that on an old P3 but at the astronomical time of 7 hours. Another script(cgi/perl) at hotscripts can do the same but allows the "rm -rf /" exploit. DoH!? Is there anything perl/flatfile that can search/index faster? This is a nice job for an aging P3 in the