similar to: htdig with omega for multiple URLs (websites)

Displaying 20 results from an estimated 2000 matches similar to: "htdig with omega for multiple URLs (websites)"

2006 Mar 17
1
omega crawler: ht://dig or wget?
At wiki page: http://wiki.xapian.org/Omega I added a comment that ht://Dig looks like dead. Does anybody really use it? >From brief glance at docs I had a feeling it is not easy to configure. Maybe better crawler is GNU wget? Mature, stable, maintained? -- Peter Masiar
2006 May 26
1
Unicode troubles
Hi, I've tried to follow all helpful tips I've found in the mailing-list and I've applied these two utf-8 patches; http://article.gmane.org/gmane.comp.search.xapian.general/2324 http://article.gmane.org/gmane.comp.search.xapian.general/1927 Now the QueryParser works as I wants it to do, and creates the terms correctly. But sadly I can't find any documents. If I do this; $ quest
2007 Feb 08
1
Getting custom field data from the page through crawling
Now on to my next question.. I've got the search and indexing working well for now.. My next quest is to implement a system of creating custom fields in the index. Our site is fully dynamic. That is, every page is generated in PHP and there are enough different kinds of pages that I wouldn't want to get into the business of indexing the DB directly, so I think that using htdig to crawl
2005 Oct 09
1
Mailman + htDig
Hello, Does anyone know if there is an rpm for mailman with the htdig integration? If not, how hard is to patch mailman to use this feature? Anyone have this working? TIA
2001 Nov 08
0
[RHSA-2001:139-04] Updated htdig packages are available
--------------------------------------------------------------------- Red Hat, Inc. Red Hat Security Advisory Synopsis: Updated htdig packages are available Advisory ID: RHSA-2001:139-04 Issue date: 2001-10-24 Updated on: 2001-10-30 Product: Red Hat Linux Keywords: htdig CGI htsearch DOS configuration file -c switch security Cross
2001 Oct 09
0
Security Update: [CSSA-2001-035.0] Linux - Remote File View Problem in htdig
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 ______________________________________________________________________________ Caldera International, Inc. Security Advisory Subject: Linux - Remote File View Problem in htdig Advisory number: CSSA-2001-035.0 Issue date: 2001, October 09 Cross reference: ______________________________________________________________________________ 1.
2007 Dec 03
0
CESA-2007:1095 Moderate CentOS 4 s390(x) htdig - security update
CentOS Errata and Security Advisory 2007:1095 https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: s390: updates/s390/RPMS/htdig-3.2.0b6-4.c4.s390.rpm updates/s390/RPMS/htdig-web-3.2.0b6-4.c4.s390.rpm s390x: updates/s390x/RPMS/htdig-3.2.0b6-4.c4.s390x.rpm updates/s390x/RPMS/htdig-web-3.2.0b6-4.c4.s390x.rpm
2007 Dec 05
0
CESA-2007:1095 Moderate CentOS 5 x86_64 htdig Update
CentOS Errata and Security Advisory 2007:1095 Moderate Upstream details at : https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) x86_64: 9cb4b14b7e1a32596705f2ed6882f7ef htdig-3.2.0b6-9.0.1.el5_1.x86_64.rpm b96548484dfaf007eb3d4c362ed577f8 htdig-web-3.2.0b6-9.0.1.el5_1.x86_64.rpm
2007 Dec 05
0
CESA-2007:1095 Moderate CentOS 5 i386 htdig Update
CentOS Errata and Security Advisory 2007:1095 Moderate Upstream details at : https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) i386: b4b53fd6444cd16ca1ba49ff3326f2ca htdig-3.2.0b6-9.0.1.el5_1.i386.rpm 70f178075fab7be728b9bcdfff7f25ca htdig-web-3.2.0b6-9.0.1.el5_1.i386.rpm Source:
2007 Dec 03
0
CESA-2007:1095 Moderate CentOS 4 ia64 htdig - security update
CentOS Errata and Security Advisory 2007:1095 https://rhn.redhat.com/errata/RHSA-2007-1095.html The following updated files have been uploaded and are currently syncing to the mirrors: ia64: updates/ia64/RPMS/htdig-3.2.0b6-4.c4.ia64.rpm updates/ia64/RPMS/htdig-web-3.2.0b6-4.c4.ia64.rpm -- Pasi Pirhonen - upi at iki.fi - http://pasi.pirhonen.eu/ Top-postings silently ignored --------------
2004 Nov 30
5
RE: [Shorewall-devel] SFTP
On Tue, 2004-11-30 at 12:17 +0700, Matthew Hodgett wrote: > > As for the 169.254 issue I tried to search the archives but got nothing. > I then tried to search on generic words, nothing. I then tried some > really common words like ''help'', ''initiated'', ''masq'' - nothing. I think > the index might be corrupt because I get no
2002 May 05
1
possible changed organization of help files in 1.5.0?
I recently updated my search site at http://finzi.psych.upenn.edu It took about 7 hours with htDig, instead of the usual 3 hours. Now, when I search the "functions" index I get all sorts of non-html documents (indicated with brackets). These have never shown up before. Did htDig mogrify itself while it sat on my computer? (I didn't do anything to change it.) Or, more likely, I
2010 Sep 28
4
Mailman - searchable archive
Mailman works well for our mailing lists, but the archive is unacceptable - the worst thing is lack of search function. I got one tip for this: 1) emails converted to html format with mhonarc 2) search can be done with htdig Opinions? Maybe there are better software solutions for this - I hope. - Jussi -- Jussi Hirvi * Green Spot Topeliuksenkatu 15 C * 00250 Helsinki * Finland Tel. +358 9
2011 Apr 17
3
Report for http://trac.xapian.org/wiki/SupportedPlatforms
Hello :-) There was probably no good reason to do this build but the Debian 6.0 Squeeze repo version was 1.2.3, we needed 1.2.4 and I didn't think of using the package from unstable. Arch: x86_64 Platform: Linux 2.6 Debian 6.0 (Squeeze) Compiler: gcc version 4.4.5 (Debian 4.4.5-8) Version: 1.2.4 Status: no known problems Source: http://oligarchy.co.uk/xapian/1.2.4/xapian-core-1.2.4.tar.gz
2003 Dec 01
3
search site for R (http://finzi.psych.upenn.edu)
My search site, http://finzi.psych.upenn.edu, has had several problems recently, all my fault, for which I apologize. But it now seems to be running reliably, on a new computer that is much faster than the old one. It uses htdig to permit search of the Rhelp mailing list, R documents, R functions, and various combinations of these. Search has several options, including Boolean search (with AND,
2004 Nov 22
3
Test builds for CYGWIN and IRIX?
I'm starting to prepare the next release. Since 0.8.3 I've made a number of changes to get working builds working on HPUX and OSF, and made some of the Windows specific bits more robust. I'd like to check that these haven't broken CYGWIN or IRIX builds, but I don't have access to these platforms. If you are able to test, it'd be most appreciated if you could. Download a
2007 Dec 05
0
CentOS-announce Digest, Vol 34, Issue 5
Send CentOS-announce mailing list submissions to centos-announce at centos.org To subscribe or unsubscribe via the World Wide Web, visit http://lists.centos.org/mailman/listinfo/centos-announce or, via email, send a message with subject or body 'help' to centos-announce-request at centos.org You can reach the person managing the list at centos-announce-owner at centos.org When
2007 Dec 04
0
CentOS-announce Digest, Vol 34, Issue 4
Send CentOS-announce mailing list submissions to centos-announce at centos.org To subscribe or unsubscribe via the World Wide Web, visit http://lists.centos.org/mailman/listinfo/centos-announce or, via email, send a message with subject or body 'help' to centos-announce-request at centos.org You can reach the person managing the list at centos-announce-owner at centos.org When
2002 Feb 12
1
SMB-server from Win2k -> Red Hat Linux 7.2 - Samba 2.2.1a seen in Network Neighbourhood but not browsable
Hi everyone, I am new to the list and new to Linux as well as to Samba. I've read quite a few howtos, man-pages and other docs on samba now. I configured my own SMB-server with smbd and nmbd. Of course I generated a smb.conf file and my server is accessible and running on the Linux mashine. I can connect with smbclient to the Linux-mashine and to the win2k-mashine (hostname and service
2004 Dec 23
1
searching Jonathan Baron's R Site
First, my site will be down December 27-28 because of a network upgrade at Penn. It will also be down at least one day before that, while I upgrade the operating system. (And another day some time in January because of a planned power outage.) Second, I have replaced the search engine in my R site: http://finzi.psych.upenn.edu/ I am now using Namazu instead of HtDig. The direct link to the