similar to: Unicode troubles

Displaying 20 results from an estimated 400 matches similar to: "Unicode troubles"

2006 Mar 29
1
htdig with omega for multiple URLs (websites)
Olly, many thanks for suggesting htdig, you saved me a lot of time. Htdig looks better than my original idea - wget, you were right. Using htdig, I can crawl and search single website - but I need to integrate search of pages spread over 100+ sites. Learning, learning.... Htdig uses separate document database for every website (one database per URL to initiate crawling). Htdig also can merge
2006 Mar 17
1
omega crawler: ht://dig or wget?
At wiki page: http://wiki.xapian.org/Omega I added a comment that ht://Dig looks like dead. Does anybody really use it? >From brief glance at docs I had a feeling it is not easy to configure. Maybe better crawler is GNU wget? Mature, stable, maintained? -- Peter Masiar
2014 Jan 26
1
Trouble installing package in development version of R
Dear R-devel, I'm trying to get practice learning more about the internals of R so am experimenting with the development version. I successfully cloned the SVN repo and built and installed R. I am using 2014-01-23 r64861. However, I cannot seem to install the lattice package. I tried within R using install.packages and got the error sh: 1: 1: not found I then downloaded the tar.gz and tried
2011 Apr 17
3
Report for http://trac.xapian.org/wiki/SupportedPlatforms
Hello :-) There was probably no good reason to do this build but the Debian 6.0 Squeeze repo version was 1.2.3, we needed 1.2.4 and I didn't think of using the package from unstable. Arch: x86_64 Platform: Linux 2.6 Debian 6.0 (Squeeze) Compiler: gcc version 4.4.5 (Debian 4.4.5-8) Version: 1.2.4 Status: no known problems Source: http://oligarchy.co.uk/xapian/1.2.4/xapian-core-1.2.4.tar.gz
2004 Jun 28
2
[Fwd: Irix install of omega fails.]
OK, I'll try again. Thanks, Jim. -------------- next part -------------- An embedded message was scrubbed... From: Jim Lynch <jwl@sgi.com> Subject: Irix install of omega fails. Date: Mon, 28 Jun 2004 14:16:46 -0400 Size: 2057 Url: http://lists.tartarus.org/pipermail/xapian-discuss/attachments/20040628/212669c1/Irixinstallofomegafails.eml
2007 Feb 08
1
Getting custom field data from the page through crawling
Now on to my next question.. I've got the search and indexing working well for now.. My next quest is to implement a system of creating custom fields in the index. Our site is fully dynamic. That is, every page is generated in PHP and there are enough different kinds of pages that I wouldn't want to get into the business of indexing the DB directly, so I think that using htdig to crawl
2004 Nov 22
3
Test builds for CYGWIN and IRIX?
I'm starting to prepare the next release. Since 0.8.3 I've made a number of changes to get working builds working on HPUX and OSF, and made some of the Windows specific bits more robust. I'd like to check that these haven't broken CYGWIN or IRIX builds, but I don't have access to these platforms. If you are able to test, it'd be most appreciated if you could. Download a
2003 Jul 07
1
Samba can't find my @groups
Hello! I've installed Samba as a PDC and followed all documentations I've found helpful on the internet! It looks like everything runs nicely now on my Debian server with Linux 2.4.21 and Samba 3.0beta1 except when it comes to groups. I have i.e. the following share: [Learning] path = /home/share/learning write list = @personal, @admin, @installer force group =
2007 Mar 28
2
Moving indextext.cc into core.
One of the items on the ToDo list for version 1.0 at http://wiki.xapian.org/TodoFor1_2e0#preview is: "Rework Omega's indextext.cc as a xapian-core "TextSplitter" class." I've been wondering about this for a while now. Currently, we have the Query Parser in Xapian core, but no text processing. Clearly, it makes sense to have a "text splitter" class in
2003 Jan 22
5
Proxy arp and pptp
Hi all! I''ve set up a Linux box with shorewall doing proxy arp as per http://www.shorewall.net/shorewall_setup_guide.htm#ProxyARP the 5.2 (non routed) example. Everything is working great except for one thing, and that leads me to my question: is there a conflict between proxy arp and pptp? I''ve set the apropriate ACCEPT rules to allow tcp port 1723 and protocol 47 to the host
2004 Nov 30
5
RE: [Shorewall-devel] SFTP
On Tue, 2004-11-30 at 12:17 +0700, Matthew Hodgett wrote: > > As for the 169.254 issue I tried to search the archives but got nothing. > I then tried to search on generic words, nothing. I then tried some > really common words like ''help'', ''initiated'', ''masq'' - nothing. I think > the index might be corrupt because I get no
2005 Dec 30
1
Query Parser, filenames and compound words
When I submit a filename to the query parser it breaks it up Example: /home/user/file_name.ext becomes Xapian::Query((home:(pos=1) PHRASE 5 user:(pos=2) PHRASE 5 file:(pos=3) PHRASE 5 name:(pos=4) PHRASE 5 ext:(pos=5))) which does not find the document. If I do an single term query not using the query parser then I find the document. The Query Parser also breaks up hyphenated terms
2010 Sep 28
4
Mailman - searchable archive
Mailman works well for our mailing lists, but the archive is unacceptable - the worst thing is lack of search function. I got one tip for this: 1) emails converted to html format with mhonarc 2) search can be done with htdig Opinions? Maybe there are better software solutions for this - I hope. - Jussi -- Jussi Hirvi * Green Spot Topeliuksenkatu 15 C * 00250 Helsinki * Finland Tel. +358 9
2003 Dec 01
3
search site for R (http://finzi.psych.upenn.edu)
My search site, http://finzi.psych.upenn.edu, has had several problems recently, all my fault, for which I apologize. But it now seems to be running reliably, on a new computer that is much faster than the old one. It uses htdig to permit search of the Rhelp mailing list, R documents, R functions, and various combinations of these. Search has several options, including Boolean search (with AND,
2002 May 05
1
possible changed organization of help files in 1.5.0?
I recently updated my search site at http://finzi.psych.upenn.edu It took about 7 hours with htDig, instead of the usual 3 hours. Now, when I search the "functions" index I get all sorts of non-html documents (indicated with brackets). These have never shown up before. Did htDig mogrify itself while it sat on my computer? (I didn't do anything to change it.) Or, more likely, I
2006 Apr 11
3
Robust Search Solution (with CentOS 4.3)
I've got about 10,000 docs I'd like to devise a search/index for. I found a perl script called Perlfect that can do that on an old P3 but at the astronomical time of 7 hours. Another script(cgi/perl) at hotscripts can do the same but allows the "rm -rf /" exploit. DoH!? Is there anything perl/flatfile that can search/index faster? This is a nice job for an aging P3 in the
2005 Oct 09
1
Mailman + htDig
Hello, Does anyone know if there is an rpm for mailman with the htdig integration? If not, how hard is to patch mailman to use this feature? Anyone have this working? TIA
2011 Jan 04
5
Webkit - can't find variable: mxn
In my app (v2.0.15) and on the mapstraction demo the first time I load the page in Safari I get the error "Can''t find variable: mxn" and the map doesn''t load. http://mapstraction.appspot.com/ mapstraction.googlecode.com/svn/trunk/source/mxn.core.js:6ReferenceError: Can''t find variable: mxn I''ve seen a few old posts in the list archives about similar
2011 Dec 02
1
Problems with scriptindex
Hi I am a beginner with Xapian. I am trying to use scriptindex but I get some problems : 1) If my script file is empty I have an indexation by default, and I get two output fields when I am using quest.exe (These fields are caption and sample) 2) I then tryed to not use an empty script file defining this time my own fields, but if I use quest to perform researches these fileds does not appear in
2007 Dec 13
1
Omega datevalue search fails during scriptindex flush
Hi, I'm experiencing weird behaviour in Xapian/Omega. Here's my situation: * I'm indexing data through scriptindex, the output is as follows: ***************** Replace: 6706476 Replace: 6706477 Replace: 6706478 ***************** Since there is no output after entering several enters, I assume scriptindex is indexing data. * The search I perform on omega is as follows: