Displaying 20 results from an estimated 1000 matches similar to: "Regexpr. analyzer"
2006 Jul 07
4
How to add Asia token analyzer to ferret simply?
Hi,David
Can you give me an example of how to add analyzer to ferret to Asian
languages?
My web application will have to support multi language search,which
means,for example,both Chinese and English will be searched through the
form.
Currently,I have decided to use the simple token principles,which means
that every Chinese character will be a token,although this is not so
well in some
2007 Apr 29
1
Chinese full-text support! Still fail-_-
Hi all,
I want to use ferrent in my website but when i input chinese words, i
have the same symptom like Chengcai. In order to fix it, i have reviewed
all the topics about chinese support in our forum and tried all the way
your guys suggested but still made any progress. i downloaded the latest
version of ferret from svn.
Thanks and regards.
captain
Chengcai He wrote:
> Hello everyone!
2007 Apr 19
5
Chinese full text searching by acts_as_ferret?
How to add Chinese language full text searching function by using
acts_as_ferret?
RegExpAnalyzer.new(/./,false)
this analyzer, i don''t know how to use it!
does it works like this:
user searching---->acts_as_ferret---->ferret
????
--
Posted via http://www.ruby-forum.com/.
2006 Oct 31
3
No search results using Searcher
I just started using Ferret and I successfully indexed some documents. I
can search this index using the following code:
index = Index::Index.new(:path => path)
index.search_each("something") do |doc, score|
print "##{doc} #{index[doc][''url'']} - #{score}"
print "\n"
end
However, when I try to use Search::Searcher and QueryParser
2006 Sep 06
9
Which analyzer to use
Lucene''s standard analyzer splits words separater with underscores.
Ferret doesn''t do this. For example, if I create an index with only
document ''test_case'' and search for ''case'' it doesn''t find anything.
Lucene on the other hand finds it. The same story goes for words
separated by colons.
Which analyzer should I use to emulate
2007 May 18
3
issues with : in the content
Hi,
I''ve discovered ferret and aaf this evening, I''ve just done some tests
and it seems perfect for my needs.
I''m indexing text data (title, description, etc) and also ethernet
hardware addresses (MAC).
Sorry if that sounds trivial but I can''t find the way to correctly
index and achieve correct searches on MAC addresses.
If I do something like this:
index =
2006 Oct 18
2
install ferret on windows
Hi!
I''m trying to install ferret on windows, so I chose
ferret-0.10.9-mswin32.gem from the download page, as it includes a
already pre-compiled ferret_ext.so (is that correct?) and I don''t have
any C compiler here. The gem installed it correctly, but the test didn''t
work. So I called these commands:
rake ext
ruby setup.rb config
ruby setup.rb setup
ruby setup.rb
2007 May 02
4
Wrong total_hits when using conditions in find_by_contents
In my model Topic:
acts_as_ferret({ :fields => {:username => {:store => :yes, :boost =>
30}, :subject => {:store => :yes, :boost => 20}, :body => {:store =>
:yes, :boost => 10}}, :remote => true }, { :analyzer =>
Ferret::Analysis::RegExpAnalyzer.new(/./, false) })
def self.full_text_search(q, options = {}, find_options = {})
return nil if q.nil? or
2007 Apr 08
10
Ferret and non latin characters support
I''ve successfully installed ferret and acts_as_ferret and have no
problem with utf-8 for accented characters. It returns correct results
fot e.g. fran?ais. My problem is with non latin characters (Persian
indeed). I have tested different locales with no success both on Debian
and Mac. Any idea?
(ferret 0.11.4, acts_as_ferret 0.4.0, rails 1.1.6)
--
Posted via http://www.ruby-forum.com/.
2008 Apr 15
1
why does regexpr not work with '.'
Dear R Helpers,
I am running R 2.6.2 on a Windows XP machine.
I am trying to use regexpr to locate full stops in strings, but, without
success.
Here an example:-
f="a,b.c at d:" #define an arbitrary test string
regexpr(',',f) #find the occurrences of ',' in f - should be one at location
2
# and this is what regexpr finds
#[1] 2
2010 Jun 01
1
regexpr help (match.length=0)
R-help,
Sorry if this is more of a regex question than an R question. However,
help would be appreciated on my use of the regexpr function.
In the first example below, I ask for all characters (a-z) in 'abc123';
regexpr returns a 3-character match beginning at the first character.
> regexpr("[[:alpha:]]*", "abc123")
[1] 1
attr(,"match.length")
[1] 3
2010 May 05
1
extracting a matched string using regexpr
Given a text like
I want to be able to extract a matched regular expression from a piece of
text.
this apparently works, but is pretty ugly
# some html
test<-"</tr><tr><th>88958</th><th>Abcdsef</th><th>67.8S</th><th>68.9\nW</th><th>26m</th>"
# a pattern to extract 5 digits
> pattern<-"[0-9]{5}"
#
2007 Jun 29
2
regexpr
Hi,
I 'd like to match each member of a list to a target string, e.g.
------------------------------
mylist=c("MN","NY","FL")
g=regexpr(mylist[1], "Those from MN:")
if (g>0)
{
"On list"
}
------------------------------
My question is:
How to add an end-of-string symbol '$' to the to-match string? so that 'M'
won't
2004 Feb 06
3
a grep/regexpr problem
Hi,
I'm trying to parse lines of the form:
dan001.hin (0): fingerprint={256, 411, 426, 947, 973, 976}
What I need is the sequence of number between {}. I'm using grep as
match <- grep("{([0-9,\s]*)}",s,perl=T,value=T)
where s is a character vector.
But all I get is the whole string s. I tried using regexpr in an attempt
to get just the sequence I wanted:
match <-
2011 Sep 29
2
String manipulation with regexpr, got to be a better way
Help-Rs,
I'm doing some string manipulation in a file where I converted a string date in mm/dd/yyyy format and returned the date yyyy.
I've used regexpr (hat tip to Gabor G for a very nice earlier post on this function) in steps (I've un-nested the code and provided it and an example of what I did below. My question is: is there a more efficient way to do this. Specifically is
2008 Oct 01
1
regexpr syntax question
Greetings R list,
I am stuck on a simple syntax problem. I want to list all files in a directory, excluding files of a certain type. I have tried pattern matching as follows:
a <- list.files(data, full.name = TRUE, pattern != ".xml") # exclude all .xml files
The warning returns that my syntax is incorrect. I have read the regexpr help files and search old posts to no
2005 Aug 03
2
regexpr and portability issue
Dear all--
I am still forging my first arms with R and I am fighting with regexpr() as
well as portability between unix and windoz. I need to extract barcodes from
filenames (which are located between a double and single underscore) as well
as the directory where the filename is residing. Here is the solution I came
to:
aFileName <-
2010 Jun 02
2
regexpr mystery can not remove trailing spaces
Dear all
I encountered strange problem with regexpr replacement
I made this character object
str <- "02.06.10 12:40 "
> str(str)
chr "02.06.10 12:40 "
I read in an object which seems to be quite similar
> str(as.character(becva$V1)[1])
chr "02.06.10 12:40 "
However I can not remove trailing spaces from it
> sub(' +$',
2012 Aug 06
5
regexpr with accents
Hello,
I have build a syntax to find out if a given substring is included in a larger string that works like this:
d1$V1[regexpr("some text = 9",d1$V2)>0] <- 9
and this works all right till "some text" contains standard ASCII set. However, it does not work when accents are included as the following:
d1$V1[regexpr("some t?xt = 9",d1$V2)>0] <- 9
I have
2003 Aug 13
7
Regexpr with "."
I'm trying to use the regexpr function to locate the decimal in a character
string. Regardless of the position of the decimal, the function returns 1.
For example,
> regexpr(".", "Female.Alabama")
[1] 1
attr(,"match.length")
[1] 1
In trying to figure out what was going on here, I tried the below command:
> gsub(".", ",",