Displaying 20 results from an estimated 400 matches similar to: "Bug in agrep computing edit distance?"
2010 Nov 17
2
Bug in agrep computing edit distance?
I posted this yesterday to r-help and Ben Bolker suggested reposting it
here...
Dickison, Daniel <ddickison <at> carnegielearning.com> writes:
>
> The documentation for agrep says it uses the Levenshtein edit distance,
> but it seems to get this wrong in certain cases when there is a
> combination of deletions and substitutions. For example:
>
> >
2006 Jan 05
1
Pb with agrep()
Happy new year everybody,
I'm getting the following while trying to use the agrep() function:
> pattern <- "XXX"
> subject <- c("oooooo", "oooXooo", "oooXXooo", "oooXXXooo")
> max <- list(ins=0, del=0, sub=0) # I want exact matches only
> agrep(pattern, subject, max=max)
[1] 4
OK
> max$sub <- 1 # One allowed
2012 Jan 19
1
bug en funcion 'agrep'
Estimados R-users:
Estoy intentando usar la función 'agrep' para hacer búsquedas en
cadenas de texto. El parámetro max.distance permite controlar la
medida de aproximación de búsqueda de la función de Levenshtein. No
obstante, cuando hago búsquedas específicas no obtengo siempre el
resultado deseado y no se si es un bug o que no entiendo bien el
algoritmo de búsqueda. Por
2012 Dec 11
1
How do you use agrep inside a loop
Hi all.
This is my first message at R-help...so I'm hoping I have some beginner's
luck and get some good help for my problem!
FYI I have just started using R recently so my knowledge of R is pretty
preliminary.
Okay here is what I need help with - I need to know how to use agrep in a
for loop.
I need to compare elements of a vector of names with other elements of the
same vector.
2004 Feb 11
6
AGREP
Hi all, I have two questions
1 - I have the version 1.4.1 of R, and it doesn't have the 'agrep'
function in the base library. Is there a way to make this funcion
avaliable in R 1.4.1? I mean, how to 'copy' it from R 1.8.1 and 'paste'
it in R 1.4.1?
2 - The AGREP function doesn't give me the Levenshtein distance (edit
distance). Is there a function in R that does
2010 Nov 09
1
agrep pmatch recursive???
Hello R Helpers,
Business - 64 bit windows 7, R 2.11.1
I am trying to match the character contents of one list, called 'exclude', to those of a second list, called 'dataset'
dataset is a list of file names with folder locations, and looks like this when called:
> dataset
[1] "A/10-10-29a-13.cdf" "A/10-10-29a-14.cdf" "A/10-10-29a-15.cdf"
2005 May 01
3
win32-dir 0.1.0 compile problems
I tried to download/compile/install win32-dir, but I couldn''t get it to
go. Over a private email Daniel Berger had me...
"Curious. What platform are you on exactly? Try
modifying the extconf.rb file. Add
''have_library("SHFolder")'' above
''have_library("shell32")''. If that doesn''t work, try
uncommenting the other
2004 Feb 13
1
RES: AGREP
"Marcos Sanches" <marcos.sanches at ipsos-opinion.com.br> wrote:
Ls1<-length(s1)
Ls2<-length(s2)
for ( p in 1:ls1){
for (q in 1:ls2){
t1<-levenshteinFast(s1[p],s2[q])
...
Ls1=42000
Ls2=70000
I think I will wait for months untill this program ends. Do you have any
sugestion to increase the speed?
The first suggestion has to be "search HARD in the
2009 Jun 30
1
possible agrep bug; R 2.9.1, Mac OS X 10.5 (PR#13789)
Same behavior was noted on R 2.9.0 OS X and WIndows.
> t(sapply(seq(0.01, .99, length=20), function(x) c(x, length(agrep("Staatssekretar im Bundeskanzleramt","Bundeskanzler",max.distance=x)))))
[1,] 0.01000000 0
[2,] 0.06157895 0
[3,] 0.11315789 0
[4,] 0.16473684 0
[5,] 0.21631579 0
[6,] 0.26789474 0
[7,] 0.31947368 0
[8,] 0.37105263 0
[9,]
2008 Mar 13
1
R Finance
Hi,
I am an R novice working with financial data. I am developing a
portfolio strategy evaluation technique to back-test the performance
of our screens; checking how the screened stock would've performed
over the period in question.
I am using quantmod in R to download the historical data from yahoo
and then analyzing it using PerformanceAnalytics. My problem is that,
as our screens are done
2001 Mar 13
2
Samba 2.2 CVS
Sure thing:
cvs -d :pserver:cvs@pserver.samba.org:/cvsroot login
<password is cvs >
cvs -z5 -d :pserver:cvs@pserver.samba.org:/cvsroot co -r SAMBA_2_2 samba
this will grab samba_2_2 and place it in a directory lableled samba.
leaving out the SAMBA_2_2 grabs HEAD.
you can get other modules as follows:
SAMBA_2_2
HEAD
APPLIANCE_HEAD
APPLIANCE_TNG
there are others but I haven't tried
2008 Aug 26
2
String search: Return "closest" match
Hi,
I have to match names where names can be recorded with errors or additions.
Now I am searching for a string search function which returns always the "closest" match. E.g. searching for "Washington" it should return only Washington but not Washington, D.C. But it also could be that the list contains only "Hamburg" but the record I am searching for is
2008 May 29
1
help (using ?) does not handle trailing whitespace (PR#11537)
> ?agrep
>
Results in:
No documentation for 'agrep ' in specified packages and libraries:
you could try 'help.search("agrep ")'
There is white space after agrep, that ? doesn't ignore.
--please do not edit the information below--
Version:
platform = i486-pc-linux-gnu
arch = i486
os = linux-gnu
system = i486, linux-gnu
status =
major = 2
minor =
2005 Nov 02
2
[LLVMdev] LLVM 1.6 Release Branch
Tanya Lattner wrote:
>
>> 1. I'm still looking for volunteers to test MacOS X and Solaris.
>> If you'd like to volunteer, please email the list to let us know.
>
>
> I'll do minimal testing on Sparc. I'm not going to look into any
> regressions though since no one has been really watching Sparc since I
> graduated and I am sure there are
2010 Oct 18
3
remove numbers from string of characters
Greetings
I want to remove numbers from a string of characters that identify
sites so that I can merge two data frames. For example, a site in one
frame is called "001a Frozen Niagara Entrance" whereas the same site in
the other data frame is called "Frozen Niagara Entrance". It seems to me
the easiest thing to do would be to remove the numbers from the first data
2003 Oct 15
2
help.search in trouble with R-patched ?
...unless its me missing something...
> help.search("prompt", agrep=F)
Error: couldn't find function ".class1"
> traceback()
12: initialize(value, ...)
11: initialize(value, ...)
10: new("ObjectsWithPackage", value, package = pkg)
9: metaNameUndo(unique(these), prefix = "M", searchForm = searchForm)
8: methods:::getGenerics(ns)
7:
2005 Nov 07
0
[LLVMdev] LLVM 1.6 Release Branch
Everything builds fine on sparc. The configure script needs to be fixed
though (see previous email).
Sparc testing results:
make check:
# of expected passes 1189
# of expected failures 34
Regressions Single Source:
None
New Failures Single Source (new tests):
2005-05-12-Int64ToFP: llc,jit
Regressions MultiSource:
Applications/d/make_dparser: llc, cbe, jit
2011 Jul 14
1
(no subject)
Good Afternoon R Community,
I often work with very large data bases and want to search for select cases by a particular word or numeric value. I created the following simple function to do just that. It searchs a particular column for the phrase and returns a data frame with the rows that contain that phrase (for a particular column).
Search<-function(term, dataframe, column.name,
2013 Jan 14
4
Grabbing Specific Words from Content (basic text mining)
Hi all,
Suppose I have a data frame with mixed content (name age and address).
a<-"Name: John Smith Age: 35 Address: 32, street, sub, something"
b<-data.frame(a)
1. The question is I want to extract the name age and
address separately from this data frame (containing potentially more
people).
2. Also just incase I have to deal with it how would the syntax change if I
had
2008 Mar 03
2
esercizio
? stato filtrato un testo allegato il cui set di caratteri non era
indicato...
Nome: non disponibile
Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080303/c747f103/attachment.pl