similar to: Bug in agrep computing edit distance?

Displaying 20 results from an estimated 400 matches similar to: "Bug in agrep computing edit distance?"

2010 Nov 17
2
Bug in agrep computing edit distance?
I posted this yesterday to r-help and Ben Bolker suggested reposting it here... Dickison, Daniel <ddickison <at> carnegielearning.com> writes: > > The documentation for agrep says it uses the Levenshtein edit distance, > but it seems to get this wrong in certain cases when there is a > combination of deletions and substitutions. For example: > > >
2006 Jan 05
1
Pb with agrep()
Happy new year everybody, I'm getting the following while trying to use the agrep() function: > pattern <- "XXX" > subject <- c("oooooo", "oooXooo", "oooXXooo", "oooXXXooo") > max <- list(ins=0, del=0, sub=0) # I want exact matches only > agrep(pattern, subject, max=max) [1] 4 OK > max$sub <- 1 # One allowed
2012 Jan 19
1
bug en funcion 'agrep'
Estimados R-users: Estoy intentando usar la función 'agrep' para hacer búsquedas en cadenas de texto. El parámetro max.distance permite controlar la medida de aproximación de búsqueda de la función de Levenshtein. No obstante, cuando hago búsquedas específicas no obtengo siempre el resultado deseado y no se si es un bug o que no entiendo bien el algoritmo de búsqueda. Por
2012 Dec 11
1
How do you use agrep inside a loop
Hi all. This is my first message at R-help...so I'm hoping I have some beginner's luck and get some good help for my problem! FYI I have just started using R recently so my knowledge of R is pretty preliminary. Okay here is what I need help with - I need to know how to use agrep in a for loop. I need to compare elements of a vector of names with other elements of the same vector.
2004 Feb 11
6
AGREP
Hi all, I have two questions 1 - I have the version 1.4.1 of R, and it doesn't have the 'agrep' function in the base library. Is there a way to make this funcion avaliable in R 1.4.1? I mean, how to 'copy' it from R 1.8.1 and 'paste' it in R 1.4.1? 2 - The AGREP function doesn't give me the Levenshtein distance (edit distance). Is there a function in R that does
2010 Nov 09
1
agrep pmatch recursive???
Hello R Helpers, Business - 64 bit windows 7, R 2.11.1 I am trying to match the character contents of one list, called 'exclude', to those of a second list, called 'dataset' dataset is a list of file names with folder locations, and looks like this when called: > dataset [1] "A/10-10-29a-13.cdf" "A/10-10-29a-14.cdf" "A/10-10-29a-15.cdf"
2005 May 01
3
win32-dir 0.1.0 compile problems
I tried to download/compile/install win32-dir, but I couldn''t get it to go. Over a private email Daniel Berger had me... "Curious. What platform are you on exactly? Try modifying the extconf.rb file. Add ''have_library("SHFolder")'' above ''have_library("shell32")''. If that doesn''t work, try uncommenting the other
2004 Feb 13
1
RES: AGREP
"Marcos Sanches" <marcos.sanches at ipsos-opinion.com.br> wrote: Ls1<-length(s1) Ls2<-length(s2) for ( p in 1:ls1){ for (q in 1:ls2){ t1<-levenshteinFast(s1[p],s2[q]) ... Ls1=42000 Ls2=70000 I think I will wait for months untill this program ends. Do you have any sugestion to increase the speed? The first suggestion has to be "search HARD in the
2009 Jun 30
1
possible agrep bug; R 2.9.1, Mac OS X 10.5 (PR#13789)
Same behavior was noted on R 2.9.0 OS X and WIndows. > t(sapply(seq(0.01, .99, length=20), function(x) c(x, length(agrep("Staatssekretar im Bundeskanzleramt","Bundeskanzler",max.distance=x))))) [1,] 0.01000000 0 [2,] 0.06157895 0 [3,] 0.11315789 0 [4,] 0.16473684 0 [5,] 0.21631579 0 [6,] 0.26789474 0 [7,] 0.31947368 0 [8,] 0.37105263 0 [9,]
2008 Mar 13
1
R Finance
Hi, I am an R novice working with financial data. I am developing a portfolio strategy evaluation technique to back-test the performance of our screens; checking how the screened stock would've performed over the period in question. I am using quantmod in R to download the historical data from yahoo and then analyzing it using PerformanceAnalytics. My problem is that, as our screens are done
2001 Mar 13
2
Samba 2.2 CVS
Sure thing: cvs -d :pserver:cvs@pserver.samba.org:/cvsroot login <password is cvs > cvs -z5 -d :pserver:cvs@pserver.samba.org:/cvsroot co -r SAMBA_2_2 samba this will grab samba_2_2 and place it in a directory lableled samba. leaving out the SAMBA_2_2 grabs HEAD. you can get other modules as follows: SAMBA_2_2 HEAD APPLIANCE_HEAD APPLIANCE_TNG there are others but I haven't tried
2008 Aug 26
2
String search: Return "closest" match
Hi, I have to match names where names can be recorded with errors or additions. Now I am searching for a string search function which returns always the "closest" match. E.g. searching for "Washington" it should return only Washington but not Washington, D.C. But it also could be that the list contains only "Hamburg" but the record I am searching for is
2008 May 29
1
help (using ?) does not handle trailing whitespace (PR#11537)
> ?agrep > Results in: No documentation for 'agrep ' in specified packages and libraries: you could try 'help.search("agrep ")' There is white space after agrep, that ? doesn't ignore. --please do not edit the information below-- Version: platform = i486-pc-linux-gnu arch = i486 os = linux-gnu system = i486, linux-gnu status = major = 2 minor =
2005 Nov 02
2
[LLVMdev] LLVM 1.6 Release Branch
Tanya Lattner wrote: > >> 1. I'm still looking for volunteers to test MacOS X and Solaris. >> If you'd like to volunteer, please email the list to let us know. > > > I'll do minimal testing on Sparc. I'm not going to look into any > regressions though since no one has been really watching Sparc since I > graduated and I am sure there are
2010 Oct 18
3
remove numbers from string of characters
Greetings I want to remove numbers from a string of characters that identify sites so that I can merge two data frames. For example, a site in one frame is called "001a Frozen Niagara Entrance" whereas the same site in the other data frame is called "Frozen Niagara Entrance". It seems to me the easiest thing to do would be to remove the numbers from the first data
2003 Oct 15
2
help.search in trouble with R-patched ?
...unless its me missing something... > help.search("prompt", agrep=F) Error: couldn't find function ".class1" > traceback() 12: initialize(value, ...) 11: initialize(value, ...) 10: new("ObjectsWithPackage", value, package = pkg) 9: metaNameUndo(unique(these), prefix = "M", searchForm = searchForm) 8: methods:::getGenerics(ns) 7:
2005 Nov 07
0
[LLVMdev] LLVM 1.6 Release Branch
Everything builds fine on sparc. The configure script needs to be fixed though (see previous email). Sparc testing results: make check: # of expected passes 1189 # of expected failures 34 Regressions Single Source: None New Failures Single Source (new tests): 2005-05-12-Int64ToFP: llc,jit Regressions MultiSource: Applications/d/make_dparser: llc, cbe, jit
2011 Jul 14
1
(no subject)
Good Afternoon R Community, I often work with very large data bases and want to search for select cases by a particular word or numeric value. I created the following simple function to do just that. It searchs a particular column for the phrase and returns a data frame with the rows that contain that phrase (for a particular column). Search<-function(term, dataframe, column.name,
2013 Jan 14
4
Grabbing Specific Words from Content (basic text mining)
Hi all, Suppose I have a data frame with mixed content (name age and address). a<-"Name: John Smith Age: 35 Address: 32, street, sub, something" b<-data.frame(a) 1. The question is I want to extract the name age and address separately from this data frame (containing potentially more people). 2. Also just incase I have to deal with it how would the syntax change if I had
2008 Mar 03
2
esercizio
? stato filtrato un testo allegato il cui set di caratteri non era indicato... Nome: non disponibile Url: https://stat.ethz.ch/pipermail/r-help/attachments/20080303/c747f103/attachment.pl