Displaying 20 results from an estimated 1000 matches similar to: "AGREP"
2004 Feb 13
1
RES: AGREP
"Marcos Sanches" <marcos.sanches at ipsos-opinion.com.br> wrote:
Ls1<-length(s1)
Ls2<-length(s2)
for ( p in 1:ls1){
for (q in 1:ls2){
t1<-levenshteinFast(s1[p],s2[q])
...
Ls1=42000
Ls2=70000
I think I will wait for months untill this program ends. Do you have any
sugestion to increase the speed?
The first suggestion has to be "search HARD in the
2010 Nov 16
1
Bug in agrep computing edit distance?
The documentation for agrep says it uses the Levenshtein edit distance,
but it seems to get this wrong in certain cases when there is a
combination of deletions and substitutions. For example:
> agrep("abcd", "abcxyz", max.distance=1)
[1] 1
That should've been a no-match. The edit distance between those strings
is 3 (1 substitution, 2 deletions), but agrep matches
2012 Jan 19
1
bug en funcion 'agrep'
Estimados R-users:
Estoy intentando usar la función 'agrep' para hacer búsquedas en
cadenas de texto. El parámetro max.distance permite controlar la
medida de aproximación de búsqueda de la función de Levenshtein. No
obstante, cuando hago búsquedas específicas no obtengo siempre el
resultado deseado y no se si es un bug o que no entiendo bien el
algoritmo de búsqueda. Por
2010 Nov 17
2
Bug in agrep computing edit distance?
I posted this yesterday to r-help and Ben Bolker suggested reposting it
here...
Dickison, Daniel <ddickison <at> carnegielearning.com> writes:
>
> The documentation for agrep says it uses the Levenshtein edit distance,
> but it seems to get this wrong in certain cases when there is a
> combination of deletions and substitutions. For example:
>
> >
2008 Aug 26
2
String search: Return "closest" match
Hi,
I have to match names where names can be recorded with errors or additions.
Now I am searching for a string search function which returns always the "closest" match. E.g. searching for "Washington" it should return only Washington but not Washington, D.C. But it also could be that the list contains only "Hamburg" but the record I am searching for is
2006 Jan 05
1
Pb with agrep()
Happy new year everybody,
I'm getting the following while trying to use the agrep() function:
> pattern <- "XXX"
> subject <- c("oooooo", "oooXooo", "oooXXooo", "oooXXXooo")
> max <- list(ins=0, del=0, sub=0) # I want exact matches only
> agrep(pattern, subject, max=max)
[1] 4
OK
> max$sub <- 1 # One allowed
2012 Dec 11
1
How do you use agrep inside a loop
Hi all.
This is my first message at R-help...so I'm hoping I have some beginner's
luck and get some good help for my problem!
FYI I have just started using R recently so my knowledge of R is pretty
preliminary.
Okay here is what I need help with - I need to know how to use agrep in a
for loop.
I need to compare elements of a vector of names with other elements of the
same vector.
2011 Aug 20
2
Pattern names matching
Dear R magic guys.. I have two tables (actually will be dataframes), both
with names to be matched.
The names on the first dataframe are from a study with antenatal visits on
some health centers here. It happens that we need the delivery info. And
half and some thing else of the women decided to delivery some where else
our health units. We managed to get the names from some other places but now
2006 Jan 10
5
matching country name tables from different sources
Hi,
Before I reinvent the wheel I wanted to kindly ask you for your opinion if there is a simple way to do it.
I want to merge a larger number of tables from different data sources in R and the matching criterium are country names. The tables are of different size and sometimes the country names do differ slightly.
Has anyone done this or any recommendation on what commands I
2004 Feb 09
5
Printting 'for' and 'while' indices
Sorry, I forgot to correct the message subject, so I am resending my
doubt it below:
Hi all!
I wrote a very basic program in R, which has some loops ('for' and
'while'). Is there a way to print the 'for' (or while) indice while the
program is running so that I am able to estimate where the program is
and how long it will last?
I mean, I want something like this:
2008 Jun 27
1
Similarity matching with probabilities
Hello,
It's just a strange coincidence that someone posted just very recently a
question about matching. I know there are several match function in the base
package (such as match, pmatch, charmatch, and the gsub etc) but I can't
seem to use them wisely to be able to get what I need.
suppose I have the following strings:
"tets"
"estt"
"rtes7"
2009 Jan 22
4
text vector clustering
Hi,
I am a new user of R using R 2.8.1 in windows 2003. I have a csv file with
single column which contain the 30,000 students names. There were typo
errors while entering this student names. The actual list of names is <
1000. However we dont have that list for keyword search.
I am interested in grouping/cluster these names as those which are
similar letter to letter. Are there any
2010 Nov 09
1
agrep pmatch recursive???
Hello R Helpers,
Business - 64 bit windows 7, R 2.11.1
I am trying to match the character contents of one list, called 'exclude', to those of a second list, called 'dataset'
dataset is a list of file names with folder locations, and looks like this when called:
> dataset
[1] "A/10-10-29a-13.cdf" "A/10-10-29a-14.cdf" "A/10-10-29a-15.cdf"
2009 Jun 30
1
possible agrep bug; R 2.9.1, Mac OS X 10.5 (PR#13789)
Same behavior was noted on R 2.9.0 OS X and WIndows.
> t(sapply(seq(0.01, .99, length=20), function(x) c(x, length(agrep("Staatssekretar im Bundeskanzleramt","Bundeskanzler",max.distance=x)))))
[1,] 0.01000000 0
[2,] 0.06157895 0
[3,] 0.11315789 0
[4,] 0.16473684 0
[5,] 0.21631579 0
[6,] 0.26789474 0
[7,] 0.31947368 0
[8,] 0.37105263 0
[9,]
2004 Feb 09
1
RConsole
I changed my R console configurations, for example, the letters
are white, the background is black, etc,... Then I saved this new
configuration. What should I do if I want to have this new configuration
everytime I open R?
TIA
2004 Feb 26
2
Structural Equation Model
Hello all!
I want to estimate parameters in a MIMIC model. I have one latent
variable (ksi), four reflexive indicators (y1, y2, y3 and y4) and four
formative indicators (x1, x2, x3, x4). Is there a way to do it in R? I
know there is the SEM library, but it seems not to be possible to
specify formative indicators, that is, observed exogenous variables
which causes the latent variable.
Thanks,
2007 Apr 07
3
string edit distance
I have a column of words, for example
"DOG"
"DOOG"
"GOD"
"GOOD"
"DOOR"
...
and I am interested in creating a matrix that contains the string
edit distances between each pair of words. I am this close -> ' '
<- to writing the algorithm myself (which will allow for different
variations on the string edit rules, indels,
2002 Jun 05
5
Monte Carlo
Doctor in veterinary medecine I have a degree in veterinary epidemiology
I would ask if someone can perform Monte Carlo simulation with R.
Thanks in advance.
Dr Kane Ismaila
i_kane at hotmail.com
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info",
2010 Dec 21
3
Performing basic Multiple Sequence Alignment in R?
Hello everyone,
I am not sure if this should go on the general R mailing list (for example,
if there is a text mining solution that might work here) or the bioconductor
mailing list (since I wasn't able to find a solution to my question on
searching their lists) - so this time I tried both, and in the future I'll
know better (in case it should go to only one of the two).
The task
2012 Nov 02
2
Merge data frame with mispelling characters
Hello dear R-helpers,
I'm working with R-2.15.2 on Windows 7 OS. I'm stucked with a merge of two
data frames by characters.
In each data frame I got two different list of names, that is my main-key to
be merged.
To figure out what I'm saying, I build up a modified "?merge" example, with
errors by purpose:
# Data for authors:
authors <- data.frame(
surname =