Displaying 2 results from an estimated 2 matches for "tokenize_words".
2024 Nov 25
1
Problems using the textreuse package
...e,
but to do so I used the following code.
library(pdftools)
library(textreuse)
text1 <- pdf_text("uno.pdf")
text2 <- pdf_text("dos.pdf")
full_text1 <- paste(text1, collapse = " ")
full_text2 <- paste(text2, collapse = " ")
a <- tokenize_words(full_text1)
b <- tokenize_words(full_text2)
jaccard_similarity(a, b)
Thanks
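For readers unfamiliar with the measure computed in the last line of the posted code, here is a minimal base-R sketch of a word-level Jaccard similarity: the number of distinct words shared by two texts divided by the number of distinct words in either. It is an illustration only, not the textreuse implementation; simple_words() and jaccard() are hypothetical helpers written for this example, and the two input strings are invented.

```r
# Hypothetical helper: lowercase the text and split it on runs of
# non-alphanumeric characters. This is NOT textreuse's tokenizer.
simple_words <- function(x) {
  words <- unlist(strsplit(tolower(x), "[^[:alnum:]]+"))
  words[nzchar(words)]  # drop empty strings left by leading separators
}

# Hypothetical helper: Jaccard similarity of two character vectors,
# treating each as a set of distinct words.
jaccard <- function(a, b) {
  a <- unique(a)
  b <- unique(b)
  length(intersect(a, b)) / length(union(a, b))
}

a <- simple_words("The quick brown fox")
b <- simple_words("The lazy brown dog")

# Shared words: "the", "brown" (2); distinct words overall: 6
jaccard(a, b)  # 2 / 6
```

Because both inputs are reduced to sets, word order and repeated words do not affect the result, which is worth keeping in mind when comparing whole PDF documents pasted into a single string.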
2024 Nov 26
0
R-help-es Digest, Vol 187, Issue 10
...rary(pdftools)
> library(textreuse)
> text1 <- pdf_text("uno.pdf")
> text2 <- pdf_text("dos.pdf")
> full_text1 <- paste(text1, collapse = " ")
> full_text2 <- paste(text2, collapse = " ")
> a <- tokenize_words(full_text1)
> b <- tokenize_words(full_text2)
> jaccard_similarity(a, b)
>
> Thanks
>
> ------------------------------
>
> Subject: Digest footer
>
> __________________________...