search for: tokenize_words

Displaying 2 results from an estimated 2 matches for "tokenize_words".

2024 Nov 25
1
Problemas usando paquete textreuse
...e, pero para ello he empleado el siguiente código. library(pdftools) library(textreuse) text1 <- pdf_text("uno.pdf") text2 <- pdf_text("dos.pdf") full_text1 <- paste(text1, collapse = " ") full_text2 <- paste(text2, collapse = " ") a <- tokenize_words(full_text1) b <- tokenize_words(full_text2) jaccard_similarity(a, b) Gracias [[alternative HTML version deleted]]
2024 Nov 26
0
Resumen de R-help-es, Vol 187, Envío 10
...rary(pdftools) > > library(textreuse) > > text1 <- pdf_text("uno.pdf") > > text2 <- pdf_text("dos.pdf") > > full_text1 <- paste(text1, collapse = " ") > > full_text2 <- paste(text2, collapse = " ") > > a <- tokenize_words(full_text1) > > b <- tokenize_words(full_text2) > > jaccard_similarity(a, b) > > > Gracias > > [[alternative HTML version deleted]] > > > > > ------------------------------ > > Subject: Pié de página del digest > > __________________________...