search for: stoplist_ascii

2013 Apr 09
3
Question on Stopword Removal from a Cyrillic (Bulgarian) Text
...ncoding='CP1251', fileEncoding='CP1251') (I also tried the above with UTF-8 encoding on a UTF-8-saved file.) I have my list of stop words written in a separate text file, one word per line, which I read into R using the scan function: stoplist<-scan(file='stoplist_ascii.txt', what='character', strip.white=TRUE, blank.lines.skip=TRUE, fileEncoding='CP1251', encoding='CP1251') (also tried with UTF-8 here on a correspondingly encoded file) I currently only te...