2013 Apr 09
3
Question on Stopword Removal from a Cyrillic (Bulgarian) Text
...ncoding='CP1251',
fileEncoding='CP1251')
(I also tried the above with UTF-8 encoding on a UTF-8-saved file.)
I have my list of stop words written in a separate text file, one word
per line, which I read into R using the scan function:
stoplist <- scan(file='stoplist_ascii.txt',
                 what='character',
                 strip.white=TRUE,
                 blank.lines.skip=TRUE,
                 fileEncoding='CP1251',
                 encoding='CP1251')
(also tried with UTF-8 here on a correspondingly encoded file)
I currently only te...
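
For readers following the excerpt, here is a minimal sketch of one way to load a CP1251 stop word file and filter tokens against it. It is not the poster's code: it swaps the quoted `scan` call for `readLines` plus an explicit `iconv` to UTF-8, which avoids depending on the session locale. The three stop words, the sample sentence, and the temporary file are made up for illustration, and the sketch assumes the platform's iconv recognises the name "CP1251".

```r
# Hypothetical stop words (Bulgarian "i", "na", "za"), written with \u escapes
# so the script itself stays ASCII.
words <- c("\u0438", "\u043d\u0430", "\u0437\u0430")

# Build a CP1251-encoded demo file, one word per line.
path  <- tempfile(fileext = ".txt")
bytes <- iconv(paste(words, collapse = "\n"),
               from = "UTF-8", to = "CP1251", toRaw = TRUE)[[1]]
writeBin(c(bytes, as.raw(0x0a)), path)

# Read the raw lines, then convert explicitly so the strings are marked UTF-8.
stoplist <- iconv(readLines(path), from = "CP1251", to = "UTF-8")
stoplist <- stoplist[nzchar(stoplist)]   # drop blank lines, if any

# Hypothetical sample text: four space-separated tokens, one of them a stop word.
text   <- "\u0442\u043e\u0432\u0430 \u0435 \u0437\u0430 \u0442\u0435\u0441\u0442"
tokens <- strsplit(text, " ", fixed = TRUE)[[1]]

# Keep only tokens that are not in the stop list.
kept <- tokens[!(tokens %in% stoplist)]
print(kept)

unlink(path)
```

Converting everything to UTF-8 before matching means the stop list and the text are compared in the same encoding, which is a common source of silent mismatches with Cyrillic input.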