Dear Lists, What is the appropriate software package for dumping say 20 PDFS in a folder, then creating data visualization with frequency counts of certain words as well as measure correlation within each file for certain key relationships or key words. I am doing text analysis of biases in enterprise software sponsored publications- and need to come up with a statistical threshold. Regards, Ajay Ohri Websites- http://decisionstats.com
Karl Ove Hufthammer
2011-May-18 08:14 UTC
[R] text mining analysis and word visualization of pdfs
Ajay Ohri wrote:> What is the appropriate software package for dumping say 20 PDFS in a > folder, then creating data visualization with frequency counts of > certain words as well as measure correlation within each file for > certain key relationships or key words.pdftotext + Unix? for Poets + R (ggplot2) HTH. -- Karl Ove Hufthammer