ziad.elmously at tnsglobal.com
2014-Sep-24 11:30 UTC
[R] Text Mining in Non English Speaking Countries
Hello All, I am interested in conducting text mining in languages other English. My understanding is the following R packages can analyze alternative (to English) languages: 1. "topicmodels" 2. "snowball" 3. "tm" Can anyone confirm? Specifically, I am interested in Hindi and Chinese (2 or so most popular dialects). If so, can you recommend relevant documentation and share your experiences with these packages. Thank you in advance. Ziad Elmously http://www.kantar.com/disclaimer.html [[alternative HTML version deleted]]
I used already with portuguese. No problems. Flavio Barros www.flaviobarros.net <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.flaviobarros.net&sn=> [image: Facebook] <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.facebook.com%2Fflavio.barros.1650%3Fref%3Dtn_tnmn&sn=> [image: LinkedIn] <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.linkedin.com%2Fprofile%2Fview%3Fid%3D61839390%26trk%3Dtab_pro&sn=> [image: about.me] <http://s.wisestamp.com/links?url=http%3A%2F%2Fabout.me%2Fflavio_barros&sn=> Contact me: [image: Google Talk] flaviomargarito at gmail.com ?"We are not victims by nature...we are programmed to be victims...for good reason...if we truly embraced our power, we would never be controlled. Live WISE~" - Gail Blackman <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.quotesdaddy.com%2Fquote%2F1403644%2Fgail-blackman%2Fwe-are-not-victims-by-naturewe-are-programmed-to-be&sn=> ? Get this email app! <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.wisestamp.com%2Fapps%2Fquotes%3Futm_source%3Dextension%26utm_medium%3Demail%26utm_term%3Dquotes%26utm_campaign%3Dapps&sn=> [image: WordPress Blog Posts] <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.flaviobarros.net&sn=>My latest post:Data Preparation ? Part II <http://s.wisestamp.com/links?url=http%3A%2F%2Ffeedproxy.google.com%2F~r%2FFlavioBarros%2F~3%2F9MTu1M40mhE%2F&sn=> Read more <http://s.wisestamp.com/links?url=http%3A%2F%2Ffeedproxy.google.com%2F~r%2FFlavioBarros%2F~3%2F9MTu1M40mhE%2F&sn=> | My blog <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.flaviobarros.net&sn=> [image: Share on Facebook] <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.facebook.com%2Fsharer.php%3Fu%3Dhttp%253A%252F%252Ffeedproxy.google.com%252F~r%252FFlavioBarros%252F~3%252F9MTu1M40mhE%252F&sn=> [image: Share on Twitter] <http://s.wisestamp.com/links?url=https%3A%2F%2Ftwitter.com%2Fintent%2Ftweet%3Ftext%3DData%2520Preparation%2520%25E2%2580%2593%2520Part%2520II%2520%2520(via%2520%2540wisestamp)&sn=> Get this email app! <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.wisestamp.com%2Fapps%2Fwordpress%3Futm_source%3Dextension%26utm_medium%3Demail%26utm_term%3Dwordpress%26utm_campaign%3Dapps&sn=> <http://s.wisestamp.com/links?url=http%3A%2F%2Fwww.linkedin.com%2Fin%2F&sn=> Create your free signature: <http://s.wisestamp.com/links?url=http%3A%2F%2Fr1.wisestamp.com%2Fr%2Flanding%3Fpromo%3D33%26dest%3Dhttp%253A%252F%252Fwww.wisestamp.com%252Femail-install%253Futm_source%253Dextension%2526utm_medium%253Demail%2526utm_campaign%253Dpromo_33&sn=> CLICK HERE! <http://s.wisestamp.com/links?url=http%3A%2F%2Fr1.wisestamp.com%2Fr%2Flanding%3Fpromo%3D33%26amp%3Bdest%3Dhttp%253A%252F%252Fwww.wisestamp.com%252Femail-install%253Futm_source%253Dextension%2526utm_medium%253Demail%2526utm_campaign%253Dpromo_33&sn=> ? On Wed, Sep 24, 2014 at 8:30 AM, <ziad.elmously at tnsglobal.com> wrote:> Hello All, > > I am interested in conducting text mining in languages other English. My > understanding is the following R packages can analyze alternative (to > English) languages: > > > 1. "topicmodels" > > 2. "snowball" > > 3. "tm" > > Can anyone confirm? Specifically, I am interested in Hindi and Chinese (2 > or so most popular dialects). If so, can you recommend relevant > documentation and share your experiences with these packages. > > Thank you in advance. > > Ziad Elmously > > > > > > http://www.kantar.com/disclaimer.html > > > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help at r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. >[[alternative HTML version deleted]]