I was wondering if R had a data mining componant and how i could get it. If not do you know anyone who is developing a datamining "plug in" for R Phillip Goodreid -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
What "exactly" are you looking for? R has a package for neural network (nnet), and trees (rpart and/or tree). And some others. It really, I think, depend on what "component" you need. Cheers, Kevin ------------------------------------------------ Ko-Kang Kevin Wang Post Graduate PGDipSci Student Department of Statistics University of Auckland New Zealand www.stat.auckand.ac.nz/~kwan022 ----- Original Message ----- From: <Pgoodr1 at aol.com> To: <r-help at stat.math.ethz.ch> Sent: Thursday, September 05, 2002 5:53 PM Subject: [R] data mining for R> I was wondering if R had a data mining componant and how i could get it.If not do you know anyone who is developing a datamining "plug in" for R> Phillip Goodreid > -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-> r-help mailing list -- Readhttp://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html> Send "info", "help", or "[un]subscribe" > (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._. _._>-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Pgoodr1 at aol.com wrote:> > I was wondering if R had a data mining componant and how i could get it. > If not do you know anyone who is developing a datamining "plug in" for RThe best starting point would be to get a book on data mining and to look for a sec./chap. on data mining techniques. You should remark then that mainly all supervised and unsupervised techniques (except maybe association rules) are available in R by means of a corresponding R package. Saying this, you can see R as your wanted data mining "plug in". Why would you need a data mining component if everything is there? Sincerely, Diego Kuonen -- Dr. ?s sc. Diego Kuonen CEO & Applied Statistician Statoo Consulting, PO Box 107, 1015 Lausanne, Switzerland + Have you ever been Statooed? http://www.statoo.info + + Free Statooed newsletters: http://lists.statoo.com + -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Well, R does not have a `statistics' plug in either! In the words of Witten & Franke's book, Data Mining is `statistics plus marketing', and R can do a lot of data mining. If you could be more specififc about what techniques you want to use, we may be able to help you further. On Thu, 5 Sep 2002 Pgoodr1 at aol.com wrote:> I was wondering if R had a data mining componant and how i could get it. If not do you know anyone who is developing a datamining "plug in" for R > Phillip Goodreid-- Brian D. Ripley, ripley at stats.ox.ac.uk Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/ University of Oxford, Tel: +44 1865 272861 (self) 1 South Parks Road, +44 1865 272860 (secr) Oxford OX1 3TG, UK Fax: +44 1865 272595 -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
rpart, MARS (in the mda package), MART, and randomForests(maybe?) could be used for Data Mining. I'm working on a Boosting algorithm based on rpart that isn't quite finished. Tom Stockton At 01:53 AM 9/5/2002 -0400, Pgoodr1 at aol.com wrote:>I was wondering if R had a data mining componant and how i could get it. >If not do you know anyone who is developing a datamining "plug in" for R >Phillip Goodreid >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- >.-.- >r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >Send "info", "help", or "[un]subscribe" >(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ >._._-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
....in my humble opinion especially DecisionTrees, Rule-Extracting-Algorithms and Bayesian-Networks are the common methods in advanced and practical Data-Mining. The first one is possible in R with (Rpart) - the second and third one i miss, too but i'm on process and be attempt develope a package for association-rules (market-basket-analysis), but didn't know when i'm finished (hopefully end of the year with easy GUI's) ,because i jump between Java(Weka) and R. regards,Christian I was wondering if R had a data mining componant and how i could get it. If not do you know anyone who is developing a datamining "plug in" for R Phillip Goodreid -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Philippe, I think you have slightly misunderstood what data-mining is all about. Data-miners tend to see themselves applying expertise from at least three areas: statistics, machine learning ("AI"), and database theory. Most data-mining problems involve extracting or detecting useful information from huge or complex sources of data. In any case, a data-miner would certainly find S-Plus or R a valuable tool for investigating and solving data-mining problems. But a data-miner might also use techniques like "reinforcement learning", "inductive logic", "natural language processing" or "relational theory" to discover concepts and relationships which characterize and solve the problem. These techniques may not be interesting to a "pure" statistician. HTH, John Day Staff Scientist Computer Science Innovations, Melbourne, FL http://www.csi-inc.com/~jday At 04:16 PM 9/5/2002 +0200, Philippe wrote:>In the risk to be heavily critisized, one could mainly see data mining as a >pseudo-new concept invented to sell new (and sometimes, expensive) software >to industries. Data mining is nothing else than existing statistical >analyses optimized for speed in order to deal with millions of entries, or >even more, in a reasonable period of time. So, as it was suggested earlier >in this thread, methods probably exist already somewhere in R. On the >counterpart, R could not be optimized enough to deal with the huge dataset >usually manipulated by data mining software. >Best, > >Philippe Grosjean > >-----Message d'origine----- >De: owner-r-help at stat.math.ethz.ch >[mailto:owner-r-help at stat.math.ethz.ch]De la part de Peter Dalgaard BSA >Envoye: jeudi 5 septembre 2002 14:37 >A: Prof Brian Ripley >Cc: Pgoodr1 at aol.com; r-help at stat.math.ethz.ch >Objet: Re: [R] data mining for R > > >Prof Brian Ripley <ripley at stats.ox.ac.uk> writes: > > > Well, R does not have a `statistics' plug in either! > > > > In the words of Witten & Franke's book, Data Mining is `statistics plus > > marketing', and R can do a lot of data mining. > > > > If you could be more specififc about what techniques you want to use, we > > may be able to help you further. > > > > On Thu, 5 Sep 2002 Pgoodr1 at aol.com wrote: > > > > > I was wondering if R had a data mining componant and how i could get it. >If not do you know anyone who is developing a datamining "plug in" for R > > > Phillip Goodreid > >Another possible definition is "statistics with massive amounts of >incidental data". A large part of the DM practices seems to be >"quarrying". The actual statistical methodology is only a part of a >complicated process of getting data out of databases on a, say, weekly >schedule, roughly preprocessed, then fed to a statistics engine, and >postprocessed to something that can end up on the manager's desk. > >In my impression that is essentially what SPSS's Clementine product >does, using a GUI to draw arrows between pretty little hexagonal >cells. It is not at all unthinkable that something like that could be >coded up in R too. I think we have most of the pieces to do it. > >-- > O__ ---- Peter Dalgaard Blegdamsvej 3 > c/ /'_ --- Dept. of Biostatistics 2200 Cph. N > (*) \(*) -- University of Copenhagen Denmark Ph: (+45) 35327918 >~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk) FAX: (+45) 35327907 >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-. >-.- >r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >Send "info", "help", or "[un]subscribe" >(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._. >_._ > > > >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- >r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >Send "info", "help", or "[un]subscribe" >(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Hi , ..my intention is in main implement a fuzzy-Association-algorithms which works better , when you have many cases on category threshold's i.e. like age ! Open-Source apriori exist enough (use google for url) i.e. Apriori form Christian Borgelt ARTool (Java) Weka(Java) You have a variable age which some categories, i.e. one is 19-29 years ! Now you search for high supports in your dataset (frequent Itemsets). Now suppose in your data are many people with age 28 and 29 years. Now the calculations for frequent Itemsets with the age category 30 to 39 years is biased, because apriori counts not the people which are 28 and 29 years what's statistical correct but not a good interpretation of the associations between variables - and here works fuzzy !? ...like i'm mentioned i need until end of the year, because i have several other normal boring jobs regards,Christian SK?LA_Zdenek <skala at incoma.cz> schrieb am 06.09.02 08:32:31:> Could you kindly send me an info after your package will be available? > Thanks! > Zdenek Skala > INCOMA Research > skala at incoma.cz > > -----Original Message----- > From: chr.schulz at email.de [mailto:chr.schulz at email.de] > > ... i'm on process and be attempt develope a package > for association-rules (market-basket-analysis), but didn't know when i'm finished > (hopefully end of the year with easy GUI's) ,because i jump between Java(Weka) > and R. > > regards,Christian-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._