Hi everyone,
Could somebody explain to me what is the package/function for
classification analysis. I am performing analysis of music files in the form
of MIDI files. I end up with about 750 dependent variables from the
analysis, I also have a number of independent/grouping variables that I set
manually. What I would like is to be able to predict which group a
particular MIDI files belongs to given the 750 dependent variables. In order
to this I have to perform classification analysis on a sample set of MIDI
files where I know what group they belong to. I want to extract the
'classification rule' that would enable me to predict the group of each
MIDI
file (there would be a different classification rule for each grouping
variable). Can anybody explain what is the best way of doing this in R. What
is the best package/function that would enable me to perform classification
analysis.
Any help would be greatly appreciated.
Many Thanks For Your Help!
Rishabh
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
This is a very general problem and a very large area of statistics/computer
science/etc is concerned with it. R provides lots of possibilities; you
might find tree-based approaches (recursive partitioning) to suit your
needs; in that case, rpart and the new random forests package will be of
interest. Also see package e1071 and the VR packages for starters. There are
lot of other possibilities; you might want to have a look at Ripley, Pattern
Recognition and Neural Networks, for example, to see some.
Reid Huntsinger
-----Original Message-----
From: Rishabh Gupta [mailto:rg117 at ohm.york.ac.uk]
Sent: Tuesday, April 16, 2002 11:14 AM
To: r-help at stat.math.ethz.ch
Subject: [R] Classification Analysis
Hi everyone,
Could somebody explain to me what is the package/function for
classification analysis. I am performing analysis of music files in the form
of MIDI files. I end up with about 750 dependent variables from the
analysis, I also have a number of independent/grouping variables that I set
manually. What I would like is to be able to predict which group a
particular MIDI files belongs to given the 750 dependent variables. In order
to this I have to perform classification analysis on a sample set of MIDI
files where I know what group they belong to. I want to extract the
'classification rule' that would enable me to predict the group of each
MIDI
file (there would be a different classification rule for each grouping
variable). Can anybody explain what is the best way of doing this in R. What
is the best package/function that would enable me to perform classification
analysis.
Any help would be greatly appreciated.
Many Thanks For Your Help!
Rishabh
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.
-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._.
_._
------------------------------------------------------------------------------
Notice: This e-mail message, together with any attachments, contains
information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA) that
may be confidential, proprietary copyrighted and/or legally privileged, and is
intended solely for the use of the individual or entity named in this message.
If you are not the intended recipient, and have received this message in error,
please immediately return this by e-mail and then delete it.
=============================================================================
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Rishabh, Software which extract rules from data are my main "WorkingField" but it's to bad that in present nothing rule generating algorithm (package) exist. I do my best to get R programming skills until end2002 for my contribution in data-mining package development,because R is a great basic for this. Some nice free-software ressources ( not R ) for this purpose are: http://www.cs.waikato.ac.nz/ml/weka/ (look C4.5Part for rule generating) http://fuzzy.cs.uni-magdeburg.de/~borgelt/software.html#dtree (that's for you probably interesting, because it's very fast C4.5 implementation and you can use in lot of way prune to reduce the complexity 750 variables!) http://fuzzy.cs.uni-magdeburg.de/nefclass/nefclass.html Or my present favorit from J.Mortensen (it's great) when you want analyse your data with fuzzy strategies.......... http://inet.uni2.dk/~jemor/jfs.htm P.S. One question - i make electronic music with ComputerSoftware,too and i'm statistican - what are the 750 variables and what is your classification criterium ? (if it is not a secret !) regards,christian Am 16.04.2002 17:13:39, schrieb "Rishabh Gupta" <rg117 at ohm.york.ac.uk>:>Hi everyone, > Could somebody explain to me what is the package/function for >classification analysis. I am performing analysis of music files in the form >of MIDI files. I end up with about 750 dependent variables from the >analysis, I also have a number of independent/grouping variables that I set >manually. What I would like is to be able to predict which group a >particular MIDI files belongs to given the 750 dependent variables. In order >to this I have to perform classification analysis on a sample set of MIDI >files where I know what group they belong to. I want to extract the >'classification rule' that would enable me to predict the group of each MIDI >file (there would be a different classification rule for each grouping >variable). Can anybody explain what is the best way of doing this in R. What >is the best package/function that would enable me to perform classification >analysis. > >Any help would be greatly appreciated. > >Many Thanks For Your Help! > >Rishabh > >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- >r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >Send "info", "help", or "[un]subscribe" >(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ >______________________________________________________________________________ 100 MB und noch mehr gute Gr?nde! Jetzt anmelden und profitieren. Da ist mehr f?r Sie drin unter http://club.web.de/?mc=021103 -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
really interesting! is there some url/docs for more information or are this your own thoughts - maybe for Ph.D thesis ? Am 17.04.2002 15:22:08, schrieb "Rishabh Gupta" <rg117 at ohm.york.ac.uk>:>Thanks For your reply >I would really prefer to use R as I am using R for all the other exploratory analysis that I am doing on the dataIn my humble opinion in R the package(Rpart) is good starting point for this, but in fact you get no really rules. The result are more sub-segmentation's which habit different with the classification (dependent) variable. But you can formulate rules on the basis of the results in a sophisticated way.......... regards,christian> >P.S. >One question - i make electronic music with ComputerSoftware,too >and i'm statistican - what are the 750 variables and what is your >classification criterium ? (if it is not a secret !) >regards,christian > >I have a large of variables (750) because I have a lot of version of each >kind of data. I analyse melody, rhythm, harmony, etc as a different stream >of data. For each one, I calculate a different versions, intervals, contour, >modulo, modulo of interval, etc. For each one of those I perform a different >kind of analysis, basic statistics, complexity measurements, etc. For each >kind of analysis there are five results each. E.g. for statistical, there is >mean, SD, MAD, COV, etc. >After performing all these different conversions/analysis I end up with >about 750 variables (6 * 5 * 5 * 5) >The classification groups are different depending on what set of MIDI files >I am analysing. In some sets the criteria is genre, subject (who played the >piece), composer, extract (which piece of music), etc. > > >______________________________________________________________________________ 100 MB und noch mehr gute Gr?nde! Jetzt anmelden und profitieren. Da ist mehr -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Rishabh, yes your "PseudoCode" idea is what i want in R,too. But even i mentioned , i didn't know of something like that in R/S-plus and i have got open eyes for this. Perhaps anybody in the whole R community work on machineLearning-algorithm with rule generating,too ??? So you must program a function for yourself (i.e. C4.5 or FS-ID3) !? (If you are interesting for papers (pdf) with algorithm let me know). regards,Christian Am 17.04.2002 15:13:47, schrieb "Rishabh Gupta" <rg117 at ohm.york.ac.uk>:>Thanks for your reply. >I am still learning these aspects of statistical analysis, so if I don't >make sense please forgive me. >All this work I am doing is part of a Phd and my deparment does not really >neural network for situations like these because they say it acts like a >black box and we don't really know what is happening in the inside. So I >would like to concentrate on using pure statistical techniques. Ideally what >I would like is some kind of a function that calculates the "classification >rule" when given a grouped data set : > rule <- classification( GroupVar ~ DepVar1 + DepVar2 + DepVar3 + DepVar4 >..... + ...... DepVarX ) > >Then I would be able to use that rule and apply to a single data element for >which the group is not known: > > theGroup <- rule( DataElement ) > >I have looked at the rpart package in R but I am not entirely sure how to >use in a way that I can create the "classification rule" and then use that >rule. I understand that it is a general problem but it's made more difficult >for me because I have to deal with 750 dependent variables. > >Your help is greatly appreciated. > >Many Thanks > >Rishabh > >----- Original Message ----- >From: "Huntsinger, Reid" <reid_huntsinger at merck.com> >To: "'Rishabh Gupta'" <rg117 at ohm.york.ac.uk>; <r-help at stat.math.ethz.ch> >Sent: Tuesday, April 16, 2002 5:00 PM >Subject: RE: [R] Classification Analysis > > >> This is a very general problem and a very large area of >statistics/computer >> science/etc is concerned with it. R provides lots of possibilities; you >> might find tree-based approaches (recursive partitioning) to suit your >> needs; in that case, rpart and the new random forests package will be of >> interest. Also see package e1071 and the VR packages for starters. There >are >> lot of other possibilities; you might want to have a look at Ripley, >Pattern >> Recognition and Neural Networks, for example, to see some. >> >> Reid Huntsinger >> >> -----Original Message----- >> From: Rishabh Gupta [mailto:rg117 at ohm.york.ac.uk] >> Sent: Tuesday, April 16, 2002 11:14 AM >> To: r-help at stat.math.ethz.ch >> Subject: [R] Classification Analysis >> >> >> Hi everyone, >> Could somebody explain to me what is the package/function for >> classification analysis. I am performing analysis of music files in the >form >> of MIDI files. I end up with about 750 dependent variables from the >> analysis, I also have a number of independent/grouping variables that I >set >> manually. What I would like is to be able to predict which group a >> particular MIDI files belongs to given the 750 dependent variables. In >order >> to this I have to perform classification analysis on a sample set of MIDI >> files where I know what group they belong to. I want to extract the >> 'classification rule' that would enable me to predict the group of each >MIDI >> file (there would be a different classification rule for each grouping >> variable). Can anybody explain what is the best way of doing this in R. >What >> is the best package/function that would enable me to perform >classification >> analysis. >> >> Any help would be greatly appreciated. >> >> Many Thanks For Your Help! >> >> Rishabh >> >> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-. >-. >> -.- >> r-help mailing list -- Read >http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >> Send "info", "help", or "[un]subscribe" >> (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >> >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._. >> _._ >> >> >> -------------------------------------------------------------------------- >---- >> Notice: This e-mail message, together with any attachments, contains >information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA) that >may be confidential, proprietary copyrighted and/or legally privileged, and >is intended solely for the use of the individual or entity named in this >message. If you are not the intended recipient, and have received this >message in error, please immediately return this by e-mail and then delete >it. >> >> >===========================================================================>=>> >> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-. >-.-.- >> r-help mailing list -- Read >http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >> Send "info", "help", or "[un]subscribe" >> (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >> >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._. >_._ >> > >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- >r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >Send "info", "help", or "[un]subscribe" >(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ >________________________________________________________________ Keine verlorenen Lotto-Quittungen, keine vergessenen Gewinne mehr! -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
rpart will calculate the classification rule (represented as a tree) given a
grouped data set. predict.rpart will apply that rule to a single data
element (or data set) for which groupings are not known. path.rpart will
return the rules explicitly, if that's needed. Try the examples from
help("rpart"), help("predict.rpart") and
help("path.rpart") to see.
Reid Huntsinger
-----Original Message-----
From: ozric at web.de [mailto:ozric at web.de]
Sent: Wednesday, April 17, 2002 2:23 PM
To: Rishabh Gupta
Cc: r-help at stat.math.ethz.ch
Subject: Re: [R] Classification Analysis
Rishabh,
yes your "PseudoCode" idea is what i want in R,too.
But even i mentioned , i didn't know of something like that in R/S-plus and
i have got open eyes for this.
Perhaps anybody in the whole R community work on machineLearning-algorithm
with rule generating,too ???
So you must program a function for yourself (i.e. C4.5 or FS-ID3) !?
(If you are interesting for papers (pdf) with algorithm let me know).
regards,Christian
Am 17.04.2002 15:13:47, schrieb "Rishabh Gupta" <rg117 at
ohm.york.ac.uk>:>Thanks for your reply.
>I am still learning these aspects of statistical analysis, so if I don't
>make sense please forgive me.
>All this work I am doing is part of a Phd and my deparment does not really
>neural network for situations like these because they say it acts like a
>black box and we don't really know what is happening in the inside. So I
>would like to concentrate on using pure statistical techniques. Ideally
what>I would like is some kind of a function that calculates the
"classification
>rule" when given a grouped data set :
> rule <- classification( GroupVar ~ DepVar1 + DepVar2 + DepVar3 +
DepVar4>..... + ...... DepVarX )
>
>Then I would be able to use that rule and apply to a single data element
for>which the group is not known:
>
> theGroup <- rule( DataElement )
>
>I have looked at the rpart package in R but I am not entirely sure how to
>use in a way that I can create the "classification rule" and then
use that
>rule. I understand that it is a general problem but it's made more
difficult>for me because I have to deal with 750 dependent variables.
>
>Your help is greatly appreciated.
>
>Many Thanks
>
>Rishabh
>
>----- Original Message -----
>From: "Huntsinger, Reid" <reid_huntsinger at merck.com>
>To: "'Rishabh Gupta'" <rg117 at ohm.york.ac.uk>;
<r-help at stat.math.ethz.ch>
>Sent: Tuesday, April 16, 2002 5:00 PM
>Subject: RE: [R] Classification Analysis
>
>
>> This is a very general problem and a very large area of
>statistics/computer
>> science/etc is concerned with it. R provides lots of possibilities; you
>> might find tree-based approaches (recursive partitioning) to suit your
>> needs; in that case, rpart and the new random forests package will be
of
>> interest. Also see package e1071 and the VR packages for starters.
There
>are
>> lot of other possibilities; you might want to have a look at Ripley,
>Pattern
>> Recognition and Neural Networks, for example, to see some.
>>
>> Reid Huntsinger
>>
>> -----Original Message-----
>> From: Rishabh Gupta [mailto:rg117 at ohm.york.ac.uk]
>> Sent: Tuesday, April 16, 2002 11:14 AM
>> To: r-help at stat.math.ethz.ch
>> Subject: [R] Classification Analysis
>>
>>
>> Hi everyone,
>> Could somebody explain to me what is the package/function for
>> classification analysis. I am performing analysis of music files in the
>form
>> of MIDI files. I end up with about 750 dependent variables from the
>> analysis, I also have a number of independent/grouping variables that I
>set
>> manually. What I would like is to be able to predict which group a
>> particular MIDI files belongs to given the 750 dependent variables. In
>order
>> to this I have to perform classification analysis on a sample set of
MIDI
>> files where I know what group they belong to. I want to extract the
>> 'classification rule' that would enable me to predict the group
of each
>MIDI
>> file (there would be a different classification rule for each grouping
>> variable). Can anybody explain what is the best way of doing this in R.
>What
>> is the best package/function that would enable me to perform
>classification
>> analysis.
>>
>> Any help would be greatly appreciated.
>>
>> Many Thanks For Your Help!
>>
>> Rishabh
>>
>>
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.>-.
>> -.-
>> r-help mailing list -- Read
>http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
>> Send "info", "help", or "[un]subscribe"
>> (in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
>>
>_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
.>> _._
>>
>>
>>
-------------------------------------------------------------------------->----
>> Notice: This e-mail message, together with any attachments, contains
>information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA)
that
>may be confidential, proprietary copyrighted and/or legally privileged, and
>is intended solely for the use of the individual or entity named in this
>message. If you are not the intended recipient, and have received this
>message in error, please immediately return this by e-mail and then delete
>it.
>>
>>
>==========================================================================>=>>
>>
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.>-.-.-
>> r-help mailing list -- Read
>http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
>> Send "info", "help", or "[un]subscribe"
>> (in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
>>
>_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
.>_._
>>
>
>-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
.-.->r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
>Send "info", "help", or "[un]subscribe"
>(in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
>_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
._._>
________________________________________________________________
Keine verlorenen Lotto-Quittungen, keine vergessenen Gewinne mehr!
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.
-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._.
_._
------------------------------------------------------------------------------
Notice: This e-mail message, together with any attachments, contains
information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA) that
may be confidential, proprietary copyrighted and/or legally privileged, and is
intended solely for the use of the individual or entity named in this message.
If you are not the intended recipient, and have received this message in error,
please immediately return this by e-mail and then delete it.
=============================================================================
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at
stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
Hi Bill, a really nice fundus especially for "fuzzy-papers" what is in present my main interesting) you can download at http://www.scch.at/index.html . If you have a special method interest let me now ,beacuse i have got a growing pdf-archive. ....my learning by doing project for R is a fuzzy association rule approach. suggestion for Rishabh:>> >All this work I am doing is part of a Phd and my deparment does not really >> >neural network for situations like these because they say it acts like a >> >black box and we don't really know what is happening in the inside. So I >> >would like to concentrate on using pure statistical techniques.I'm not complete agree that neural networks are a black box, because it is possible to measure the sensitivity. One sophisticated strategy i.e. is 1. learn a neural network with independent and classification variable 2. If the neural network have got a good learning result test the generalization. 3.If this is ok for your purporse, change the values for all cases of the independent variable (which sens. you interested). 4. Now generalize again the neural network with the modified data and you get a new distribution of the classification variable and now the influence of the modification. The advantage in contradiction to "logistic regression" is that you are not bound to linearity and can i.e. test interaction effects when you change 2 variables at the same time. The difficulty with ANN here is the estimation of the neural network paramteres for good training and test approximation, but for this exist genetic algorithm. regards,christian>Hello Ozric, > >I am interested in knowing what papers you have available. > >Thanks, > >Bill > >ozric at web.de wrote: > >> Rishabh, >> yes your "PseudoCode" idea is what i want in R,too. >> But even i mentioned , i didn't know of something like that in R/S-plus and i have got open eyes for this. >> >> Perhaps anybody in the whole R community work on machineLearning-algorithm >> with rule generating,too ??? >> >> So you must program a function for yourself (i.e. C4.5 or FS-ID3) !? >> >> (If you are interesting for papers (pdf) with algorithm let me know). >> regards,Christian >> >> Am 17.04.2002 15:13:47, schrieb "Rishabh Gupta" <rg117 at ohm.york.ac.uk>: >> >Thanks for your reply. >> >I am still learning these aspects of statistical analysis, so if I don't >> >make sense please forgive me. >> >All this work I am doing is part of a Phd and my deparment does not really >> >neural network for situations like these because they say it acts like a >> >black box and we don't really know what is happening in the inside. So I >> >would like to concentrate on using pure statistical techniques. Ideally what >> >I would like is some kind of a function that calculates the "classification >> >rule" when given a grouped data set : >> > rule <- classification( GroupVar ~ DepVar1 + DepVar2 + DepVar3 + DepVar4 >> >..... + ...... DepVarX ) >> > >> >Then I would be able to use that rule and apply to a single data element for >> >which the group is not known: >> > >> > theGroup <- rule( DataElement ) >> > >> >I have looked at the rpart package in R but I am not entirely sure how to >> >use in a way that I can create the "classification rule" and then use that >> >rule. I understand that it is a general problem but it's made more difficult >> >for me because I have to deal with 750 dependent variables. >> > >> >Your help is greatly appreciated. >> > >> >Many Thanks >> > >> >Rishabh >> > >> >----- Original Message ----- >> >From: "Huntsinger, Reid" <reid_huntsinger at merck.com> >> >To: "'Rishabh Gupta'" <rg117 at ohm.york.ac.uk>; <r-help at stat.math.ethz.ch> >> >Sent: Tuesday, April 16, 2002 5:00 PM >> >Subject: RE: [R] Classification Analysis >> > >> > >> >> This is a very general problem and a very large area of >> >statistics/computer >> >> science/etc is concerned with it. R provides lots of possibilities; you >> >> might find tree-based approaches (recursive partitioning) to suit your >> >> needs; in that case, rpart and the new random forests package will be of >> >> interest. Also see package e1071 and the VR packages for starters. There >> >are >> >> lot of other possibilities; you might want to have a look at Ripley, >> >Pattern >> >> Recognition and Neural Networks, for example, to see some. >> >> >> >> Reid Huntsinger >> >> >> >> -----Original Message----- >> >> From: Rishabh Gupta [mailto:rg117 at ohm.york.ac.uk] >> >> Sent: Tuesday, April 16, 2002 11:14 AM >> >> To: r-help at stat.math.ethz.ch >> >> Subject: [R] Classification Analysis >> >> >> >> >> >> Hi everyone, >> >> Could somebody explain to me what is the package/function for >> >> classification analysis. I am performing analysis of music files in the >> >form >> >> of MIDI files. I end up with about 750 dependent variables from the >> >> analysis, I also have a number of independent/grouping variables that I >> >set >> >> manually. What I would like is to be able to predict which group a >> >> particular MIDI files belongs to given the 750 dependent variables. In >> >order >> >> to this I have to perform classification analysis on a sample set of MIDI >> >> files where I know what group they belong to. I want to extract the >> >> 'classification rule' that would enable me to predict the group of each >> >MIDI >> >> file (there would be a different classification rule for each grouping >> >> variable). Can anybody explain what is the best way of doing this in R. >> >What >> >> is the best package/function that would enable me to perform >> >classification >> >> analysis. >> >> >> >> Any help would be greatly appreciated. >> >> >> >> Many Thanks For Your Help! >> >> >> >> Rishabh >> >> >> >> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-. >> >-. >> >> -.- >> >> r-help mailing list -- Read >> >http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >> >> Send "info", "help", or "[un]subscribe" >> >> (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >> >> >> >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._. >> >> _._ >> >> >> >> >> >> -------------------------------------------------------------------------- >> >---- >> >> Notice: This e-mail message, together with any attachments, contains >> >information of Merck & Co., Inc. (Whitehouse Station, New Jersey, USA) that >> >may be confidential, proprietary copyrighted and/or legally privileged, and >> >is intended solely for the use of the individual or entity named in this >> >message. If you are not the intended recipient, and have received this >> >message in error, please immediately return this by e-mail and then delete >> >it. >> >> >> >> >> >===========================================================================>> >=>> >> >> >> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-. >> >-.-.- >> >> r-help mailing list -- Read >> >http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >> >> Send "info", "help", or "[un]subscribe" >> >> (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >> >> >> >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._. >> >_._ >> >> >> > >> >-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- >> >r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >> >Send "info", "help", or "[un]subscribe" >> >(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >> >_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ >> > >> ________________________________________________________________ >> Keine verlorenen Lotto-Quittungen, keine vergessenen Gewinne mehr! >> >> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- >> r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html >> Send "info", "help", or "[un]subscribe" >> (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch >> _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ > >______________________________________________________________________________ Sie wollen mehr? Mehr Speicher, mehr Mail, mehr Erlebnis, mehr Leistung, -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html Send "info", "help", or "[un]subscribe" (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._