Displaying 20 results from an estimated 10000 matches similar to: "Classification Analysis"
2002 Oct 14
3
normalizing data sets
Hi,
Can someone tell me how to normalize a data set so that the mean of the set is 0 and the variance is 1. As I understand, when you
calculate the principle components of a data set through correlation as
< princomp( dataset, cor=T ) >
then a similar calculation is performed. I would like to know how I can perform such a calulation directly. Any help would be
greatly appreciated.
Many
2002 Oct 17
1
panel vs subpanel in cloud from lattice library
Hi,
Could somebody explain to be the different between the panel and the subpanel parameter in the cloud function of the lattice
library. I'm not sure when to use which.
Thanks
Rishabh
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or
2002 Jul 03
2
grouping in scatterplot3d
Hi all,
I've been using the scatterplot3d function (from the scatterplot3d library) to create 3D plots. I was wondering whether there is
anyway to group the points according to a particular group variable. For example I used the plot function as
plot( factor( <GROUPINGVAR> ) , <PLOTVARS> )
to organise the results in a according to the grouping variable. I was wondering
2003 Feb 27
2
multidimensional function fitting
Take a look at package mgcv. Hope this helps. --Matt
-----Original Message-----
From: RenE J.V. Bertin [mailto:rjvbertin at despammed.com]
Sent: Thursday, February 27, 2003 1:39 PM
To: r-help at stat.math.ethz.ch
Subject: [R] multidimensional function fitting
Hello,
I have been looking around for how to perform a multidimensional, arbitrary
function fit (in any case non-linear; more below),
2002 Mar 10
6
Newbie with R
Hello,
I would like to test some learning algorithms (C5.0, or C4.5) using decision trees or neural network.
I didn't find anything in the R documentation for functions to apply on datasets (well, English is not my mother tongue and I have some problems to understand some functions descriptions). Does anybody know where I can find anything about that ?
Thank you very much,
Thomas
2004 Mar 13
4
nnet classification accuracy vs. other models
I was wandering if anybody ever tried to compare the classification
accuracy of nnet to other (rpart, tree, bagging) models. From what I
know, there is no reason to expect a significant difference in
classification accuracy between these models, yet in my particular case
I get about 10% error rate for tree, rpart and bagging model and 80%
error rate for nnet, applied to the same data.
Thanks.
2004 Jan 06
2
comparing classification methods: 10-fold cv or leaving-one-out ?
Hi
what would you recommend to compare classification methods such as LDA,
classification trees (rpart), bagging, SVM, etc:
10-fold cv (as in Ripley p. 346f)
or
leaving-one-out (as e.g. implemented in LDA)?
my data-set is not that huge (roughly 200 entries)
many thanks for a hint
Christoph
--
Christoph Lehmann <christoph.lehmann at gmx.ch>
2009 Jun 19
4
Recursive partitioning algorithms in R vs. alia
Dear R-helpers,
I had a conversation with a guy working in a "business intelligence"
department at a major Spanish bank. They rely on recursive partitioning
methods to rank customers according to certain criteria.
They use both SAS EM and Salford Systems' CART. I have used package R
part in the past, but I could not provide any kind of feature comparison
or the like as I have no
2003 May 27
3
0 margin for creating eps files
Dear all,
I am trying to create eps files of R plots (in Linux) so that I can import them into Word
(obviously in MS Windows). What I would like is for the files to be cropped so that there is no
margin around the actual plot, because I have no way of editing the files after they have been
created. I have tried using
par(mai=c(.75,.75,0,0))
in order to reduce the margin; it works fine when I
2003 Mar 10
1
rpart returning only 1 node
Hi,
This may actually be a theoretical question.
When I tried to do the following:
##########################################################
> colnames(rating.adclms)
[1] "usage" "mileage" "sex" "excess" "ncd"
[6] "primage" "minage" "drivers" "district" "cargroup"
[11]
2006 Feb 27
1
clustering
Hi there,
Sorry for the double email. Does R have the packages for the following
clustering methods? And if it does, what the commands for them?
1. SOM (Self-organization map)
2. Graph partitioning:
3. Neural network
4. Probability Binning
Thank you very much!
Linda
[[alternative HTML version deleted]]
2008 May 15
5
Inconsistent linear model calculations
Readers,
Using version 251 I tried the following command:
lm(y~a+b,data=datafile)
Resulting in, inter alia:
...
coefficients
(intercept) a
1.2 3.4
Packages installed:
acepack ace() and avas() for selecting regression
transformations
adlift An adaptive lifting scheme algorithm
akima Interpolation of irregularly spaced
2009 Jan 13
1
Message: No title available (pre-2.0.0 install?)
Hello All,
I'm actually the system administrator of a UNIX system where several
users use R version 2.6.0. I have a user who is trying to use the SURVEY
package, and when he does, he gets the message:
survey' is not a valid package -- installed < 2.0.0?
When I run the library() command, I get (see below):
Anything that is listed as ** No title available (pre-2.0.0 install?) **
2005 Mar 18
1
How to show which variables include in plot of classification tree
Dear all
For my research, I am learning classification now.
I was trying some example about classification tree pakages, such as
tree and rpart, for instance,
in Pima.te dataset have 8 variables (include class=type):
library(rpart)
library(datasets)
pima.rpart <- rpart(type ~ npreg+glu+bp+skin+bmi+ped+age,data=Pima.te,
method='class')
plot(pima.rpart, uniform=TRUE)
text(pima.rpart)
2007 Jun 15
2
method of rpart when response variable is binary?
Dear all,
I would like to model the relationship between y and x. y is binary
variable, and x is a count variable which may be possion-distribution.
I think it is better to divide x into intervals and change it to a
factor before calling glm(y~x,data=dat,family=binomail).
I try to use rpart. As y is binary, I use "class" method and get the
following result.
>
2011 Nov 04
1
Decision tree model using rpart ( classification
Hi Experts,
I am new to R, using decision tree model for getting segmentation rules.
A) Using behavioural data (attributes defining customer behaviour, ( example
balances, number of accounts etc.)
1. Clustering: Cluster behavioural data to suitable number of clusters
2. Decision Tree: Using rpart classification tree for generating rules for
segmentation using cluster number(cluster id) as target
2006 Mar 07
3
how to use the rpart function?
Hi all,
What parameter do I normally change in the rpart function? How do I set the
"cp" option?
Is there a way to read off error rate directly from the "rpart" function for
training data; I imagine for testing data I have to apply a "predict", but
for training data I guess the error count would be somewhere existing once
the "rpart" function is
2001 Aug 02
1
Missing value in Rpart
Hi, all
Our understanding of how classification trees in Rpart treat missing is
that if the variable is ordinal(continous), Rpart, by default, imputes a
value for missing. How do we do the classification tree and tell Rpart not
to impute. That is, what command is used to turn off the imputation.
Also, if we do get true missing, how does classification tree analysis in
Rpart treat missing when
2002 Aug 12
1
question about cloud() in lattice package
Hi all,
I have been previously been using scatterplot3d package to create some graphs but unfortunately it does not allow me to rotate the
plot on all three axis. The cloud() function in the lattice package does allow me to do so. When I was using scatterplot3d I was
using a script (Shown Below) to calculate the mean, quartiles and range limits for all three axis and I was representing that on the
2003 Jul 22
6
variable names
Dear helpers
I want to use rpart several times in a loop to build a classification tree. My problem is that rpart needs a formula as argument and for that the variables need to have names and this doesn't happen in my case. Every iteration in the loop has a different dataset with several variables (ex. 38 or more) and so I can't type the names by hand every time. Is there any function