Displaying 20 results from an estimated 78 matches for "unsupervised".
2007 Jun 06
0
Question on RandomForest in unsupervised mode
Hi,
I attempted to run the randomForest() function on a dataset without
predefined classes. According to the manual, running randomForest
without a response variable/class labels should result in the
function assuming you are running in unsupervised mode. In this case,
I understand that my data is all assigned to one class whereas a
second synthetic class is made up, which is assigned to a second
class. The online manual suggests that an oob misclassification error
in this two-class problem of ~40% or more would indicate that the x-
v...
2006 Mar 08
1
Unsupervised RandomForest
Dear all,
I am trying to calculate the proximity matrix for a data set with 16 variables
and 6804 observations using random forests. I have a Pentium 4, 3.00GHz
processor with 1 GB of RAM. When I use the command
randomForest(data.scale,proximity=T)
I get the warning message
Error: cannot allocate vector of size 361675 kb
Is this because I have reached the limit of what my computer is
2008 Jul 02
1
Usage of rJava (.jcall) with Weka functions, any example?
...at are not implemented (do not have
interface) in RWeka, like the Remove function and others in the
future!
The .java() functionality is for that purpose but I haven't seen any
example with Weka functions. Could anyone give me hand in how to do
it? For instace if I want to use the
weka.filters.unsupervised.attribute.Remove?
1. in the R console, first I load a matrix file,
> matrix <- read.table(fileName, header=TRUE)
OK!
2. Second, I f I want to remove the first attribute with Weka:
> library(grid)
> library(rJava)
> library(RWeka)
> c <- .jcall("weka/filters/unsupervise...
2007 Jul 18
2
EM unsupervised clustering
Hi All,
I have a n x m matrix. The n rows are individuals, the m columns are variables.
The matrix is in itself a collection of 1s (if a variable is observed for an
individual), and 0s (is there is no observation).
Something like:
[,1] [,2] [,3] [,4] [,5] [,6]
[1,] 1 0 1 1 0 0
[2,] 1 0 1 1 0 0
[3,] 1 0 1 1 0 0
[4,] 0 1 0
2007 Nov 22
1
Cluster Analysis:build a classifier?
Dear All,
I'm currently doing a project about unsupervised learning, and I'll be using
R to analyse a few network traffic datasets downloaded off Andrew Moore's
website
(http://www.cl.cam.ac.uk/Research/SRG/netos/nprobe/data/papers/sigmetrics/index.html).
Could anyone shed some light on how to build a classifier from the training
set A, and then...
2010 Apr 25
1
randomForest predictions with new data
...I have taken a random post from iris database and taken the species data
away and tried to use it as new data but I have not succeeded in finding out
what species it is.
How do I write that in R?
And how do I get the guess if it is a regression problem?
Is it possible to get it with unsupervised learning? One post with new data
resulting in a good guess?
Thank you
Rolf
[[alternative HTML version deleted]]
2010 Apr 06
2
help output figures in R
somfunc<- function (file) {
aa_som<-scale(file)
final.som<-som(data=aa_som, rlen=10000, grid=somgrid(5,4, "hexagonal"))
pdf(file="/home/cdu/changbin/file.pdf") #output graphic file.
plot(final.som, main="Unsupervised SOM")
dev.off()
}
I have many different files, if I want output pdf file with the same name
as for each dataset I feed to the function somfunc.
HOw should I DO?
THANKS!
--
Sincerely,
Changbin
--
Changbin Du
DOE Joint Genome Institute
Bldg 400 Rm 457
2800 Mitchell Dr
Walnut Creet, CA 9...
2007 Jun 08
2
How to do clustering
Dear List,
I have another question to bother you about how to do clustering.
My data consists of 49 columns (49 variables) and 238804 rows.
I would like to do hierarchical clustering (unsupervised clustering
and PCA). So far I tried pvclust (www.is.titech.ac.jp/~shimo/prog/*pvclust*
/)
but I always had the problem like for R like "cannot allocate the memory".
I am curious about what else packages can perform the clustering analysis
while memory efficient.
Meanwhile, is there any...
2014 Mar 16
2
Contribute in the Xapian project throgh GSoC.
...n
algorithm for Parallel hardware that boosted getting depths of objects in
an image).
I think that I would be able to contribute in the Learning to Rank and the
Cluster Analysis algorithms. Having studied it in my undergrad, I have a
deep understanding of support vector machines, supervised and unsupervised
learning and have implemented codes in Matlab and Python in my lab work.
Other than that I am a regular C++ programmer and a linux user. Also very
familiar with the OSS community, having contributed in the OpenCL(a
Parallel computing language) AMD forums.
I am hoping to make use of the SoC prog...
2014 Mar 22
2
[GSOC 2014] Indexing INEX dataset
For unsupervised approaches like BM25 this approach works well but letor
does not need special weighting for title in this form as it itself assigns
weights to title features separately.
But I see your concern it would be a problem when BM25 is used on the index
with this setup. Hence its preferable to take a note...
2014 Mar 04
4
Questions on letor module
Hi,
I have several questions regarding the letor module,I looked at the
framework of learning to rank in xapian
http://rishabhmehrotra.com/gsoc/17.png, I am a little confused. Why using
deep learning to find unsupervised features in test data? Since in my
understanding, learning to rank model usually learn features from the
training data then apply the model to the test data? Why test set and
training set have different features? And deep learning is to extract
hidden features from the data set, I don't think i...
2012 Apr 01
1
[GSoC2012] Learning to Rank: few thoughts/issues
...9;11<http://eprints.pascal-network.org/archive/00008597/01/342_icmlpaper.pdf>used
the advantage of feature learning using unlabeled data and beat the
state-of-the-art in sentiment classification.
Combining the above two points, I suggest an approach which uses features
learnt from data in an unsupervised fashion "*in addition to*" the commonly
used features.
*Please note:* all this is in addition to the traditional features and
finally we would be using *listwise/pairwise approaches*[ListMLE, et
cetera] to train our models on the new set of features. Please let me know
if this sounds good...
2004 Jan 07
0
Statistical Learning and Datamining course based on R/Splus tools
...nts in internet technology, genomics and other high-tech
industries, we rely increasingly more on data analysis and statistical
models to exploit the vast amounts of data at our fingertips.
This sequel to our popular "Modern Regression and Classification"
course covers many new areas of unsupervised learning and data mining,
and gives an in-depth treatment of some of the hottest tools in
supervised learning.
The first course is not a prerequisite for this new course.
All of the techniques discussed in the course are implemented by the
authors and others in the S language (S-plus or R).
Da...
2004 Jul 12
0
Statistical Learning and Data Mining Course
...nts in internet technology, genomics and other high-tech
industries, we rely increasingly more on data analysis and statistical
models to exploit the vast amounts of data at our fingertips.
This sequel to our popular "Modern Regression and Classification"
course covers many new areas of unsupervised learning and data mining,
and gives an in-depth treatment of some of the hottest tools in
supervised learning.
The first course is not a prerequisite for this new course.
Most of the techniques discussed in the course are implemented by the
authors and others in the S language (S-plus or R), and...
2005 Jan 04
0
Statistical Learning and Data Mining Course
...ents in internet technology, genomics and other high-tech
industries, we rely increasingly more on data analysis and statistical
models to exploit the vast amounts of data at our fingertips.
This sequel to our popular "Modern Regression and Classification"
course covers many new areas of unsupervised learning and data mining,
and gives an in-depth treatment of some of the hottest tools in
supervised learning.
The first course is not a prerequisite for this new course.
Most of the techniques discussed in the course are implemented by the
authors and others in the S language (S-plus or R), and a...
2001 Oct 03
1
package GeneSOM ?
...possibility , extract the
information from the SOMplot "clusterSize" and "mean" for every cluster as quantitative information ( i.e. the DataFrame with an additional column which
define the calculate clusters from SOM)?
My intention - compare calculate SOM results with other unsupervised classification results !
P.S. How i have got qerror to interpret ?
Thanks for advance & regards,
Christian
_______________________________________________________________________
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read...
2008 Oct 28
2
slightly OT: (un)supervised clustering?
...inant analysis expects me to do the grouping and then it will
"decide" the rest. Therefore not suitable.
A bunch of significance tests can help in deciding whether the
differences are statistically significant. But again, I have to present
my own groups, therefore - not suitable.
Other unsupervised learning algorithms (Neural Networks & Co) - well,
how can I instruct them to do analysis along an environmental gradient
of depth ?..
If anyone among the experts on this list has dealt with similar problems
before I would highly appreciate if you could briefly describe your
approaches or...
2006 Apr 10
2
passing known medoids to clara() in the cluster package
...a wrapper to pam(), which will accept known medoid
data - I am wondering if this too is possible with clara() ... The
documentation does not suggest that this is possible.
Essentially I am trying to implement a "supervised classification" of numerous
geographic data layers. The "unsupervised" approach using clara() works well,
but I feel the output classes would be more meaningful if I were able to let
clara() know about the classes that I have in mind.
Is this at all feasible, or am I trying to accomplish something that is not
possible?
Cheers,
--
Dylan Beaudette
Soils an...
2012 Mar 24
3
Learning to rank
...33000 features and a binary class label with +1 OR -1
value. I applied the Decision Tree induction(GINI INDEX) Approach for
filtering out the URLs and then applying a RANKSUM[1] metric, which uses
weighted sum approach, to rank the URLs accordingly.
The current implementation involves firstly the unsupervised ranking of a
query and then applying a supervised learning algorithm, SVM, on the first
'n' documents retrieved.
A similar approach can be incorporated while extending the problem of
ranking with a better supervised learning algorithm and probabilistic model
viz. Bayesian Belief Networks i....
2010 Mar 30
1
predict.kohonen for SOM returns NA?
All,
The kohonen predict function is returning NA for SOM predictions
regardless of data used... even the package example for a SOM using
wine data is returning NA's
Does anyone have a working example SOM. Also, what is the purpose of
trainY, what would be the dependent data for an unsupervised SOM?
As may be apparent to you by my questions, I am very new to kohonen
maps and am very grateful for any assistance offered. Thanks in
advance!
Below is terminal output:
> sessionInfo()
R version 2.10.1 (2009-12-14)
i386-pc-mingw32
locale:
[1] LC_COLLATE=English_United States.1252 LC_CTYPE=E...