search for: unsupervised

Displaying 20 results from an estimated 77 matches for "unsupervised".

2007 Jun 06
0
Question on RandomForest in unsupervised mode
Hi, I attempted to run the randomForest() function on a dataset without predefined classes. According to the manual, running randomForest without a response variable/class labels should result in the function assuming you are running in unsupervised mode. In this case, I understand that my data is all assigned to one class whereas a second synthetic class is made up, which is assigned to a second class. The online manual suggests that an oob misclassification error in this two-class problem of ~40% or more would indicate that the x- v...
2006 Mar 08
1
Unsupervised RandomForest
Dear all, I am trying to calculate the proximity matrix for a data set with 16 variables and 6804 observations using random forests. I have a Pentium 4, 3.00GHz processor with 1 GB of RAM. When I use the command randomForest(data.scale,proximity=T) I get the warning message Error: cannot allocate vector of size 361675 kb Is this because I have reached the limit of what my computer is
2008 Jul 02
1
Usage of rJava (.jcall) with Weka functions, any example?
...at are not implemented (do not have interface) in RWeka, like the Remove function and others in the future! The .java() functionality is for that purpose but I haven't seen any example with Weka functions. Could anyone give me hand in how to do it? For instace if I want to use the weka.filters.unsupervised.attribute.Remove? 1. in the R console, first I load a matrix file, > matrix <- read.table(fileName, header=TRUE) OK! 2. Second, I f I want to remove the first attribute with Weka: > library(grid) > library(rJava) > library(RWeka) > c <- .jcall("weka/filters/unsupervise...
2007 Jul 18
2
EM unsupervised clustering
Hi All, I have a n x m matrix. The n rows are individuals, the m columns are variables. The matrix is in itself a collection of 1s (if a variable is observed for an individual), and 0s (is there is no observation). Something like: [,1] [,2] [,3] [,4] [,5] [,6] [1,] 1 0 1 1 0 0 [2,] 1 0 1 1 0 0 [3,] 1 0 1 1 0 0 [4,] 0 1 0
2007 Nov 22
1
Cluster Analysis:build a classifier?
Dear All, I'm currently doing a project about unsupervised learning, and I'll be using R to analyse a few network traffic datasets downloaded off Andrew Moore's website (http://www.cl.cam.ac.uk/Research/SRG/netos/nprobe/data/papers/sigmetrics/index.html). Could anyone shed some light on how to build a classifier from the training set A, and then...
2010 Apr 25
1
randomForest predictions with new data
...I have taken a random post from iris database and taken the species data away and tried to use it as new data but I have not succeeded in finding out what species it is. How do I write that in R? And how do I get the guess if it is a regression problem? Is it possible to get it with unsupervised learning? One post with new data resulting in a good guess? Thank you Rolf [[alternative HTML version deleted]]
2010 Apr 06
2
help output figures in R
somfunc<- function (file) { aa_som<-scale(file) final.som<-som(data=aa_som, rlen=10000, grid=somgrid(5,4, "hexagonal")) pdf(file="/home/cdu/changbin/file.pdf") #output graphic file. plot(final.som, main="Unsupervised SOM") dev.off() } I have many different files, if I want output pdf file with the same name as for each dataset I feed to the function somfunc. HOw should I DO? THANKS! -- Sincerely, Changbin -- Changbin Du DOE Joint Genome Institute Bldg 400 Rm 457 2800 Mitchell Dr Walnut Creet, CA 9...
2007 Jun 08
2
How to do clustering
Dear List, I have another question to bother you about how to do clustering. My data consists of 49 columns (49 variables) and 238804 rows. I would like to do hierarchical clustering (unsupervised clustering and PCA). So far I tried pvclust (www.is.titech.ac.jp/~shimo/prog/*pvclust* /) but I always had the problem like for R like "cannot allocate the memory". I am curious about what else packages can perform the clustering analysis while memory efficient. Meanwhile, is there any...
2014 Mar 16
2
Contribute in the Xapian project throgh GSoC.
...n algorithm for Parallel hardware that boosted getting depths of objects in an image). I think that I would be able to contribute in the Learning to Rank and the Cluster Analysis algorithms. Having studied it in my undergrad, I have a deep understanding of support vector machines, supervised and unsupervised learning and have implemented codes in Matlab and Python in my lab work. Other than that I am a regular C++ programmer and a linux user. Also very familiar with the OSS community, having contributed in the OpenCL(a Parallel computing language) AMD forums. I am hoping to make use of the SoC prog...
2014 Mar 22
2
[GSOC 2014] Indexing INEX dataset
For unsupervised approaches like BM25 this approach works well but letor does not need special weighting for title in this form as it itself assigns weights to title features separately. But I see your concern it would be a problem when BM25 is used on the index with this setup. Hence its preferable to take a note...
2014 Mar 04
4
Questions on letor module
Hi, I have several questions regarding the letor module,I looked at the framework of learning to rank in xapian http://rishabhmehrotra.com/gsoc/17.png, I am a little confused. Why using deep learning to find unsupervised features in test data? Since in my understanding, learning to rank model usually learn features from the training data then apply the model to the test data? Why test set and training set have different features? And deep learning is to extract hidden features from the data set, I don't think i...
2012 Apr 01
1
[GSoC2012] Learning to Rank: few thoughts/issues
...9;11<http://eprints.pascal-network.org/archive/00008597/01/342_icmlpaper.pdf>used the advantage of feature learning using unlabeled data and beat the state-of-the-art in sentiment classification. Combining the above two points, I suggest an approach which uses features learnt from data in an unsupervised fashion "*in addition to*" the commonly used features. *Please note:* all this is in addition to the traditional features and finally we would be using *listwise/pairwise approaches*[ListMLE, et cetera] to train our models on the new set of features. Please let me know if this sounds good...
2004 Jan 07
0
Statistical Learning and Datamining course based on R/Splus tools
...nts in internet technology, genomics and other high-tech industries, we rely increasingly more on data analysis and statistical models to exploit the vast amounts of data at our fingertips. This sequel to our popular "Modern Regression and Classification" course covers many new areas of unsupervised learning and data mining, and gives an in-depth treatment of some of the hottest tools in supervised learning. The first course is not a prerequisite for this new course. All of the techniques discussed in the course are implemented by the authors and others in the S language (S-plus or R). Da...
2004 Jul 12
0
Statistical Learning and Data Mining Course
...nts in internet technology, genomics and other high-tech industries, we rely increasingly more on data analysis and statistical models to exploit the vast amounts of data at our fingertips. This sequel to our popular "Modern Regression and Classification" course covers many new areas of unsupervised learning and data mining, and gives an in-depth treatment of some of the hottest tools in supervised learning. The first course is not a prerequisite for this new course. Most of the techniques discussed in the course are implemented by the authors and others in the S language (S-plus or R), and...
2005 Jan 04
0
Statistical Learning and Data Mining Course
...ents in internet technology, genomics and other high-tech industries, we rely increasingly more on data analysis and statistical models to exploit the vast amounts of data at our fingertips. This sequel to our popular "Modern Regression and Classification" course covers many new areas of unsupervised learning and data mining, and gives an in-depth treatment of some of the hottest tools in supervised learning. The first course is not a prerequisite for this new course. Most of the techniques discussed in the course are implemented by the authors and others in the S language (S-plus or R), and a...
2001 Oct 03
1
package GeneSOM ?
...possibility , extract the information from the SOMplot "clusterSize" and "mean" for every cluster as quantitative information ( i.e. the DataFrame with an additional column which define the calculate clusters from SOM)? My intention - compare calculate SOM results with other unsupervised classification results ! P.S. How i have got qerror to interpret ? Thanks for advance & regards, Christian _______________________________________________________________________ -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.- r-help mailing list -- Read...
2008 Oct 28
2
slightly OT: (un)supervised clustering?
...inant analysis expects me to do the grouping and then it will "decide" the rest. Therefore not suitable. A bunch of significance tests can help in deciding whether the differences are statistically significant. But again, I have to present my own groups, therefore - not suitable. Other unsupervised learning algorithms (Neural Networks & Co) - well, how can I instruct them to do analysis along an environmental gradient of depth ?.. If anyone among the experts on this list has dealt with similar problems before I would highly appreciate if you could briefly describe your approaches or...
2006 Apr 10
2
passing known medoids to clara() in the cluster package
...a wrapper to pam(), which will accept known medoid data - I am wondering if this too is possible with clara() ... The documentation does not suggest that this is possible. Essentially I am trying to implement a "supervised classification" of numerous geographic data layers. The "unsupervised" approach using clara() works well, but I feel the output classes would be more meaningful if I were able to let clara() know about the classes that I have in mind. Is this at all feasible, or am I trying to accomplish something that is not possible? Cheers, -- Dylan Beaudette Soils an...
2012 Mar 24
3
Learning to rank
...33000 features and a binary class label with +1 OR -1 value. I applied the Decision Tree induction(GINI INDEX) Approach for filtering out the URLs and then applying a RANKSUM[1] metric, which uses weighted sum approach, to rank the URLs accordingly. The current implementation involves firstly the unsupervised ranking of a query and then applying a supervised learning algorithm, SVM, on the first 'n' documents retrieved. A similar approach can be incorporated while extending the problem of ranking with a better supervised learning algorithm and probabilistic model viz. Bayesian Belief Networks i....
2010 Mar 30
1
predict.kohonen for SOM returns NA?
All, The kohonen predict function is returning NA for SOM predictions regardless of data used... even the package example for a SOM using wine data is returning NA's Does anyone have a working example SOM. Also, what is the purpose of trainY, what would be the dependent data for an unsupervised SOM? As may be apparent to you by my questions, I am very new to kohonen maps and am very grateful for any assistance offered. Thanks in advance! Below is terminal output: > sessionInfo() R version 2.10.1 (2009-12-14) i386-pc-mingw32 locale: [1] LC_COLLATE=English_United States.1252 LC_CTYPE=E...