similar to: Clustering

Displaying 20 results from an estimated 2000 matches similar to: "Clustering"

2008 Sep 02
2
cluster a distance(analogue)-object using agnes(cluster)
I try to perform a clustering using an existing dissimilarity matrix that I calculated using distance (analogue) I tried two different things. One of them worked and one not and I don`t understand why. Here the code: not working example library(cluster) library(analogue) iris2<-as.data.frame(iris) str(iris2) 'data.frame': 150 obs. of 5 variables: $ Sepal.Length: num 5.1 4.9 4.7
2011 Jun 27
3
New to R, trying to use agnes, but can't load my ditance matrix
Hi, I'm mighty new to R. I'm using it on Windows. I'm trying to cluster using a distance matrix I created from the data on my own and called it D10.dist. I loaded the cluster package. Then tried the following command... > agnes("E:D10.dist", diss = TRUE, metric = "euclidean", stand = FALSE, > method = "average", par.method, keep.diss = n < 1000,
2011 Jan 27
3
agnes clustering and NAs
Hello, In the documentation for agnes in the package 'cluster', it says that NAs are allowed, and sure enough it works for a small example like : > m <- matrix(c( 1, 1, 1, 2, 1, NA, 1, 1, 1, 2, 2, 2), nrow = 3, byrow = TRUE) > agnes(m) Call: agnes(x = m) Agglomerative coefficient: 0.1614168 Order of objects: [1] 1 2 3 Height (summary): Min. 1st Qu. Median Mean 3rd
2007 Nov 21
4
NA values
Hi all! I am new to R and I would like to ask you the following question:How can I substitute the NA values with 0 in a data frame? I cannot find a command to check if a value is NA... Thank you very much! Eleni
2008 Jan 22
4
select repositories under linux
Hi all, I am trying to install the package "GEOquery" in unix. I have downloaded the standard version of R and this package is not contained in the default. I know that I can select repositories under windows but I don't know how to do it in unix. Does anyone have any idea on this? Thank you in advance, Eleni [[alternative HTML version deleted]]
2008 Feb 14
4
Kaplan Meier function
Hi all, I am trying to draw a Kaplan-Meier curve and I found online that Kaplan - Meier estimates are computed with a function called km in the event package. Is there an update for that because when I choose to download packages in R,. there is no package called event, even though I have selected all the repositories. Thanks in advance, Eleni [[alternative HTML version deleted]]
2007 Sep 28
2
simple matching with R
Hello! I am R beginner and I have a question obout a simple matching. I have to datasets that i read in with: MalVar29_37<-read.table("MalVar29_37.csv", sep = ";") FemVar29_37<-read.table("FemVar29_37.csv", sep = ";") They look like this and show binary variables: V1 V2 V3 V4 V5 V6 V7 V8 V9 1 0 0 0 0 0 1 0 0 0 2 0 0 0 0 0 1
2008 Mar 05
2
t.test & p-Value
Hello list, I am trying to apply the paired t.test between diseased and not diseased patients to identify genes that are more expressed in the one situation under the other. In order to retrieve the genes that are more expressed in the positive disease state I do: p.values<-c() for(i in 1:length(Significant[,1])){ p.values[i]<-try(t.test(positive[i,],negative[i,],alternative
2006 Apr 24
1
Problem with the cluster package
Hi everybody, I want to use the cluster package (Cluster Analysis Extended Rousseeuw et al.). I downloaded it from the CRAN and installed it on my linux system (fedora core 4). All seemed to be allright. But when trying to launch examples, I obtained the following message : > library(cluster) > data(votes.repub) > agn1 <- agnes(votes.repub, metric = "manhattan",
2010 Mar 09
4
Transfer R workspace on another PC
Hi list! I have recently tried to take my office work home, meaning that I tried to transfer my "... .RData" workspace from my PC on my laptop. The office PC runs on Windows XP and my laptop runs on Windows Vista. I have saved the workspace at the office PC and kept it in a usb drive. When I tried to open it on my laptop I got an error: "Fatal Error: Unable to restore saved data in
2002 Jan 24
2
Data input performance
Dear list, I'm brand new to R (started using it few days ago...), so sorry for possibly stupid question. Anyways, I'm using R to cluster my data. I do have the dissimilarity matrix as a text file, numbers separated by space. It's at its best something like 2300x2300 matrix. Now, it seems to me, that the process of importing the matrix into R is rather slow. For the peak size of
2013 Jun 09
1
agnes() in package cluster on R 2.14.1 and R 3.0.1
Dear R users, I discovered something strange using the function agnes() of the cluster package on R 3.0.1 and on R 2.14.1. Indeed, the clusterings obtained are different whereas I ran exactly the same code. I quickly looked at the source code of the function and I discovered that there was an important change: agnes() in R 2.14.1 used a FORTRAN code whereas agnes() in R 3.0.1 uses a C code.
2007 Oct 10
3
as.dist with diagonal unequal zero
Hello and sorry that I still haven?t found a solution for my problem. I need to extract the lower and upper triangle from a square matrix including the diagonal. This diagonal is not zero in that special case. I tried with as.dist w<-as.dist(w, diag = TRUE) > w 1 2 3 4 5 1 0 2 2 0 3 3 8 0 4 4 9 14 0 5 5 10 15 20 0 but found no way to keep the diagonal that is in the
2009 Dec 02
1
Ridge regression
Dear list, I have a couple of questions concerning ridge regression. I am using the lm.ridge(...) function in order to fit a model to my microarray data. Thus *model=lm.ridge(...)* I retrieve some coefficients and some scales for each gene. First of all, I would like to ask: the real coefficients of the model are not included in the first argument of the output but in the result of coef(model),
2008 Jan 31
3
Memory problem?
Hello R users, I am trying to run a cox model for the prediction of relapse of 80 cancer tumors, taking into account the expression of 17000 genes. The data are large and I retrieve an error: "Cannot allocate vector of 2.4 Mb". I increase the memory.limit to 4000 (which is the largest supported by my computer) but I still retrieve the error because of other big variables that I have in
2007 Jun 20
1
Dissimilarity
Hello Stephen, I am happy that you help me. Thanks a million. It is a good feeling that you confirm my assumption that dsvdis is not able to deal with missing data, because it says me that I am not completely incapable. Okay now I have the problem what to do. I used this function ´cause there is an option to weight columns differently what I haven´t found in other functions. But now I
2008 May 15
1
metaMDS using Dissimilarity matrix
Hello R-user community! I am running R 2.7.0 on a Power Book (Tiger). (I am still R and statistics beginner) Presently I try to run the function metaMDS (vegan) using an existing dissimilarity-matrix. As I would like to start with this matrix I thought I could just give the matrix using the x= -argument Test<-metaMDS(x=Dist.Gower) Fehler in inherits(comm, "dist") :
2005 May 30
2
How to access to sum of dissimilarities in CLARA
Dear All , Since dissimilarity is one of quality measures in clustering , I'm trying to access to the sum of dissimilarity as a whole measure. But after running my data using CLARA I obtain : 1128 dissimilarities, summarized : Min. 1st Qu. Median Mean 3rd Qu. Max. 0.033155 0.934630 2.257000 2.941600 4.876600 8.943700 But I can not find the sum of dissimilarity.How can i
2007 Jun 20
1
How to extract diagonals
Hello, I am using Mac OS X on a power book and R 2.5.0 I try to extract a diagonal from a dissimilarity matrix made with dsvdis, with this code: diag(DiTestRR) But I get this error message: Fehler in array(0, c(n, p)) : 'dim' spezifiziert ein zu großes Array english: Error in array(0, c(n, p)) : 'dim' specifies a too big array. Is there a limit to extract diagonals? I
2006 Mar 29
6
which function to use to do classification
Dear All, I have a data, suppose it is an N*M matrix data. All I want is to classify it into, let see, 3 classes. Which method(s) do you think is(are) appropriate for this purpose? Any reference will be welcome! Thanks! Best, Baoqiang Cao