thr3ads.net - similar to: "weighting variables using Gower with DAISY"

Displaying 20 results from an estimated 4000 matches similar to: "weighting variables using Gower with DAISY"

want to count 2 NULLS as dissimilar with DIANA/DAISY/GOWER

2012 Oct 18

I am using DIANA/DAISY/GOWER. Some of my categorical data include NULLs. When assessing dissimilarity, these NULLs are considered similar. I do not want these NULLs to contribute towards similarity. Instead, is there a way for each of these NULLs to be considered different, so that they contribute to dissimilarity and not similarity? Also, I do not want to change these NULLs in the data as I need them for

Why daisy() in cluster library failed to exclude NA when computing dissimilarity

2013 Dec 08

Hi, According to the documentation of the daisy function from cluster, it can compute dissimilarity when NA (missing) value(s) are present. http://stat.ethz.ch/R-manual/R-devel/library/cluster/html/daisy.html But why, when I tried this code

    library(cluster)
    x <- c(1.115,NA,NA,0.971,NA)
    y <- c(NA,1.006,NA,NA,0.645)
    df <- as.data.frame(rbind(x,y))
    daisy(df,metric="gower")

It gave this

Specify ID variable in daisy{cluster}

2011 Jun 16

Hi All - I am using the daisy function from the cluster library to create a dissimilarity matrix. I'm going to use that matrix to run a cluster analysis. My participants are identified with the variable, hhid. However, when I try to keep hhid in the dataset that I use to create the dissimilarity matrix, daisy uses it to create the matrix rather than ignoring it as an ID variable. I need to
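
One common way to approach this, sketched here under assumptions: leave the identifier out of the columns passed to daisy() and carry it as row names instead. The data frame and its `income` and `size` columns are made up for illustration; only the name `hhid` comes from the question.

```r
library(cluster)

# Hypothetical data: only the column name `hhid` comes from the question.
households <- data.frame(
  hhid   = c(101, 102, 103),
  income = c(10, 20, 40),
  size   = c(2, 4, 3)
)

# Carry the ID as row names so it can label the result
# without entering the dissimilarity computation.
rownames(households) <- households$hhid
features <- households[, setdiff(names(households), "hhid"), drop = FALSE]

d <- daisy(features, metric = "gower")
as.matrix(d)  # rows and columns should be labelled by hhid
```

The dissimilarities then depend only on the remaining columns, and the row names can be used afterwards to match cluster assignments back to participants.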

cutreeDynamic error

2013 Feb 26

I am having difficulty getting the dynamic tree cut package to work. Given the data table "myddtable"

    Length  Place  Color  Age
    5       H      Red    22
    4       A      Blue   20
    5       W      Green  24
    3       G      Red    22
    2       G      Blue   23
    6       W      Green  25
    5       A      Red    19
    4       H      Blue   23

I created a similarity matrix using DAISY and Gower metric and specified Place and Color columns as characters (since they are categorical variables) > dd.daisy<-daisy(myddtable, metric =

Gower distance between an individual and a population

2008 Oct 13

Hi the list, I need to compute the Gower distance between a specific individual and all the other individuals. The function DAISY from package cluster computes all the pairwise dissimilarities of a population. If the population is N individuals, that is around N^2 distances to compute. I need to compute the distance between a specific individual and all the other individuals, that is only N
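
The one-against-all computation described here can be sketched in base R, as an illustration only. The function name `gower_one_to_all` is hypothetical, and the sketch assumes numeric columns (scaled by their range) and nominal columns (simple matching); ordered factors and asymmetric binary variables are not handled. Variables missing in either row are skipped for that pair.

```r
# Hypothetical helper: Gower dissimilarity between row i of a data frame
# and every row, i.e. N values instead of all N*(N-1)/2 pairs.
gower_one_to_all <- function(x, i) {
  n <- nrow(x)
  num <- numeric(n)  # running sum of per-variable dissimilarities
  den <- numeric(n)  # number of variables observed in both rows
  for (j in seq_along(x)) {
    col <- x[[j]]
    if (is.numeric(col)) {
      # numeric: absolute difference scaled by the column range
      rng <- diff(range(col, na.rm = TRUE))
      d <- abs(col - col[i])
      if (rng > 0) d <- d / rng
    } else {
      # nominal: 0 if the categories match, 1 otherwise
      d <- as.numeric(as.character(col) != as.character(col[i]))
    }
    ok <- !is.na(d)
    num[ok] <- num[ok] + d[ok]
    den[ok] <- den[ok] + 1
  }
  num / den  # NaN where the two rows share no observed variable
}

# Made-up example data
flowers <- data.frame(len = c(1, 3, 5),
                      col = factor(c("red", "blue", "red")))
gower_one_to_all(flowers, 1)  # 0.00 0.75 0.50
```

Row 2 differs from row 1 by |3 - 1| / 4 = 0.5 on `len` and by 1 on `col`, giving (0.5 + 1) / 2 = 0.75; row 3 gives (1 + 0) / 2 = 0.5.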

How to perform clustering without removing rows where NA is present in R

2013 Dec 07

I have data that contain some NA values. What I want to do is to **perform clustering without removing rows** where the NA is present. I understand that the `gower` distance measure in `daisy` allows for such a situation. But why doesn't my code below work? __BEGIN__ # plot heat map with dendrogram together. library("gplots") library("cluster")

gower distance calculation

2006 Nov 17

Hello, I have 2 rows in a matrix and I want to calculate the Gower distance between the 2. How can I do it? I searched and found nothing that can help me; my program doesn't know the gdist function and I couldn't find it on the R help site. Can anyone help me please? Thank you all

daisy(): space allocation issue

2010 Aug 26

Hi, I'm trying to apply the function daisy() to a 10000x10 data.frame but I do not have enough space (error message: cannot allocate vector of length 1476173280). I didn't imagine I would not be able to work with a matrix of just 10000 observations... I have set --max-mem-size=2G in Rgui (I'm not able to set more space..) How can I solve this issue? Separating observations depending on
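
For a sense of scale, a back-of-the-envelope sketch: a dissimilarity object stores one double per pair of observations, that is n(n-1)/2 values of 8 bytes each. The helper name below is hypothetical. It only estimates the size of the final result; it does not account for working copies made during the computation, and it does not explain the specific length quoted in the error message.

```r
# Hypothetical helper: approximate size in MiB of the pairwise
# dissimilarities for n observations (one 8-byte double per pair).
dissimilarity_size_mib <- function(n) {
  n_pairs <- n * (n - 1) / 2
  n_pairs * 8 / 1024^2
}

dissimilarity_size_mib(10000)  # roughly 381 MiB for 49,995,000 pairs
```

Since the size grows with the square of n, doubling the number of observations roughly quadruples the memory needed for the result.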

variable type assignment in daisy

2010 Nov 06

Dear Rhelp, I did a daisy on 5 lifestyle variables, 3 of which were nominal and 2 were ordinal and assigned types “nominal” and “ordinal” for the variables, respectively. I got an output indicating their types as “I” for interval(?). Doing it on the Rdata example “flower” gave the same types in the output as the types they were assigned to. Why is this so? Below are the codes and outputs.

Two matrix loop

2011 Mar 24

Hi, I'm trying to create a distance matrix. And it works out somewhat ok. However, I suspect that there are some efficiency issues with my efforts. Plz have a look at this: donor <- matrix(c(3,1,2,3,3,1,4,3,5,1,3,2), ncol=4) receiver <- matrix(c(1,4,3,2,4,3,1,5,1,3,2,1,4,5,3,5,1,3,2,4,5,1,2,3,1,4,5,5,1,2,1,3,4,3,2,5,5,1,4,2,5,4,3,2), ncol=4) The above creates my two matrices. I have

Argument dendro must have class hclust - cutreeDynamic error

2013 Feb 25

error using daisy() in library(cluster). Bug?

2004 Aug 12

Hi, I'm using the cluster library to examine multivariate data. The data come from a connection to a postgres database, and I wrote a short R script to do the analysis. With the cluster version included in R 1.8.0, daisy worked well for my data, but now, when I call daisy, I obtain the following messages: --------- Error in if (any(sx == 0)) { : missing value where TRUE/FALSE needed In

Gower

2011 Dec 21

Does anyone know where I can find the commands for using Gower's method in daisy() from the cluster package on codominant molecular, quantitative, and multicategorical data "jointly"? Thank you. Note: please confirm receipt of this message. #################################### Adésio Ferreira Universidade Federal do Espírito Santo Centro de Ciências Agrárias Departamento

type in daisy

2006 Mar 20

Hi, I'm a PhD student and I want to use the function 'daisy' from the package 'cluster' to compute dissimilarities. My variables are of mixed types so I use the argument 'stand' in daisy to define the type of my variables. I have the following error message : Warning message: binary variable(s) 13, 16, 17, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35,

stepwise variable selection method wanted

2009 Jul 30

Hi List, I am looking for a variable selection procedure with a forward-backward selection method. Firstly, it is meant to work with the cophenetic correlation coefficient (CPCC) and intended to find the variable combination with the highest cophenetic correlation. Secondly, it is aimed at the Gower metric with Ward's method (though this could be easily extended), for categorical data. What I

cluster a distance(analogue)-object using agnes(cluster)

2008 Sep 02

I am trying to perform a clustering using an existing dissimilarity matrix that I calculated using distance (analogue). I tried two different things. One of them worked and one did not, and I don't understand why. Here is the code: not working example library(cluster) library(analogue) iris2<-as.data.frame(iris) str(iris2) 'data.frame': 150 obs. of 5 variables: $ Sepal.Length: num 5.1 4.9 4.7

Very Slow Gower Similarity Function

2005 Apr 18

Hello, I am a relatively new user of R. I have written a basic function to calculate the Gower similarity function. I was motivated to do so partly as an exercise in learning R, and partly because the existing option (vegdist in the vegan package) does not accept missing values. I think I have succeeded - my function gives me the correct values. However, now that I'm starting to use it with

calculating distances using Gower's coefficient on mixed variables.

2005 Sep 26

I want to compute the distances in a mixed variable matrix using the Gower coefficient. I understand it is possible to calculate distances in a matrix with mixed variables using the dudi.pco command. How would this work? Jorine

Cluster prediction from factor/numeric datasets

2007 Jul 23

Hi all, I have a dataset with numeric and factor columns of data which I developed a Gower Dissimilarity Matrix for (Daisy) and used Agglomerative Nesting (Agnes) to develop 20 clusters. I would like to use the 20 clusters to determine cluster membership for a new dataset (using predict) but cannot find a way to do this (no way to "predict" in the cluster package). I know I can use

2016 Apr 12

Dissimilarity matrix and number clusters determination

Hi, I already have a dissimilarity matrix and I am submitting the results to the elbow.obj method to get an optimal number of clusters. Am I reading the below output correctly that I should have 17 clusters? code: top150 <- sampleset[1:150,] {cluster1 <- daisy(top150 , metric = c("gower") , stand = TRUE , type = list(symm