thr3ads.net - R help - [R] PAM Clustering Ignores Cluster Number Parameter [May 2011]

Dario Strbenac

2011-May-18 02:00 UTC

[R] PAM Clustering Ignores Cluster Number Parameter

I am using PAM with k = 10 clusters, but I only get one cluster ID for all my
observations. I couldn't find any discussion about this in the help file, or
mailing lists.

Is there a reasonable explanation for this result ?

cIDs <- pam(all, 10, cluster.only = TRUE, do.swap = FALSE)
> table(cIDs)
cIDs
    0 
16671

The matrix of observations can be found at :
http://129.94.136.7/file_dump/dario/all.obj

I'm using R version 2.13.0 (2011-04-13) on Platform:
x86_64-unknown-linux-gnu (64-bit) and have cluster_1.13.3.

--------------------------------------
Dario Strbenac
Research Assistant
Cancer Epigenetics
Garvan Institute of Medical Research
Darlinghurst NSW 2010
Australia

Martin Maechler

2011-May-24 06:50 UTC

[R] pam() seems to ignore cluster number

>>>>> Dario Strbenac <D.Strbenac at garvan.org.au>
>>>>>     on Wed, 18 May 2011 12:00:11 +1000 writes:
    > I am using PAM with k = 10 clusters, but I only get one cluster
    > ID for all my observations. I couldn't find any discussion about
    > this in the help file, or mailing lists.  Is there a reasonable
    > explanation for this result ?

    > cIDs <- pam(all, 10, cluster.only = TRUE, do.swap = FALSE)
    >> table(cIDs)
    > cIDs
    > 0 
    > 16671

    > The matrix of observations can be found at :
    > http://129.94.136.7/file_dump/dario/all.obj

For the mailing list archives:

Dario's data contained so many NA's that some of the computed
dissimilarities "had to be" NA as well.
Had he used either
    pam(all, 10)
or
    pam(all, 10, do.swap = FALSE)

he would have got the error message

   "No clustering performed, NAs in the computed dissimilarity matrix."

But because of  'cluster.only=TRUE' 
*and* because of a lapsus of the 'cluster' maintainer (me),
pam()  returned without the error message in this case.

The next release of R (or of 'cluster') will give the error
message also in the case of 'cluster.only=TRUE' .

Martin Maechler, ETH Zurich

    > I'm using R version 2.13.0 (2011-04-13) on Platform:
    > x86_64-unknown-linux-gnu (64-bit) and have cluster_1.13.3.

    > --------------------------------------
    > Dario Strbenac
    > Research Assistant
    > Cancer Epigenetics
    > Garvan Institute of Medical Research
    > Darlinghurst NSW 2010
    > Australia
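
For readers who reach this thread with the same symptom, the check below may help. It is only a minimal sketch on made-up toy data, not Dario's matrix: it uses base R's dist() to show how a pair of rows with no jointly observed column yields an NA dissimilarity, and then filters such rows before calling pam(). The object names and the "at least 2 observed values per row" threshold are assumptions chosen for illustration; a suitable filter depends on the data.

```r
library(cluster)

# Hypothetical toy data: rows 1 and 2 share no column in which both are
# observed, so the dissimilarity between them cannot be computed.
x <- rbind(c( 1, NA, NA, NA),
           c(NA,  2,  3,  4),
           c( 1,  2,  3,  4),
           c( 5,  6,  7,  8),
           c( 5,  6,  7,  9))

# dist() skips missing values pairwise and returns NA when nothing is left.
d <- dist(x)
has_na <- any(is.na(d))

# Locate the offending pairs (each pair appears twice, as (i, j) and (j, i)).
na_pairs <- which(is.na(as.matrix(d)), arr.ind = TRUE)

# One possible remedy (an assumption, not advice from the thread): drop rows
# with too few observed values, then re-check before clustering.
keep <- rowSums(!is.na(x)) >= 2
x_ok <- x[keep, , drop = FALSE]
still_na <- any(is.na(dist(x_ok)))

if (!still_na) {
  ids <- pam(x_ok, 2, cluster.only = TRUE)
  print(table(ids))
}
```

If has_na is TRUE for the real data, pam() cannot cluster it as is, regardless of the value of k.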
