Population recovery capabilities of 35 cluster analysis methods

J Clin Psychol. 1993 Jul;49(4):459-70. doi: 10.1002/1097-4679(199307)49:4<459::aid-jclp2270490402>3.0.co;2-p.

Abstract

Comparative evaluation of population recovery capabilities of 35 cluster analysis methods defined by different combinations of 5 profile similarity measures and 7 agglomeration rules was undertaken using artificial data that represented duplicate mixture samples from 4 latent populations. The latent population mean profiles differed primarily in elevation or in pattern parameters. Latent population sampling variances were controlled to provide two different levels of realistic overlap. The within-population distributions were multivariate normal with diagonal covariance structure. Across all conditions examined, complete linkage and Ward's minimum variance methods, used with Euclidian or city block interprofile distance measures, performed best. Single linkage, median, and centroid methods were substantially inferior for clustering individuals in accordance with true population memberships.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Analysis of Variance
  • Cluster Analysis*
  • Humans
  • Models, Statistical*
  • Multivariate Analysis
  • Population*