Structure of hierarchic clusterings: implications for information retrieval and for multivariate data analysis

Research output: Contribution to journalArticle

17 Citations (Scopus)

Abstract

Hierarchic clustering methods may be used to condense information for a user, as they are in multivariate data analysis, or to achieve computational advantages, as they are in information retrieval. The structure of the hierarchic classification produced has a direct bearing on the effectiveness and utility of using cluster analysis, yet this important feature of the classification has only been implicitly referred to in the literature to date. In this study, three different coefficients are defined, each of which quantify the symmetry-asymmetry (balancedness-unbalancedness) of hierarchic clusterings on a scale from 0 to 1. Using examples of data from the areas of information retrieval and of multivariate data analysis, a number of hierarchic clustering methods are discussed in terms of the hierarchies they produce.

Original languageEnglish
Pages (from-to)611-617
Number of pages7
JournalInformation Processing and Management
Volume20
Issue number5-6
DOIs
Publication statusPublished - 1 Jan 1984
Externally publishedYes

    Fingerprint

Cite this