Fast, linear time, m-adic hierarchical clustering for search and retrieval using the Baire metric, with linkages to generalized ultrametrics, hashing, formal concept analysis, and precision of data measurement

F. Murtagh, P. Contreras

Research output: Contribution to journalArticle

7 Citations (Scopus)

Abstract

We describe many vantage points on the Baire metric and its use in clustering data, or its use in preprocessing and structuring data in order to support search and retrieval operations. In some cases, we proceed directly to clusters and do not directly determine the distances. We show how a hierarchical clustering can be read directly from one pass through the data. We offer insights also on practical implications of precision of datameasurement. As a mechanism for treating multidimensional data, including very high dimensional data, we use random projections.

Original languageEnglish
Pages (from-to)46-56
Number of pages11
JournalP-Adic Numbers, Ultrametric Analysis, and Applications
Volume4
Issue number1
DOIs
Publication statusPublished - 1 Jan 2012
Externally publishedYes

    Fingerprint

Cite this