TY - JOUR
T1 - OCEAN
T2 - A Non-Conventional Parameter Free Clustering Algorithm Using Relative Densities of Categories
AU - Gheyas, Iffat
AU - Parkinson, Simon
AU - Khan, Saad
N1 - Publisher Copyright:
© 2021 World Scientific Publishing Company.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2021/4/1
Y1 - 2021/4/1
N2 - In this paper, we propose a fully autonomous density-based clustering algorithm named 'Ocean', which is inspired by the oceanic landscape and phenomena that occur in it. Ocean is an improvement over conventional algorithms regarding both distance metric and the clustering mechanism. Ocean defines the distance between two categories as the difference in the relative densities of categories. Unlike existing approaches, Ocean neither assigns the same distance to all pairs of categories, nor assigns arbitrary weights to matches and mismatches between categories that can lead to clustering errors. Ocean uses density ratios of adjacent regions in multidimensional space to detect the edges of the clusters. Ocean is robust against clusters of identical patterns. Unlike conventional approaches, Ocean neither makes any assumption regarding the data distribution within clusters, nor requires tuning of free parameters. Empirical evaluations demonstrate improved performance of Ocean over existing approaches.
AB - In this paper, we propose a fully autonomous density-based clustering algorithm named 'Ocean', which is inspired by the oceanic landscape and phenomena that occur in it. Ocean is an improvement over conventional algorithms regarding both distance metric and the clustering mechanism. Ocean defines the distance between two categories as the difference in the relative densities of categories. Unlike existing approaches, Ocean neither assigns the same distance to all pairs of categories, nor assigns arbitrary weights to matches and mismatches between categories that can lead to clustering errors. Ocean uses density ratios of adjacent regions in multidimensional space to detect the edges of the clusters. Ocean is robust against clusters of identical patterns. Unlike conventional approaches, Ocean neither makes any assumption regarding the data distribution within clusters, nor requires tuning of free parameters. Empirical evaluations demonstrate improved performance of Ocean over existing approaches.
KW - Autonomous clustering
KW - Categorical data
KW - Density based clustering
KW - Unsupervised learning
KW - Distance metric
KW - distance metric
KW - density-based clustering
KW - categorical data
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85098556765&partnerID=8YFLogxK
U2 - 10.1142/S0218001421500178
DO - 10.1142/S0218001421500178
M3 - Article
VL - 35
JO - International Journal of Pattern Recognition and Artificial Intelligence
JF - International Journal of Pattern Recognition and Artificial Intelligence
SN - 0218-0014
IS - 5
M1 - 2150017
ER -