A Clustering Comparison Measure Using Density Profiles and its Application to the Discovery of Alternate Clusterings

TitleA Clustering Comparison Measure Using Density Profiles and its Application to the Discovery of Alternate Clusterings
Publication TypeJournal Article
Year of Publication2010
AuthorsEric Bae, James Bailey, Guozhu Dong
JournalData Mining and Knowledge Discovery
Keywordsalternate clustering algorithms, alternate clusterings, cluster analysis, clustering, clustering comparison, clustering similarity
Abstract

Data clustering is a fundamental and very popular method of data analysis. Its subjective nature, however, means that different clustering algorithms or different parameter settings can produce widely varying and sometimes conflicting results. This has led to the use of clustering comparison measures to quantify the degree of similarity between alternative clusterings. Existing measures, though, can be limited in their ability to assess similarity and sometimes generate unintuitive results. They also cannot be applied to compare clusterings which contain different data points, an activity which is important for scenarios such as data stream analysis. In this paper, we introduce a new clustering similarity measure, known as ADCO, which aims to address some limitations of existing measures, by allowing greater flexibility of comparison via the use of density profiles to characterize a clustering. In particular, it adopts a 'data mining style' philosophy to clustering comparison, whereby two clusterings are considered to be more similar, if they are likely to give rise to similar types of prediction models. Furthermore, we show that this new measure can be applied as a highly effective objective function within a new algorithm, known as MAXIMUS, for generating alternate clusterings.

Full Text

Eric Bae, James Bailey, Guozhu Dong. 'A Clustering Comparison Measure Using Density Profiles and its Application to the Discovery of Alternate Clusterings'. Data mining and knowledge discovery. Volume 21, Issue 3, 2010.
research center: Data Mining Lab
publisher: Springer
year: 2010
hasEditor: Charu Aggarwal
hasURL: http://knoesis.wright.edu/library/download/dmkd.pdf
hasBookTitle: Data Mining and Knowledge Discovery

Related Files: