Loading...
Proximity curves for potential-based clustering
Csenki, Attila ; ; Torgunov, Denis ; Micic, Natasha
Csenki, Attila
Torgunov, Denis
Micic, Natasha
Publication Date
2020
End of Embargo
Supervisor
Rights
© 2019 The Author(s). This is an Open Access article distributed under the Creative Commons CC-BY license (https://creativecommons.org/licenses/by/4.0/)
Peer-Reviewed
Yes
Open Access status
openAccess
Accepted for publication
2019
Institution
Department
Awarded
Embargo end date
Files
Loading...
csenki_et_al_2020
Adobe PDF, 1.8 MB
Additional title
Abstract
The concept of proximity curve and a new algorithm are proposed for obtaining clusters in a finite set of data points in the finite dimensional Euclidean space. Each point is endowed with a potential constructed by means of a multi-dimensional Cauchy density, contributing to an overall anisotropic potential function. Guided by the steepest descent algorithm, the data points are successively visited and removed one by one, and at each stage the overall potential is updated and the magnitude of its local gradient is calculated. The result is a finite sequence of tuples, the proximity curve, whose pattern is analysed to give rise to a deterministic clustering. The finite set of all such proximity curves in conjunction with a simulation study of their distribution results in a probabilistic clustering represented by a distribution on the set of dendrograms. A two-dimensional synthetic data set is used to illustrate the proposed potential-based clustering idea. It is shown that the results achieved are plausible since both the ‘geographic distribution' of data points as well as the ‘topographic features' imposed by the potential function are well reflected in the suggested clustering. Experiments using the Iris data set are conducted for validation purposes on classification and clustering benchmark data. The results are consistent with the proposed theoretical framework and data properties, and open new approaches and applications to consider data processing from different perspectives and interpret data attributes contribution to patterns.
Version
Published version
Citation
Csenki A, Neagu CD, Torgunov D et al (2020) Proximity curves for potential-based clustering. Journal of Classification. 37: 671-695.
Link to publisher’s version
Link to published version
Link to Version of Record
Type
Article
