Advancing the PAM Algorithm to Semi-Supervised k-Medoids Clustering

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

JÁNOŠOVÁ Miriama LANG Andreas BUDÍKOVÁ Petra SCHUBERT Erich DOHNAL Vlastislav

Year of publication 2024
Type Article in Proceedings
Conference 17th International Conference on Similarity Search and Applications (SISAP)
MU Faculty or unit

Faculty of Informatics

Citation
Web https://link.springer.com/chapter/10.1007/978-3-031-75823-2_19
Doi http://dx.doi.org/10.1007/978-3-031-75823-2_19
Keywords semi-supervised clustering;k-medoids;partitioning around medoids;FasterPAM;semi-supervised classification;DISA;LMI
Attached files
Description The analysis of complex, weakly labeled data is increasingly popular, presenting unique challenges. Traditional unsupervised clustering aims to uncover interrelated sets of objects using feature-based similarity of the objects, but this approach often hits its limits for complex multimedia data. Thus, semi-supervised clustering that exploits small amounts of labeled training data has gained traction recently. % In this paper, we propose LabeledPAM, a semi-supervised extension of FasterPAM, a state-of-the-art k-medoids clustering algorithm. Our approach is applicable in semi-supervised classification tasks, where labels are assigned to clusters with minimal labeled data, as well as in semi-supervised clustering scenarios, identifying new clusters with unknown labels. We evaluate our proposal against other semi-supervised clustering techniques suitable for arbitrary distances, demonstrating its efficacy and versatility.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.