Abstract
Deep clustering, an unsupervised technique independent of labels, necessitates tailored supervision for model training. Prior methods explore supervision like similarity and pseudo labels, yet overlook individual sample training analysis. Our study correlates sample stability during unsupervised training with clustering accuracy and network memorization on a per-sample basis. Unstable representations across epochs often lead to mispredictions, indicating difficulty in memorization and atypicality. Leveraging these findings, we introduce supervision signals for the first time based on sample stability at the representation level. Our proposed strategy serves as a versatile tool to enhance various deep clustering techniques. Experiments across benchmark datasets showcase that incorporating sample stability into training can improve the performance of deep clustering. The code is available at https://github.com/LZX-001/LFSS. Copyright 2025 by the author(s).
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 42nd International Conference on Machine Learning |
| Editors | Aarti Singh, Maryam Fazel, Daniel Hsu, Simon Lacoste-Julien, Felix Berkenkamp, Tegan Maharaj, Kiri Wagstaff, Jerry Zhu |
| Publisher | ML Research Press |
| Pages | 34904-34919 |
| Number of pages | 16 |
| Publication status | Online published - 6 Oct 2025 |
| Event | 42nd International Conference on Machine Learning (ICML 2025) - Vancouver Convention Center, Vancouver, Canada Duration: 13 Jul 2025 → 19 Jul 2025 https://icml.cc/Conferences/2025 |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Publisher | ML Research Press |
| Volume | 267 |
| ISSN (Electronic) | 2640-3498 |
Conference
| Conference | 42nd International Conference on Machine Learning (ICML 2025) |
|---|---|
| Abbreviated title | ICML 2025 |
| Place | Canada |
| City | Vancouver |
| Period | 13/07/25 → 19/07/25 |
| Internet address |
Funding
This work was in part supported by the National Natural Science Foundation of China under Grant U24A20322 and 62422118, and in part supported by the Hong Kong UGC under grant UGC/FDS11/E02/22 and UGC/FDS11/E03/24. This research work is supported by the Big Data Computing Center of Southeast University.
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Learning from Sample Stability for Deep Clustering'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver