TY - GEN
T1 - A Debiased Graph Clustering Approach Using Dual Contrastive Learning
AU - Gao, Kuang
AU - Chen, Mukun
AU - Liu, Chuang
AU - Xue, Shan
AU - Qiu, Zhenyu
AU - Ren, Ting
AU - Jia, Xiaohua
AU - Hu, Wenbin
PY - 2024
Y1 - 2024
N2 - Node and graph-level clustering hold considerable significance for a wide range of applications, including drug target identification and protein function prediction. Recently, contrastive learning has surpassed numerous unsupervised learning methods and become increasingly useful for various deep clustering procedures, achieving commendable results. However, two primary obstacles impede further deployment of graph contrastive clustering: (1) its inherent tendency to separate node representations, which contradicts the clustering objective of forming meaningful groups and impedes effective cluster creation, and (2) the occurrence of false negative samples, which similarly obstructs cluster formation. Hence, this paper proposes a novel graph clustering algorithm, which employs a dual contrastive learning approach, encompassing element and cluster contrasts, and a strategy for debiasing false negative samples. The proposed algorithm utilizes element-level contrastive learning on embeddings derived from the encoder, integrating detailed node or graph characteristics. Then, clustering and cluster-level contrastive learning are executed in the embedding space to refine the results. Furthermore, the algorithm effectively addresses the potential false negatives and imbalanced prediction challenges during the dual-contrast process by implementing an optimization mechanism based on reliable results, thereby enhancing the clustering performance. Rigorous experiments across three node and graph-level benchmarks validate our proposed algorithm's efficacy. © 2024 IEEE.
AB - Node and graph-level clustering hold considerable significance for a wide range of applications, including drug target identification and protein function prediction. Recently, contrastive learning has surpassed numerous unsupervised learning methods and become increasingly useful for various deep clustering procedures, achieving commendable results. However, two primary obstacles impede further deployment of graph contrastive clustering: (1) its inherent tendency to separate node representations, which contradicts the clustering objective of forming meaningful groups and impedes effective cluster creation, and (2) the occurrence of false negative samples, which similarly obstructs cluster formation. Hence, this paper proposes a novel graph clustering algorithm, which employs a dual contrastive learning approach, encompassing element and cluster contrasts, and a strategy for debiasing false negative samples. The proposed algorithm utilizes element-level contrastive learning on embeddings derived from the encoder, integrating detailed node or graph characteristics. Then, clustering and cluster-level contrastive learning are executed in the embedding space to refine the results. Furthermore, the algorithm effectively addresses the potential false negatives and imbalanced prediction challenges during the dual-contrast process by implementing an optimization mechanism based on reliable results, thereby enhancing the clustering performance. Rigorous experiments across three node and graph-level benchmarks validate our proposed algorithm's efficacy. © 2024 IEEE.
UR - http://www.scopus.com/inward/record.url?scp=85210234137&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85210234137&origin=recordpage
U2 - 10.1109/ICWS62655.2024.00143
DO - 10.1109/ICWS62655.2024.00143
M3 - RGC 32 - Refereed conference paper (with host publication)
T3 - Proceedings of the IEEE International Conference on Web Services, ICWS
SP - 1198
EP - 1205
BT - Proceedings - 2024 IEEE International Conference on Web Services (IEEE ICWS 2024)
PB - IEEE
T2 - 2024 IEEE International Conference on Web Services, ICWS 2024
Y2 - 7 July 2024 through 13 July 2024
ER -