TY - JOUR
T1 - Generating Compressed Counterfactual Hard Negative Samples for Graph Contrastive Learning
AU - Yang, Haoran
AU - Chen, Hongxu
AU - Zhao, Xiangyu
AU - Zhang, Sixiao
AU - Sun, Xiangguo
AU - Li, Qian
AU - Yin, Hongzhi
AU - Xu, Guandong
N1 - Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).
PY - 2026/2/9
Y1 - 2026/2/9
N2 - Graph contrastive learning (GCL) relies on acquiring high-quality positive and negative samples to learn the structural semantics of the input graph. Previous approaches typically sampled negative samples from the same training batch or an irrelevant external graph. However, this approach is limited by the problem of sampling false negatives. To address this limitation, this paper introduces a novel method called CGC. CGC uses a counterfactual mechanism to generate hard negative samples that are similar to positive samples but have different semantics. However, CGC faces the challenge of high space complexity due to the storage requirements of augmented graphs. To overcome this challenge, an expansion of CGC is proposed in this paper, called CCGC (compressed CGC), which incorporates a compression module using knowledge distillation techniques to compress the features of augmented graphs. The effectiveness of the proposed method is demonstrated through satisfactory results on multiple datasets, outperforming traditional unsupervised graph learning methods and state-of-the-art GCL methods. Supplementary experiments are conducted to compare CGC and CCGC at different compression ratios. © 2026 The Author(s). CAAI Transactions on Intelligence Technology published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology and Chongqing University of Technology.
AB - Graph contrastive learning (GCL) relies on acquiring high-quality positive and negative samples to learn the structural semantics of the input graph. Previous approaches typically sampled negative samples from the same training batch or an irrelevant external graph. However, this approach is limited by the problem of sampling false negatives. To address this limitation, this paper introduces a novel method called CGC. CGC uses a counterfactual mechanism to generate hard negative samples that are similar to positive samples but have different semantics. However, CGC faces the challenge of high space complexity due to the storage requirements of augmented graphs. To overcome this challenge, an expansion of CGC is proposed in this paper, called CCGC (compressed CGC), which incorporates a compression module using knowledge distillation techniques to compress the features of augmented graphs. The effectiveness of the proposed method is demonstrated through satisfactory results on multiple datasets, outperforming traditional unsupervised graph learning methods and state-of-the-art GCL methods. Supplementary experiments are conducted to compare CGC and CCGC at different compression ratios. © 2026 The Author(s). CAAI Transactions on Intelligence Technology published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology and Chongqing University of Technology.
KW - artificial neural networks
KW - data mining
KW - graph contrastive learning
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:001684930200001
UR - https://www.scopus.com/pages/publications/105029718837
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-105029718837&origin=recordpage
U2 - 10.1049/cit2.70102
DO - 10.1049/cit2.70102
M3 - RGC 21 - Publication in refereed journal
SN - 2468-6557
JO - CAAI Transactions on Intelligence Technology
JF - CAAI Transactions on Intelligence Technology
ER -