TY - JOUR
T1 - A Fast Approximation Algorithm for the Top-K Group Betweenness Centrality
AU - Xu, Wenzheng
AU - Li, Jing
AU - Liang, Weifa
AU - Xu, Zichuan
AU - Peng, Jian
AU - Zhou, Pan
AU - Zhao, Cheng
AU - Yan, Binyu
AU - Jia, Xiaohua
AU - Yu, Jeffrey Xu
PY - 2026/6
Y1 - 2026/6
N2 - Betweenness centrality is one of the key centrality measures in many applications including community detections in biological networks, vulnerability detections in communication networks, misinformation filtering in social networks, etc. The top-K group betweenness centrality problem is to find a group of K nodes from a network so that the total fraction of shortest paths that pass through the K nodes is maximized. Existing studies proposed randomized sampling algorithms for the problem. We notice that the existing studies ensured that, the maximum deviation of the estimated centrality of every group from its expectation is no greater than a small given threshold for all potential groups with no more than K nodes, thereby generating too many samples, as the number of such groups is prohibitively large. In contrast, in this paper we first devise a novel algorithm that enables to estimate the centrality of a tentative group adaptively, and the algorithm immediately stops once the centrality is large enough; otherwise, the algorithm uses more samples to find a better group. We then theoretically show that, even the proposed algorithm uses much less samples, it still can find a performance-guaranteed group with high probability. Experimental results with real-world networks demonstrate that the number of samples used by the proposed algorithm is up to 36 times smaller than the state-of-the-art, while the centrality of the group found by the algorithm is no more than 4.5% smaller than the latter. © 1989-2012 IEEE.
AB - Betweenness centrality is one of the key centrality measures in many applications including community detections in biological networks, vulnerability detections in communication networks, misinformation filtering in social networks, etc. The top-K group betweenness centrality problem is to find a group of K nodes from a network so that the total fraction of shortest paths that pass through the K nodes is maximized. Existing studies proposed randomized sampling algorithms for the problem. We notice that the existing studies ensured that, the maximum deviation of the estimated centrality of every group from its expectation is no greater than a small given threshold for all potential groups with no more than K nodes, thereby generating too many samples, as the number of such groups is prohibitively large. In contrast, in this paper we first devise a novel algorithm that enables to estimate the centrality of a tentative group adaptively, and the algorithm immediately stops once the centrality is large enough; otherwise, the algorithm uses more samples to find a better group. We then theoretically show that, even the proposed algorithm uses much less samples, it still can find a performance-guaranteed group with high probability. Experimental results with real-world networks demonstrate that the number of samples used by the proposed algorithm is up to 36 times smaller than the state-of-the-art, while the centrality of the group found by the algorithm is no more than 4.5% smaller than the latter. © 1989-2012 IEEE.
KW - approximation algorithms
KW - Group betweenness centrality
KW - randomized algorithms
UR - https://www.scopus.com/pages/publications/105033884783
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-105033884783&origin=recordpage
U2 - 10.1109/TKDE.2026.3677004
DO - 10.1109/TKDE.2026.3677004
M3 - RGC 21 - Publication in refereed journal
SN - 1041-4347
VL - 38
SP - 3335
EP - 3348
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 6
ER -