Abstract
While the majority of methods for community detection produce disjoint communities of nodes, most real-world networks naturally involve overlapping communities. In this paper, a scalable method for the detection of overlapping communities in large networks is proposed. The method is based on an extension of the notion of normalized cut to cope with overlapping communities. A spectral clustering algorithm is formulated to solve the related cut minimization problem. When available, the algorithm may take into account prior information about the likelihood for each node to belong to several communities. This information can either be extracted from the available metadata or from node centrality measures. We also introduce a hierarchical version of the algorithm to automatically detect the number of communities. In addition, a new benchmark model extending the stochastic blockmodel for graphs with overlapping communities is formulated. Our experiments show that the proposed spectral method outperforms the state-of-the-art algorithms in terms of computational complexity and accuracy on our benchmark graph model and on five real-world networks, including a lexical network and large-scale social networks. The scalability of the proposed algorithm is also demonstrated on large synthetic graphs with millions of nodes and edges.
| Original language | English |
|---|---|
| Article number | 8607015 |
| Pages (from-to) | 754-767 |
| Journal | IEEE Transactions on Knowledge and Data Engineering |
| Volume | 32 |
| Issue number | 4 |
| Online published | 10 Jan 2019 |
| DOIs | |
| Publication status | Published - Apr 2020 |
Research Keywords
- Community detection algorithm
- overlapping communities
- real-world network
- spectral clustering
- Normalized Cut
Fingerprint
Dive into the research topics of 'Scalable Spectral Clustering for Overlapping Community Detection in Large-Scale Networks'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver