Skip to main navigation Skip to search Skip to main content

Scalable Spectral Clustering for Overlapping Community Detection in Large-Scale Networks

Hadrien Van Lierde, Tommy W. S. Chow*, Guanrong Chen

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

While the majority of methods for community detection produce disjoint communities of nodes, most real-world networks naturally involve overlapping communities. In this paper, a scalable method for the detection of overlapping communities in large networks is proposed. The method is based on an extension of the notion of normalized cut to cope with overlapping communities. A spectral clustering algorithm is formulated to solve the related cut minimization problem. When available, the algorithm may take into account prior information about the likelihood for each node to belong to several communities. This information can either be extracted from the available metadata or from node centrality measures. We also introduce a hierarchical version of the algorithm to automatically detect the number of communities. In addition, a new benchmark model extending the stochastic blockmodel for graphs with overlapping communities is formulated. Our experiments show that the proposed spectral method outperforms the state-of-the-art algorithms in terms of computational complexity and accuracy on our benchmark graph model and on five real-world networks, including a lexical network and large-scale social networks. The scalability of the proposed algorithm is also demonstrated on large synthetic graphs with millions of nodes and edges.
Original languageEnglish
Article number8607015
Pages (from-to)754-767
JournalIEEE Transactions on Knowledge and Data Engineering
Volume32
Issue number4
Online published10 Jan 2019
DOIs
Publication statusPublished - Apr 2020

Research Keywords

  • Community detection algorithm
  • overlapping communities
  • real-world network
  • spectral clustering
  • Normalized Cut

Fingerprint

Dive into the research topics of 'Scalable Spectral Clustering for Overlapping Community Detection in Large-Scale Networks'. Together they form a unique fingerprint.

Cite this