Abstract
In many applications, we need to measure similarity between nodes in a large network based on features of their neighborhoods. Although in-network node similarity based on proximity has been well investigated, surprisingly, measuring in-network node similarity based on neighborhoods remains a largely untouched problem in literature. One challenge is that in different applications we may need different measurements that manifest different meanings of similarity. Furthermore, we often want to make trade-offs between specificity of neighborhood matching and efficiency. In this paper, we investigate the problem in a principled and systematic manner. We develop a unified parametric model and a series of four instance measures. Those instance similarity measures not only address a spectrum of various meanings of similarity, but also present a series of trade-offs between computational cost and strictness of matching between neighborhoods of nodes being compared. By extensive experiments and case studies, we demonstrate the effectiveness of the proposed model and its instances.
| Original language | English |
|---|---|
| Pages (from-to) | 43-70 |
| Journal | Knowledge and Information Systems |
| Volume | 53 |
| Issue number | 1 |
| Online published | 17 Feb 2017 |
| DOIs | |
| Publication status | Published - Oct 2017 |
| Externally published | Yes |
Research Keywords
- Neighborhood pattern
- Neighborhood pattern matching
- Node similarity
- Random walk
Fingerprint
Dive into the research topics of 'Measuring in-network node similarity based on neighborhoods: a unified parametric approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver