Skip to main navigation Skip to search Skip to main content

Distributed Subgraph Matching on Big Knowledge Graphs Using Pregel

  • Qiang XU
  • , Xin WANG*
  • , Jianxin LI
  • , Qingpeng ZHANG
  • , Lele CHAI
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

67 Downloads (CityUHK Scholars)

Abstract

With RDF becoming the de facto standard for representing knowledge graphs, it is indispensable to develop scalable subgraph matching algorithms over big RDF graphs stored in distributed clusters. In this paper, we propose a novel distributed subgraph matching method SP-Tree, using the Pregel model, to answer subgraph matching queries on big RDF graphs. In our method, the query graph is transformed to a variant spanning tree based on the shortest paths. Two optimization techniques are proposed to improve the efficiency of our algorithms. One employs RDF shapes to filter out local computations and messages passed, the other postpones the Cartesian product operations in the matching process to reduce intermediate results. The extensive experiments on both synthetic and real-world datasets show that our SP-Tree subgraph matching method outperforms the state-of-the-art methods by an order of magnitude.
Original languageEnglish
Article number8807148
Pages (from-to)116453-116464
JournalIEEE Access
Volume7
Online published20 Aug 2019
DOIs
Publication statusPublished - 2019

Research Keywords

  • Subgraph matching
  • distributed
  • knowledge graphs
  • RDF
  • Pregel
  • SPARQL

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Distributed Subgraph Matching on Big Knowledge Graphs Using Pregel'. Together they form a unique fingerprint.

Cite this