Communication-efficient algorithms for parallel latent Dirichlet allocation

Jian-Feng Yan, Jia Zeng*, Yang Gao, Zhi-Qiang LIU

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

4 Citations (Scopus)

Abstract

Latent Dirichlet allocation (LDA) is a popular topic modeling method which has found many multimedia applications, such as motion analysis and image categorization. Communication cost is one of the main bottlenecks for large-scale parallel learning of LDA. To reduce communication cost, we introduce Zipf’s law and propose novel parallel LDA algorithms that communicate only partial important information at each learning iteration. The proposed algorithms are much more efficient than the current state-of-the art algorithms in both communication and computation costs. Extensive experiments on large-scale data sets demonstrate that our algorithms can greatly reduce communication and computation costs to achieve a better scalability.
Original languageEnglish
Pages (from-to)3-11
JournalSoft Computing
Volume19
Issue number1
Online published18 Jul 2014
DOIs
Publication statusPublished - Jan 2015

Research Keywords

  • Belief propagation
  • Gibbs sampling
  • Latent Dirichlet allocation
  • Parallel learning
  • Zipf’s law

Fingerprint

Dive into the research topics of 'Communication-efficient algorithms for parallel latent Dirichlet allocation'. Together they form a unique fingerprint.
  • GRF: Hypergraphical Models for Object Tagging

    LIU, Z.-Q. (Principal Investigator / Project Coordinator)

    1/11/1331/08/15

    Project: Research

Cite this