Copula Guided Parallel Gibbs Sampling for Nonparametric and Coherent Topic Discovery (Extended Abstract)

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

View graph of relations

Author(s)

  • Lihui Lin
  • Yanghui Rao
  • Haoran Xie
  • Jian Yin
  • Fu Lee Wang
  • Qing Li

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 39th International Conference on Data Engineering (ICDE 2023)
PublisherInstitute of Electrical and Electronics Engineers, Inc.
Pages3823-3824
ISBN (electronic)979-8-3503-2227-9
Publication statusPublished - 2023

Publication series

NameProceedings - International Conference on Data Engineering
Volume2023-April
ISSN (Print)1084-4627

Conference

Title39th IEEE International Conference on Data Engineering (ICDE 2023)
LocationMarriott Anaheim
PlaceUnited States
CityAnaheim
Period3 - 7 April 2023

Abstract

In terms of the generative process, the Gamma-Gamma-Poisson Process (G2PP) is equivalent to the nonparametric topic model of Hierarchical Dirichlet Process (HDP). Considering the high computational cost of estimating parameters in HDP, a parallel G2PP was developed to generate topics efficiently via multi-threading. Unfortunately, the above model needs to predefine the number of topics. To address this issue, we first propose a Topic Self-Adaptive Model (TSAM) for nonparametric and parallel topic discovery. In TSAM, a monitor-executor mechanism is developed to manage the global topic information using a hierarchical structure of threads. Based on the apparatus of copulas, we further extend our TSAM to TSAMcop for coherent topic modeling by exploiting a copula guided parallel Gibbs sampling algorithm. Extensive experiments validate the effectiveness of both TSAM and TSAMcop. © 2023 IEEE.

Research Area(s)

  • copulas, parallel gibbs sampling, topic modelling

Citation Format(s)

Copula Guided Parallel Gibbs Sampling for Nonparametric and Coherent Topic Discovery (Extended Abstract). / Lin, Lihui; Rao, Yanghui; Xie, Haoran et al.
Proceedings - 2023 IEEE 39th International Conference on Data Engineering (ICDE 2023). Institute of Electrical and Electronics Engineers, Inc., 2023. p. 3823-3824 (Proceedings - International Conference on Data Engineering; Vol. 2023-April).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review