Copula Guided Parallel Gibbs Sampling for Nonparametric and Coherent Topic Discovery (Extended Abstract)
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | Proceedings - 2023 IEEE 39th International Conference on Data Engineering (ICDE 2023) |
Publisher | Institute of Electrical and Electronics Engineers, Inc. |
Pages | 3823-3824 |
ISBN (electronic) | 979-8-3503-2227-9 |
Publication status | Published - 2023 |
Publication series
Name | Proceedings - International Conference on Data Engineering |
---|---|
Volume | 2023-April |
ISSN (Print) | 1084-4627 |
Conference
Title | 39th IEEE International Conference on Data Engineering (ICDE 2023) |
---|---|
Location | Marriott Anaheim |
Place | United States |
City | Anaheim |
Period | 3 - 7 April 2023 |
Link(s)
Abstract
In terms of the generative process, the Gamma-Gamma-Poisson Process (G2PP) is equivalent to the nonparametric topic model of Hierarchical Dirichlet Process (HDP). Considering the high computational cost of estimating parameters in HDP, a parallel G2PP was developed to generate topics efficiently via multi-threading. Unfortunately, the above model needs to predefine the number of topics. To address this issue, we first propose a Topic Self-Adaptive Model (TSAM) for nonparametric and parallel topic discovery. In TSAM, a monitor-executor mechanism is developed to manage the global topic information using a hierarchical structure of threads. Based on the apparatus of copulas, we further extend our TSAM to TSAMcop for coherent topic modeling by exploiting a copula guided parallel Gibbs sampling algorithm. Extensive experiments validate the effectiveness of both TSAM and TSAMcop. © 2023 IEEE.
Research Area(s)
- copulas, parallel gibbs sampling, topic modelling
Citation Format(s)
Copula Guided Parallel Gibbs Sampling for Nonparametric and Coherent Topic Discovery (Extended Abstract). / Lin, Lihui; Rao, Yanghui; Xie, Haoran et al.
Proceedings - 2023 IEEE 39th International Conference on Data Engineering (ICDE 2023). Institute of Electrical and Electronics Engineers, Inc., 2023. p. 3823-3824 (Proceedings - International Conference on Data Engineering; Vol. 2023-April).
Proceedings - 2023 IEEE 39th International Conference on Data Engineering (ICDE 2023). Institute of Electrical and Electronics Engineers, Inc., 2023. p. 3823-3824 (Proceedings - International Conference on Data Engineering; Vol. 2023-April).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review