Abstract
Topic models such as Latent Dirichlet Allocation (LDA) and its variants are a type of statistical model for discovering latent topics. However, as previous research has shown, some topics generated by LDA may be uninterpretable and semantically incoherent because they contain irrelevant words. To improve the semantic quality of automatically discovered topics, we explore the distributional characteristics of words across topics to identify the topic-indiscriminate words that are responsible for low-quality topics. The main contribution of this paper is a novel framework, the Iterative Term Weighting Framework (ITWF), which can effectively identify and filter out topic-indiscriminate words from the discovered topics. In particular, the proposed framework first applies an entropy-based term weighting scheme and then adopts a novel iterative method to identify topic-indiscriminate words. To the best of our knowledge, our research is among the very few successful works that aim to enhance both the semantic coherence and the interpretability of LDA-based topic modeling methods. The experimental results show that the proposed framework improves the effectiveness of LDA as well as its variants.
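The idea of an entropy-based term weight can be illustrated with a small sketch. This is not the ITWF algorithm itself, whose details are not given in the abstract. It is a minimal, assumed version of the general idea: a word whose probability mass is spread almost evenly across topics has high entropy and is a candidate topic-indiscriminate word. The function names, the toy probabilities, and the `threshold` value are all hypothetical.

```python
import math


def topic_entropy(word_probs):
    """Normalized Shannon entropy of one word's probabilities across topics.

    word_probs holds p(word | topic) for each topic. The values are
    renormalized to sum to 1 so they describe how the word's mass is
    spread over topics. The result lies in [0, 1]; 1 means an even spread.
    """
    total = sum(word_probs)
    if total <= 0 or len(word_probs) < 2:
        return 0.0
    shares = [p / total for p in word_probs]
    entropy = -sum(s * math.log(s) for s in shares if s > 0)
    return entropy / math.log(len(word_probs))


def indiscriminate_words(topic_word, threshold=0.9):
    """Return, sorted, the words whose normalized entropy is >= threshold.

    topic_word maps each word to a list of p(word | topic), one per topic.
    threshold is a hypothetical cut-off, not a value taken from the paper.
    """
    return sorted(
        word
        for word, probs in topic_word.items()
        if topic_entropy(probs) >= threshold
    )


# Toy topic-word probabilities for three topics (made-up numbers).
toy = {
    "market": [0.30, 0.01, 0.01],  # concentrated in topic 0
    "gene": [0.01, 0.25, 0.01],    # concentrated in topic 1
    "also": [0.05, 0.05, 0.05],    # spread evenly over all topics
}
flagged = indiscriminate_words(toy)
```

Here `flagged` contains only `"also"`, the one word spread evenly over the three toy topics. The sketch covers a single scoring pass; the iterative step mentioned in the abstract is not reproduced.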
| Field | Value |
|---|---|
| Original language | English |
| Pages (from-to) | 248-260 |
| Journal | Neurocomputing |
| Volume | 350 |
| Online published | 10 Apr 2019 |
| DOIs | |
| Publication status | Published - 20 Jul 2019 |
Research Keywords
- Topic model
- Latent Dirichlet Allocation
- Term weighting scheme
- Knowledge acquisition
Fingerprint
Dive into the research topics of 'ITWF: A framework to apply term weighting schemes in topic model'. Together they form a unique fingerprint.
Projects
- 2 Finished
- GRF: BigCredit: A Novel Framework for Big Social Media Data Enhanced Online Credit Scoring
  LAU, Y. K. R. (Principal Investigator / Project Coordinator), Li, C. (Co-Investigator) & WONG, C. S. M. (Co-Investigator)
  1/01/17 → 3/06/21
  Project: Research
- GRF: Big Data Analytics for Detecting Deceptive Product Comments in Online Social Media
  LAU, Y. K. R. (Principal Investigator / Project Coordinator) & Li, C. (Co-Investigator)
  1/01/16 → 24/06/20
  Project: Research