Topic associations in document models for ad hoc information retrieval
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | Proceeding of the 20th Pacific Asia Conference on Information Systems (PACIS 2016) |
Publisher | Pacific Asia Conference on Information Systems |
ISBN (print) | 9789860491029 |
Publication status | Published - Jul 2016 |
Conference
Title | 20th Pacific Asia Conference on Information Systems (PACIS 2016) |
---|---|
Location | Nice Prince Hotel |
Place | Taiwan |
City | Chiayi |
Period | 27 June - 1 July 2016 |
Link(s)
Abstract
The integration of topic models into ad hoc retrieval has been studied by many researchers in the past decade and has achieved improved effectiveness and efficiency under the language modeling framework. However, the relationships between topics (i.e., topic associations) are rarely explored in information retrieval (IR). Topic associations can potentially improve the performance of IR systems. For example, when searching for "viral marketing", one might also be interested in seeing the results related with "word-of-mouth" or "social media". We build a topic-cluster-based document model to incorporate topic associations into the Latent Dirichlet Allocation (LDA) topic model. To discover the relationships between topics, we propose two clustering approaches based on topic co-occurrence and semantic similarity. To evaluate the performance of our proposed model, we plan to conduct a user study on ISTopic, an intelligent literature search tool for the IS community, and compare it with the query likelihood model and LDA-based model in terms of effectiveness, efficiency, and satisfaction, which are frequently adopted in user studies as measures for usability of IR systems. In addition, we plan to evaluate it on several TREC ad hoc test collections using mean average precision as the quantitative measure following the IR literature.
Research Area(s)
- Ad hoc information retrieval, Clustering, ISTopic, LDA, Topic association
Citation Format(s)
Topic associations in document models for ad hoc information retrieval. / Xu, Ruiyun; Chen, Hailiang; Zhao, J. Leon.
Proceeding of the 20th Pacific Asia Conference on Information Systems (PACIS 2016). Pacific Asia Conference on Information Systems, 2016.
Proceeding of the 20th Pacific Asia Conference on Information Systems (PACIS 2016). Pacific Asia Conference on Information Systems, 2016.
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review