Topic associations in document models for ad hoc information retrieval

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)Not applicablepeer-review

View graph of relations

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceeding of the 20th Pacific Asia Conference on Information Systems (PACIS 2016)
PublisherPacific Asia Conference on Information Systems
ISBN (Print)9789860491029
StatePublished - Jul 2016

Conference

Title20th Pacific Asia Conference on Information Systems (PACIS 2016)
LocationNice Prince Hotel
PlaceTaiwan
CityChiayi
Period27 June - 1 July 2016

Abstract

The integration of topic models into ad hoc retrieval has been studied by many researchers in the past decade and has achieved improved effectiveness and efficiency under the language modeling framework. However, the relationships between topics (i.e., topic associations) are rarely explored in information retrieval (IR). Topic associations can potentially improve the performance of IR systems. For example, when searching for "viral marketing", one might also be interested in seeing the results related with "word-of-mouth" or "social media". We build a topic-cluster-based document model to incorporate topic associations into the Latent Dirichlet Allocation (LDA) topic model. To discover the relationships between topics, we propose two clustering approaches based on topic co-occurrence and semantic similarity. To evaluate the performance of our proposed model, we plan to conduct a user study on ISTopic, an intelligent literature search tool for the IS community, and compare it with the query likelihood model and LDA-based model in terms of effectiveness, efficiency, and satisfaction, which are frequently adopted in user studies as measures for usability of IR systems. In addition, we plan to evaluate it on several TREC ad hoc test collections using mean average precision as the quantitative measure following the IR literature.

Research Area(s)

  • Ad hoc information retrieval, Clustering, ISTopic, LDA, Topic association

Citation Format(s)

Topic associations in document models for ad hoc information retrieval. / Xu, Ruiyun; Chen, Hailiang; Zhao, J. Leon.

Proceeding of the 20th Pacific Asia Conference on Information Systems (PACIS 2016). Pacific Asia Conference on Information Systems, 2016.

Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45)32_Refereed conference paper (with ISBN/ISSN)Not applicablepeer-review