Exploring LDA-based document model for geographic information retrieval

Zhisheng Li, Chong Wang, Xing Xie, Xufa Wang, Wei-Ying Ma

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

8 Citations (Scopus)

Abstract

Latent Dirichlet Allocation (LDA) model, a formal generative model, has been used to improve ad-hoc information retrieval recently. However, its feasibility and effectiveness for geographic information retrieval has not been explored. This paper proposes an LDA-based document model to improve geographic information retrieval by inheriting the LDA model with text retrieval model. The proposed model has been evaluated on GeoCLEF2007 collection. This is a part of the experiments of Columbus Project of Microsoft Research Asia (MSRA) in GeoCLEF2007 (a cross-language geographical retrieval track which is part of Cross Language Evaluation Forum). This is the second time we participate in this event. Since the queries in GeoCLEF2007 are similar to those in GeoCLEF2006, we leverage most of the methods that we used in GeoCLEF2006, including MSRAWhitelist, MSRAExpansion, MSRALocation and MSRAText approaches. The difference is that MSRAManual approach is not included this time, and we use MSRALDA instead. The results show that the application of LDA model in GeoCLEF monolingual English task performs stably but needs to be further explored. © 2008 Springer-Verlag Berlin Heidelberg.
Original languageEnglish
Title of host publicationAdvances in Multilingual and Multimodal Information Retrieval - 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Revised Selected Papers
PublisherSpringer Verlag
Pages842-849
Volume5152 LNCS
ISBN (Print)3540857591, 9783540857594
DOIs
Publication statusPublished - 2008
Externally publishedYes
Event8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007 - Budapest, Hungary
Duration: 19 Sept 200721 Sept 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5152 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007
Country/TerritoryHungary
CityBudapest
Period19/09/0721/09/07

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Research Keywords

  • Evaluation
  • Geographic information retrieval
  • Latent Dirichlet Allocation
  • System design

Fingerprint

Dive into the research topics of 'Exploring LDA-based document model for geographic information retrieval'. Together they form a unique fingerprint.

Cite this