Searching web documents as location sets

Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review

Abstract

A geographic search system named GeoXLS is presented, which enables users to submit a set of locations as a query object Q and to find documents containing locations similar to those in Q. Search results come from a collection of geotagged web documents, specifically a vast collection of spreadsheets obtained from the Web. The results are ranked according to their similarity to Q, using one of several user-selected similarity measures related to the Hausdorff distance. GeoXLS allows users to answer queries such as "I know the locations of n entities of type X. What sets of data contain points similar to my query points?" For example, given a set Q of known impact craters, find documents that contain locations similar to those in Q and beyond. In essence, this allows someone to "complete the set" by identifying sets containing similar locations. GeoXLS provides capabilities analogous to a standard keyword search engine, but with keywords specified geographically. In contrast to a search engine that handles only text queries, our geographic search system is capable of returning search result documents that are not exact matches to the query. For example, searching with query points in "Washington, DC", "Denver, Colorado", and "Chicago, Illinois" could return documents related to colleges with actual locations in "College Park, Maryland", "Boulder, Colorado", and "Evanston, Illinois", which are similar spatially, but not textually. GeoXLS can be useful in a wide variety of knowledge domains where the data can be represented as a collection of point sets. © 2011 Authors.
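The ranking idea in the abstract can be illustrated with a small sketch. This is not the GeoXLS implementation: the abstract says only that the similarity measures are "related to the Hausdorff distance", so the snippet assumes the textbook directed and symmetric Hausdorff distances over great-circle (haversine) distances. The function names, the in-memory `documents` dictionary, the `west_coast_cities` document, and the approximate city coordinates are all hypothetical and chosen for illustration.

```python
import math

def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (p[0], p[1], q[0], q[1]))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(a))

def directed_hausdorff(a, b):
    """Largest distance from a point in `a` to its nearest point in `b`.

    Assumes both sets are non-empty. Extra points in `b` are never penalised.
    """
    return max(min(haversine_km(p, q) for q in b) for p in a)

def hausdorff(a, b):
    """Symmetric Hausdorff distance: the larger of the two directed distances."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))

def rank_documents(query, documents, measure=directed_hausdorff):
    """Return document names ordered from most to least similar to `query`."""
    scored = [(measure(query, points), name) for name, points in documents.items()]
    return [name for _, name in sorted(scored)]

# Query points from the abstract's example (approximate coordinates, assumed).
query = [(38.9072, -77.0369),    # Washington, DC
         (39.7392, -104.9903),   # Denver, Colorado
         (41.8781, -87.6298)]    # Chicago, Illinois

# Hypothetical geotagged documents, each reduced to a set of (lat, lon) points.
documents = {
    "colleges": [(38.9897, -76.9378),    # College Park, Maryland
                 (40.0150, -105.2705),   # Boulder, Colorado
                 (42.0451, -87.6877)],   # Evanston, Illinois
    "west_coast_cities": [(47.6062, -122.3321),   # Seattle
                          (45.5152, -122.6784),   # Portland
                          (37.7749, -122.4194)],  # San Francisco
}

ranking = rank_documents(query, documents)
print(ranking)
```

With these inputs the `colleges` document ranks first, because every query point has a neighbour within a few tens of kilometres. The directed measure from the query to the document is used as the default here because it does not penalise a document for containing additional locations, which matches the "complete the set" use case; passing `measure=hausdorff` would instead require the two sets to cover each other.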
Original language: English
Title of host publication: GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems
Pages: 489-492
Publication status: Published - 2011
Externally published: Yes
Event: 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2011 - Chicago, IL, United States
Duration: 1 Nov 2011 to 4 Nov 2011

Conference

Conference: 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2011
Place: United States
City: Chicago, IL
Period: 1/11/11 to 4/11/11

Research Keywords

  • query processing
  • similarity search
  • spatial databases
