Clustering non-uniform-sized spatial objects to reduce I/O cost for spatial-join processing

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

11 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Pages (from-to)384-397
Journal / PublicationComputer Journal
Volume44
Issue number5
Publication statusPublished - 2001

Abstract

The cost of spatial-join processing can be very high due to the large sizes of spatial objects and the computation-intensive spatial operations. A filter-and-refine strategy is usually used to reduce the computing cost of spatial join when the number of spatial objects is large. In this paper we propose a method that aims to minimize the I/O cost at the refinement step. A graph model is introduced to formalize the I/O cost, and a matrix-based algorithm is developed to cluster objects (data) such that the objects in the same cluster are closely related. The objects in the same cluster will be brought together into the main memory for the refinement process, and the I/O cost of fetching objects into memory can, be reduced. Experiments have been conducted and the results have shown that our method can save 20-35% of I/O cost compared to the cases where no clustering or a little clustering is done.