Hierarchical clustering of WWW image search results using visual, textual and link information

Deng Cai, Xiaofei He, Zhiwei Li, Wei-Ying Ma, Ji-Rong Wen

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

280 Citations (Scopus)

Abstract

We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics. Organizing the results into different semantic clusters facilitates users' browsing. In this paper, we propose a hierarchical clustering method using visual, textual and link analysis. By using a vision-based page segmentation algorithm, a web page is partitioned into blocks, and the textual and link information of an image can be accurately extracted from the block containing that image. By using block-level link analysis techniques, an image graph can be constructed. We then apply spectral techniques to find a Euclidean embedding of the images which respects the graph structure, Thus for each image, we have three kinds of representations, i.e. visual feature based representation, textual feature based representation and graph based representation. Using spectral clustering techniques, we can cluster the search results into different semantic clusters. An image search example illustrates the potential of these techniques.
Original languageEnglish
Title of host publicationACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery
Pages952-959
ISBN (Print)1581138938, 9781581138931
DOIs
Publication statusPublished - 2004
Externally publishedYes
EventACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia - New York, NY, United States
Duration: 10 Oct 200416 Oct 2004

Publication series

NameACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia

Conference

ConferenceACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
Country/TerritoryUnited States
CityNew York, NY
Period10/10/0416/10/04

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Research Keywords

  • Graph Model
  • Image Clustering
  • Link Analysis
  • Search Result Organization
  • Spectral Analysis
  • Vision Based Page Segmentation
  • Web Image Search

Fingerprint

Dive into the research topics of 'Hierarchical clustering of WWW image search results using visual, textual and link information'. Together they form a unique fingerprint.

Cite this