Real-time near-duplicate elimination for web video search with content and context

Xiao Wu, Chong-Wah Ngo, Alexander G. Hauptmann, Hung-Khoon Tan

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

141 Citations (Scopus)

Abstract

With the exponential growth of social media, there exist huge numbers of near-duplicate web videos, ranging from simple formatting to complex mixture of different editing effects. In addition to the abundant video content, the social web provides rich sets of context information associated with web videos, such as thumbnail image, time duration and so on. At the same time, the popularity of Web 2.0 demands for timely response to user queries. To balance the speed and accuracy aspects, in this paper, we combine the contextual information from time duration, number of views, and thumbnail images with the content analysis derived from color and local points to achieve real-time near-duplicate elimination. The results of 24 popular queries retrieved from YouTube show that the proposed approach integrating content and context can reach real-time novelty re-ranking of web videos with extremely high efficiency, where the majority of duplicates can be rapidly detected and removed from the top rankings. The speedup of the proposed approach can reach 164 times faster than the effective hierarchical method proposed in [31], with just a slight loss of performance. © 2009 IEEE.
Original languageEnglish
Article number4757425
Pages (from-to)196-207
JournalIEEE Transactions on Multimedia
Volume11
Issue number2
DOIs
Publication statusPublished - Feb 2009

Research Keywords

  • Content
  • Context
  • Copy detection
  • Filtering
  • Near-duplicates
  • Novelty and redundancy detection
  • Similarity measure
  • Web video

Fingerprint

Dive into the research topics of 'Real-time near-duplicate elimination for web video search with content and context'. Together they form a unique fingerprint.

Cite this