Real-time near-duplicate elimination for web video search with content and context

Research output: Journal Publications and Reviews (RGC: 21, 22, 62)21_Publication in refereed journalpeer-review

127 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Article number4757425
Pages (from-to)196-207
Journal / PublicationIEEE Transactions on Multimedia
Volume11
Issue number2
Publication statusPublished - Feb 2009

Abstract

With the exponential growth of social media, there exist huge numbers of near-duplicate web videos, ranging from simple formatting to complex mixture of different editing effects. In addition to the abundant video content, the social web provides rich sets of context information associated with web videos, such as thumbnail image, time duration and so on. At the same time, the popularity of Web 2.0 demands for timely response to user queries. To balance the speed and accuracy aspects, in this paper, we combine the contextual information from time duration, number of views, and thumbnail images with the content analysis derived from color and local points to achieve real-time near-duplicate elimination. The results of 24 popular queries retrieved from YouTube show that the proposed approach integrating content and context can reach real-time novelty re-ranking of web videos with extremely high efficiency, where the majority of duplicates can be rapidly detected and removed from the top rankings. The speedup of the proposed approach can reach 164 times faster than the effective hierarchical method proposed in [31], with just a slight loss of performance. © 2009 IEEE.

Research Area(s)

  • Content, Context, Copy detection, Filtering, Near-duplicates, Novelty and redundancy detection, Similarity measure, Web video

Citation Format(s)

Real-time near-duplicate elimination for web video search with content and context. / Wu, Xiao; Ngo, Chong-Wah; Hauptmann, Alexander G.; Tan, Hung-Khoon.

In: IEEE Transactions on Multimedia, Vol. 11, No. 2, 4757425, 02.2009, p. 196-207.

Research output: Journal Publications and Reviews (RGC: 21, 22, 62)21_Publication in refereed journalpeer-review