TY - JOUR
T1 - Real-time near-duplicate elimination for web video search with content and context
AU - Wu, Xiao
AU - Ngo, Chong-Wah
AU - Hauptmann, Alexander G.
AU - Tan, Hung-Khoon
PY - 2009
Y1 - 2009/02//
AB - With the exponential growth of social media, there are huge numbers of near-duplicate web videos, ranging from simple formatting changes to complex mixtures of different editing effects. In addition to the abundant video content, the social web provides rich context information associated with web videos, such as thumbnail images and time duration. At the same time, the popularity of Web 2.0 demands timely responses to user queries. To balance speed and accuracy, this paper combines contextual information (time duration, number of views, and thumbnail images) with content analysis derived from color and local points to achieve real-time near-duplicate elimination. Results on 24 popular queries retrieved from YouTube show that the proposed approach, integrating content and context, achieves real-time novelty re-ranking of web videos with extremely high efficiency: the majority of duplicates are rapidly detected and removed from the top rankings. The proposed approach is up to 164 times faster than the effective hierarchical method proposed in [31], with only a slight loss of performance. © 2009 IEEE.
KW - Content
KW - Context
KW - Copy detection
KW - Filtering
KW - Near-duplicates
KW - Novelty and redundancy detection
KW - Similarity measure
KW - Web video
UR - http://www.scopus.com/inward/record.url?scp=59149089043&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-59149089043&origin=recordpage
U2 - 10.1109/TMM.2008.2009673
DO - 10.1109/TMM.2008.2009673
M3 - RGC 21 - Publication in refereed journal
SN - 1520-9210
VL - 11
SP - 196
EP - 207
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 2
M1 - 4757425
ER -