Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Detail(s)
Field | Value |
---|---|
Original language | English |
Pages (from-to) | 3257-3269 |
Journal / Publication | Pattern Recognition |
Volume | 41 |
Issue number | 10 |
Publication status | Published - Oct 2008 |
Abstract
This paper presents a unified approach to analyzing and structuring the content of videotaped lectures for distance learning applications. By structuring lecture videos, we can support topic indexing and semantic querying of multimedia documents captured in traditional classrooms. Our goal in this paper is to automatically construct cross references between lecture videos and textual documents so as to facilitate the synchronized browsing and presentation of multimedia information. The major issues involved in our approach are topical event detection, video text analysis, and the matching of slide shots to external documents. In topical event detection, a novel transition detector is proposed to rapidly locate slide shot boundaries by computing the changes of text and background regions in videos. For each detected topical event, multiple keyframes are extracted for video text detection, super-resolution reconstruction, binarization and recognition. A new approach for the reconstruction of high-resolution textboxes based on linear interpolation and multi-frame integration is also proposed for effective binarization and recognition. The recognized characters are used to match the video slide shots with external documents based on our proposed title and content similarity measures. © 2008 Elsevier Ltd. All rights reserved.
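To make the idea of reconstruction "based on linear interpolation and multi-frame integration" more concrete, the following is a minimal illustrative sketch, not the authors' implementation. The abstract does not give the actual algorithm, so the sketch assumes that the textbox crops from the keyframes are already aligned and of equal size, that interpolation is bilinear with an integer scale factor, and that integration is a plain pixel-wise average. The names `upscale_bilinear` and `integrate_frames` are hypothetical.

```python
from typing import List

# A grayscale textbox crop: rows of pixel intensities.
Image = List[List[float]]


def upscale_bilinear(img: Image, factor: int) -> Image:
    """Upscale by an integer factor using bilinear interpolation.

    Assumption: source coordinates are clamped at the last row/column,
    so border pixels are simply repeated.
    """
    h, w = len(img), len(img[0])
    out: Image = []
    for y in range(h * factor):
        # Map the output row back into source coordinates.
        sy = min(y / factor, h - 1)
        y0 = int(sy)
        y1 = min(y0 + 1, h - 1)
        fy = sy - y0
        row = []
        for x in range(w * factor):
            sx = min(x / factor, w - 1)
            x0 = int(sx)
            x1 = min(x0 + 1, w - 1)
            fx = sx - x0
            # Interpolate horizontally on the two rows, then vertically.
            top = img[y0][x0] * (1 - fx) + img[y0][x1] * fx
            bottom = img[y1][x0] * (1 - fx) + img[y1][x1] * fx
            row.append(top * (1 - fy) + bottom * fy)
        out.append(row)
    return out


def integrate_frames(frames: List[Image], factor: int = 2) -> Image:
    """Upscale every frame, then average them pixel by pixel.

    Assumption: all frames show the same textbox, are pre-aligned,
    and have identical dimensions.
    """
    if not frames:
        raise ValueError("need at least one frame")
    upscaled = [upscale_bilinear(f, factor) for f in frames]
    n = len(upscaled)
    h, w = len(upscaled[0]), len(upscaled[0][0])
    return [
        [sum(u[y][x] for u in upscaled) / n for x in range(w)]
        for y in range(h)
    ]


if __name__ == "__main__":
    # Two tiny 1x2 "textbox" crops of the same region from different keyframes.
    frames = [[[0.0, 100.0]], [[20.0, 120.0]]]
    for row in integrate_frames(frames, factor=2):
        print(row)
```

Averaging several interpolated frames tends to suppress per-frame noise before binarization, which is the general motivation for multi-frame integration. A real system would also need to register the frames and handle textboxes that change between keyframes.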
Research Area(s)
- Super-resolution reconstruction, Synchronization of lecture videos and electronic slides, Topic detection, Video text analysis
Citation Format(s)
Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis. / Wang, Feng; Ngo, Chong-Wah; Pong, Ting-Chuen.
In: Pattern Recognition, Vol. 41, No. 10, 10.2008, p. 3257-3269.