Projects per year
Abstract
In recent years, sparse representation has been
successfully utilized for video summarization (VS). However,
most of the sparse representation based VS methods characterize each video frame with global features. As a result, some
important local details could be neglected by global features,
which may compromise the performance of summarization. In
this paper, we propose to partition each video frame into a
number of patches and characterize each patch with global
features. Instead of concatenating the features of each patch
and utilizing conventional sparse representation, we formulate
the VS problem with such video frame representation as block
sparse representation by considering each video frame as a block
containing a number of patches. By taking the reconstruction
constraint into account, we devise a simultaneous version of
block-based OMP (Orthogonal Matching Pursuit) algorithm,
namely SBOMP, to solve the proposed model. The proposed
model is further extended to a neighborhood based model which
considers temporally adjacent frames as a super block. This
is one of the first sparse representation based VS methods
taking both spatial and temporal contexts into account with
blocks. Experimental results on two widely used VS datasets
have demonstrated that our proposed methods present clear
superiority over existing sparse representation based VS methods
and are highly comparable to some deep learning ones requiring
supervision information for extra model training.
Original language | English |
---|---|
Article number | 9069290 |
Pages (from-to) | 732-747 |
Journal | IEEE Transactions on Multimedia |
Volume | 23 |
Online published | 16 Apr 2020 |
DOIs | |
Publication status | Published - 2021 |
Research Keywords
- Video summarization (VS)
- Keyframe extraction
- Orthogonal Matching Pursuit (OMP)
- Sparse representation
- Block sparse representation
Fingerprint
Dive into the research topics of 'Patch Based Video Summarization With Block Sparse Representation'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Learning Based Hyperspectral Image Reconstruction and Discriminative Representation
HOU, J. (Principal Investigator / Project Coordinator)
1/01/20 → 22/12/23
Project: Research