Fast covariant VLAD for image search
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Article number | 7499824 |
Pages (from-to) | 1843-1854 |
Journal / Publication | IEEE Transactions on Multimedia |
Volume | 18 |
Issue number | 9 |
Publication status | Published - 1 Sept 2016 |
Link(s)
Abstract
Vector of locally aggregated descriptor (VLAD) is a popular image encoding approach for its simplicity and better scalability over conventional bag-of-visual-word approach. In order to enhance its distinctiveness and geometric invariance, covariant VLAD (CVLAD) is proposed to pool local features based on their dominant orientations/characteristic scales, which leads to a geometric-aware representation. This representation achieves rotation/scale invariance when being associated with circular matching. However, the circular matching induces several times of computation overhead, which makes CVLAD hardly suitable for large-scale retrieval tasks. In this paper, the issue of computation overhead is alleviated by performing the circular matching in CVLAD's frequency domain. In addition, by operating PCA on CVLAD in its frequency domain, much better scalability is achieved than when it is undertaken in the original feature space. Furthermore, the high-dimensional CVLAD subvectors are converted to dozens of very low-dimensional subvectors, which is possible when transforming the feature into its frequency domain. Nearest neighbor search is therefore undertaken on very low-dimensional subspaces, which becomes easily tractable. The effectiveness of our approach is demonstrated in the retrieval scenario on popular benchmarks comprising up to 1 million database images.
Research Area(s)
- Circular matching, covariant pooling, covariant vector of locally aggregated descriptor (CVLAD), similar image search
Citation Format(s)
Fast covariant VLAD for image search. / Zhao, Wan-Lei; Ngo, Chong-Wah; Wang, Hanzi.
In: IEEE Transactions on Multimedia, Vol. 18, No. 9, 7499824, 01.09.2016, p. 1843-1854.
In: IEEE Transactions on Multimedia, Vol. 18, No. 9, 7499824, 01.09.2016, p. 1843-1854.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review