Spectral learning of latent semantics for action recognition

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

16 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings of the IEEE International Conference on Computer Vision
Pages1503-1510
Publication statusPublished - 2011

Conference

Title2011 IEEE International Conference on Computer Vision, ICCV 2011
PlaceSpain
CityBarcelona
Period6 - 13 November 2011

Abstract

This paper proposes novel spectral methods for learning latent semantics (i.e. high-level features) from a large vocabulary of abundant mid-level features (i.e. visual keywords), which can help to bridge the semantic gap in the challenging task of action recognition. To discover the manifold structure hidden among mid-level features, we develop spectral embedding approaches based on graphs and hypergraphs, without the need to tune any parameter for graph construction which is a key step of manifold learning. In particular, the traditional graphs are constructed by linear reconstruction with sparse coding. In the new embedding space, we learn high-level latent semantics automatically from abundant mid-level features through spectral clustering. The learnt latent semantics can be readily used for action recognition with SVM by defining a histogram intersection kernel. Different from the traditional latent semantic analysis based on topic models, our two spectral methods for semantic learning can discover the manifold structure hidden among mid-level features, which results in compact but discriminative high-level features. The experimental results on two standard action datasets have shown the superior performance of our spectral methods. © 2011 IEEE.

Citation Format(s)

Spectral learning of latent semantics for action recognition. / Lu, Zhiwu; Peng, Yuxin; Ip, Horace H.S.
Proceedings of the IEEE International Conference on Computer Vision. 2011. p. 1503-1510 6126408.

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review