Multi-View Action Recognition using Contrastive Learning

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

19 Scopus Citations
View graph of relations

Author(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023
PublisherInstitute of Electrical and Electronics Engineers, Inc.
Pages3370-3380
ISBN (electronic)978-1-6654-9346-8
ISBN (print)978-1-6654-9347-5
Publication statusPublished - 2023
Externally publishedYes

Publication series

NameProceedings - IEEE Winter Conference on Applications of Computer Vision, WACV
ISSN (Print)2472-6737
ISSN (electronic)2642-9381

Conference

Title23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023)
PlaceUnited States
CityWaikoloa
Period3 - 7 January 2023

Abstract

In this work, we present a method for RGB-based action recognition using multi-view videos. We present a supervised contrastive learning framework to learn a feature embedding robust to changes in viewpoint, by effectively leveraging multi-view data. We use an improved supervised contrastive loss and augment the positives with those coming from synchronized viewpoints. We also propose a new approach to use classifier probabilities to guide the selection of hard negatives in the contrastive loss, to learn a more discriminative representation. Negative samples from confusing classes based on posterior are weighted higher. We also show that our method leads to better domain generalization compared to the standard supervised training based on synthetic multi-view data. Extensive experiments on real (NTU-60, NTU-120, NUMA) and synthetic (RoCoG) data demonstrate the effectiveness of our approach. ©2023 IEEE.

Research Area(s)

  • Algorithms: Video recognition and understanding (tracking, action recognition, etc.), and algorithms (including transfer, low-shot, semi-, self-, and un-supervised learning), formulations, Machine learning architectures

Citation Format(s)

Multi-View Action Recognition using Contrastive Learning. / Shah, Ketul; Shah, Anshul; Lau, Chun Pong et al.
Proceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023. Institute of Electrical and Electronics Engineers, Inc., 2023. p. 3370-3380 (Proceedings - IEEE Winter Conference on Applications of Computer Vision, WACV).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review