Person-level Action Recognition in Complex Events via TSD-TSM Networks

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

9 Scopus Citations
View graph of relations

Author(s)

  • Yanbin Hao
  • Zi-Niu Liu
  • Hao Zhang
  • Yu-Gang Jiang

Detail(s)

Original languageEnglish
Title of host publicationMM '20 - Proceedings of the 28th ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery, Inc
Pages4699-4702
ISBN (electronic)9781450379885
Publication statusPublished - Oct 2020

Publication series

NameMM - Proceedings of the ACM International Conference on Multimedia

Conference

Title28th ACM International Conference on Multimedia (MM 2020)
LocationVirtual
PlaceUnited States
CitySeattle
Period12 - 16 October 2020

Abstract

The task of person-level action recognition in complex events aims to densely detect pedestrians and individually predict their actions from surveillance videos. In this paper, we present a simple yet efficient pipeline for this task, referred to as TSD-TSM networks. Firstly, we adopt the TSD detector for the pedestrian localization on each single keyframe. Secondly, we generate the sequential ROIs for a person proposal by replicating the adjusted bounding box coordinates around the keyframe. Particularly, we propose to conduct straddling expansion and region squaring on the original bounding box of a person proposal to widen the potential space of motion and interaction and lead to a square box for ROI detection. Finally, we adapt the TSM classifier on the generated ROI sequences to perform action classification and further adopt late fusion to promote the prediction. Our proposed pipeline achieved the 3rd place in the ACM-MM 2020 grand challenge, i.e., Large-scale Human-centric Video Analysis in Complex Events (Track-4), obtaining final 15.31% wf-mAP@avg and 20.63% f-mAP@avg on the testing set.

Research Area(s)

  • complex events, human action recognition, pedestrian detection

Bibliographic Note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Citation Format(s)

Person-level Action Recognition in Complex Events via TSD-TSM Networks. / Hao, Yanbin; Liu, Zi-Niu; Zhang, Hao et al.
MM '20 - Proceedings of the 28th ACM International Conference on Multimedia. Association for Computing Machinery, Inc, 2020. p. 4699-4702 (MM - Proceedings of the ACM International Conference on Multimedia).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review