Skip to main navigation Skip to search Skip to main content

Spatial-Temporal attention-based human dynamics retrospection

Minjing DONG, Chang XU*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

53 Downloads (CityUHK Scholars)

Abstract

Motivated by impressive success of deep recurrent neural networks (RNNs), sequence-To-sequence (seq2seq) architecture has been widely adapted to tackle human motion prediction. However, forecasting in longer time horizons always leads to implausible human poses or converges to mean poses. To address these challenges, we dig into the root causes and lay emphasis on two key principles. First, error can be easily accumulated on seq2seq architecture without modifications and thus RNNs cannot recover from its own mistakes in longer time horizons. Second, all the frames or joints are treated equally, whereas both of them often have different levels of importance in human motion. To mitigate this gap, we propose to retrospect human dynamics with attention. We design a retrospection module built upon seq2seq architecture to recollect previous subsequences and correct mistakes in time which enables a self-correction ability. This assists the original seq2seq architecture to eliminate error accumulation which improves significantly both short-Term and long-Term performances. Besides, we present two attention techniques to explore correlations among different joints as well as different frames in both spatial and temporal domains, which successfully capture key properties of different actions and enable our model to generate more realistic human poses. Quantitative and qualitative experiments have been both conducted to evaluate the our proposed model. Experimental results clearly demonstrate the superiority of proposed model over other baselines. © 2013 IEEE.

Original languageEnglish
Article number8788507
Pages (from-to)107300-107310
Number of pages11
JournalIEEE Access
Volume7
Online published5 Aug 2019
DOIs
Publication statusPublished - 2019
Externally publishedYes

Funding

This work was supported in part by the Australian Research Council under Project DE180101438 and in part by the FEIT Early Career Researcher Development Grant.

Research Keywords

  • attention mechanisms
  • Human motion prediction
  • predictive models
  • recurrent neural networks

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Spatial-Temporal attention-based human dynamics retrospection'. Together they form a unique fingerprint.

Cite this