Abstract
Motivated by impressive success of deep recurrent neural networks (RNNs), sequence-To-sequence (seq2seq) architecture has been widely adapted to tackle human motion prediction. However, forecasting in longer time horizons always leads to implausible human poses or converges to mean poses. To address these challenges, we dig into the root causes and lay emphasis on two key principles. First, error can be easily accumulated on seq2seq architecture without modifications and thus RNNs cannot recover from its own mistakes in longer time horizons. Second, all the frames or joints are treated equally, whereas both of them often have different levels of importance in human motion. To mitigate this gap, we propose to retrospect human dynamics with attention. We design a retrospection module built upon seq2seq architecture to recollect previous subsequences and correct mistakes in time which enables a self-correction ability. This assists the original seq2seq architecture to eliminate error accumulation which improves significantly both short-Term and long-Term performances. Besides, we present two attention techniques to explore correlations among different joints as well as different frames in both spatial and temporal domains, which successfully capture key properties of different actions and enable our model to generate more realistic human poses. Quantitative and qualitative experiments have been both conducted to evaluate the our proposed model. Experimental results clearly demonstrate the superiority of proposed model over other baselines. © 2013 IEEE.
| Original language | English |
|---|---|
| Article number | 8788507 |
| Pages (from-to) | 107300-107310 |
| Number of pages | 11 |
| Journal | IEEE Access |
| Volume | 7 |
| Online published | 5 Aug 2019 |
| DOIs | |
| Publication status | Published - 2019 |
| Externally published | Yes |
Funding
This work was supported in part by the Australian Research Council under Project DE180101438 and in part by the FEIT Early Career Researcher Development Grant.
Research Keywords
- attention mechanisms
- Human motion prediction
- predictive models
- recurrent neural networks
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Fingerprint
Dive into the research topics of 'Spatial-Temporal attention-based human dynamics retrospection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver