Abstract
We propose a heterogeneous multi-task learning framework for human pose estimation from monocular images using a deep convolutional neural network. In particular, we simultaneously learn a human pose regressor and sliding-window body-part and joint-point detectors in a deep network architecture. We show that including the detection tasks helps to regularize the network, directing it to converge to a good solution. We report competitive and state-of-art results on several datasets. We also empirically show that the learned neurons in the middle layer of our network are tuned to localized body parts.
| Original language | English |
|---|---|
| Pages (from-to) | 19-36 |
| Journal | International Journal of Computer Vision |
| Volume | 113 |
| Issue number | 1 |
| Online published | 26 Sept 2014 |
| DOIs | |
| Publication status | Published - May 2015 |
Research Keywords
- Deep Learning
- Human Pose Estimation
Fingerprint
Dive into the research topics of 'Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network'. Together they form a unique fingerprint.Projects
- 2 Finished
-
GRF: A New Hierarchical EM Algorithm for Reducing Mixture Models
CHAN, A. B. (Principal Investigator / Project Coordinator) & LANCKRIET, G. (Co-Investigator)
1/01/14 → 7/06/18
Project: Research
-
GRF: Hypergraphical Models for Object Tagging
LIU, Z.-Q. (Principal Investigator / Project Coordinator)
1/11/13 → 31/08/15
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver