EHPE: Skeleton Cues-based Gaussian Coordinate Encoding for Efficient Human Pose Estimation

Hai Liu, Tingting Liu*, Yu Chen*, Zhaoli Zhang, You-Fu Li*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

118 Downloads (CityUHK Scholars)

Abstract

Human pose estimation (HPE) has many wide applications such as multimedia processing, behavior understanding and human-computer interaction. Most previous studies have encountered many constraints, such as restricted scenarios and RGB inputs. To mitigate constraints to estimating the human poses in general scenarios, we present an efficient human pose estimation model (i.e., EHPE) with joint direction cues and Gaussian coordinate encoding. Specifically, we propose an anisotropic Gaussian coordinate coding method to describe the skeleton direction cues among adjacent keypoints. To the best of our knowledge, this is the first time that the skeleton direction cues is introduced to the heatmap encoding in HPE task. Then, a multi-loss function is proposed to constrain the output to prevent the overfitting. The Kullback-Leibler divergence is introduced to measure the predication label and its ground truth one. The performance of EHPE is evaluated on two HPE datasets: MS COCO and MPII. Experimental results demonstrate that EHPE can obtain robust results, and it significantly outperforms existing state-of-the-art HPE methods. Lastly, we extend the experiments on infrared images captured by our research group. The experiments achieved the impressive results regardless of insufficient color and texture information. © 2022 The Authors.
Original languageEnglish
Pages (from-to)8464-8475
JournalIEEE Transactions on Multimedia
Volume26
Online published8 Aug 2022
DOIs
Publication statusPublished - 2024

Funding

This work was supported in part by the National Key RandD Program of China under Grant 2021YFC3340802, in part by the National Natural Science Foundation of China under Grant 62277041, Grant 62077020, Grant 62173286, Grant 62211530433, and Grant 62177018, in part by the Research Grants Council of Hong Kong under Project 9043323 and Project CityU 11213420, in part by Science and Technology Development Fund, Macau, under Grant 0022/2019/AKP, in part by Jiangxi Provincial Natural Science Foundation under Grant 20232BAB212026, in part by the National Natural Science Foundation of Hubei Province under Project 2022CFB971, in part by the University Teaching Reform Research Project of Jiangxi Province under Grant JXJG-23-27-6, and in part by Shenzhen Science and Technology Program under Grant JCYJ20230807152900001.

Research Keywords

  • Biological system modeling
  • Deep learning
  • Encoding
  • Feature extraction
  • gaussian coordinate encoding
  • Heating systems
  • human pose estimation
  • Pose estimation
  • regularization
  • Skeleton
  • skeleton direction
  • Task analysis

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'EHPE: Skeleton Cues-based Gaussian Coordinate Encoding for Efficient Human Pose Estimation'. Together they form a unique fingerprint.

Cite this