Skip to main navigation Skip to search Skip to main content

RHPENet: Robust Head Pose Estimation by Learning Heterogeneous Relationships from Facial Region Cues in Human Robot Interaction

  • Tingting Liu
  • , Shijia Qian
  • , Hai Liu
  • , Minhong Wang
  • , Bing Yang
  • , You-Fu Li*
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Head pose estimation (HPE) techniques frequently encounter difficulties when handling extreme angles, occlusions, and uneven lighting conditions. In this paper, we present a novel heterogeneous relationship learning framework designed to mitigate these limitations by exploiting facial regions of interest (FRoI) and their complex interdependencies. Our approach stems from two fundamental discoveries: first, the critical importance of FRoI for pose determination, and second, the heterogeneous relationship between neighboring postures. The proposed architecture consists of three main modules: region feature generator (RFG), hierarchical structural representation (HSR), and cross-relation aggregator (CRA). The RFG incorporates an adaptive attention mechanism that prioritizes diagnostically significant facial zones. Within the HSR component, we implement a novel "Rugby-style"cross-level connectivity pattern to enhance feature integration. The CRA employs Transformer-based techniques to uncover both spatial and angular dependencies. Comprehensive evaluations conducted on major HPE benchmarks (300W_LP, AFLW2000, and BIWI) demonstrate that our RHPENet model consistently outperforms existing approaches. © 2025 IEEE.
Original languageEnglish
Title of host publication2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD 2025)
PublisherIEEE
Pages639-645
Number of pages7
ISBN (Electronic)979-8-3315-1936-0, 979-8-3315-1935-3
ISBN (Print)979-8-3315-1937-7
DOIs
Publication statusPublished - 2025
Event8th International Conference on Artificial Intelligence and Big Data (ICAIBD 2025) - Chengdu, China
Duration: 23 May 202526 May 2025

Publication series

NameInternational Conference on Artificial Intelligence and Big Data, ICAIBD
ISSN (Print)2769-3546
ISSN (Electronic)2769-3554

Conference

Conference8th International Conference on Artificial Intelligence and Big Data (ICAIBD 2025)
PlaceChina
CityChengdu
Period23/05/2526/05/25

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant 6247077114, Grant 62377037, Grant 62277041, Grant 62173286, Grant 62177019 and Grant 62177018; in part by the Jiangxi Provincial Natural Science Foundation under Grant 20242BAB2S107, Grant 20232BAB212026; in part by the National Natural Science Foundation of Hubei Province under Grant 2022CFB529 and Grant 2022CFB971; in part by the University Teaching Reform Research Project of Jiangxi Province under Grant JXJG-23-27- 6; and in part by the Shenzhen Science and Technology Program under Grant JCYJ20230807152900001, and Hubei Provincial Natural Science Foundation of China-Innovation and Development Joint Fund Project under Grant 2025AFD621, and Guangdong Basic and Applied Basic Research Foundation under Grant 2025A1515010266.

Research Keywords

  • Facial regions of interest
  • Head pose estimation
  • Heterogeneous relationship
  • Robot vision
  • Transformer

Fingerprint

Dive into the research topics of 'RHPENet: Robust Head Pose Estimation by Learning Heterogeneous Relationships from Facial Region Cues in Human Robot Interaction'. Together they form a unique fingerprint.

Cite this